Publication Date
In 2025: 0
Since 2024: 2
Since 2021 (last 5 years): 14
Since 2016 (last 10 years): 32
Since 2006 (last 20 years): 46
Descriptor
Item Response Theory: 76
Testing Problems: 76
Test Items: 38
Test Construction: 17
Achievement Tests: 16
Computer Assisted Testing: 14
Simulation: 14
Equated Scores: 12
Educational Assessment: 11
Estimation (Mathematics): 11
Foreign Countries: 11
Author
Sinharay, Sandip: 6
Choi, Seung W.: 2
Cohen, Allan S.: 2
Debeer, Dries: 2
Forsyth, Robert A.: 2
Hambleton, Ronald K.: 2
Janssen, Rianne: 2
Kim, Dong-In: 2
Kim, Seock-Ho: 2
Lee, Yi-Hsuan: 2
Rios, Joseph A.: 2
Education Level
Higher Education: 7
Postsecondary Education: 6
Elementary Secondary Education: 5
Secondary Education: 5
Adult Education: 1
Elementary Education: 1
Grade 4: 1
Intermediate Grades: 1
Location
Denmark: 1
Germany: 1
Kentucky: 1
Latin America: 1
Poland: 1
Sweden: 1
Taiwan (Taipei): 1
Texas: 1
Turkey: 1
United Kingdom: 1
Lee, Chansoon; Qian, Hong – Educational and Psychological Measurement, 2022
Using classical test theory and item response theory, this study applied sequential procedures to a real operational item pool in variable-length computerized adaptive testing (CAT) to detect items whose security may have been compromised. Moreover, the study proposed a hybrid threshold approach to improve the detection power of the sequential…
Descriptors: Computer Assisted Testing, Adaptive Testing, Licensing Examinations (Professions), Item Response Theory
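As a rough illustration of the kind of sequential monitoring described above (not the authors' CTT/IRT statistics or their hybrid threshold), a one-sided CUSUM on model residuals can flag an item that examinees suddenly answer correctly more often than its parameters predict; all parameters, cutoffs, and data below are hypothetical.

```python
# Illustrative sketch only: CUSUM-style sequential monitoring of a CAT item
# for possible compromise. The 3PL parameters, allowance, and threshold are
# invented, not the procedure proposed in the paper.
import math

def p_3pl(theta, a, b, c):
    """3PL probability of a correct response."""
    return c + (1 - c) / (1 + math.exp(-a * (theta - b)))

def cusum_flag(responses, thetas, a, b, c, allowance=0.05, h=3.0):
    """Flag the item once cumulative positive residuals (observed minus
    expected correctness, less an allowance) exceed threshold h."""
    s = 0.0
    for x, theta in zip(responses, thetas):
        s = max(0.0, s + (x - p_3pl(theta, a, b, c) - allowance))
        if s > h:
            return True
    return False

# An item that abruptly becomes "easier" than its parameters imply:
thetas = [0.0] * 200
responses = [0] * 100 + [1] * 100   # second half suspiciously correct
print(cusum_flag(responses, thetas, a=1.2, b=0.5, c=0.2))  # -> True
```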
Wind, Stefanie A. – Educational and Psychological Measurement, 2023
Rating scale analysis techniques provide researchers with practical tools for examining the degree to which ordinal rating scales (e.g., Likert-type scales or performance assessment rating scales) function in psychometrically useful ways. When rating scales function as expected, researchers can interpret ratings in the intended direction (i.e.,…
Descriptors: Rating Scales, Testing Problems, Item Response Theory, Models
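For context, a common model in such analyses is Andrich's rating scale model (the article may work with other formulations): the probability that person n assigns rating k (of M) to item i is

$$\Pr(X_{ni}=k)=\frac{\exp\sum_{j=0}^{k}\bigl(\theta_n-\delta_i-\tau_j\bigr)}{\sum_{m=0}^{M}\exp\sum_{j=0}^{m}\bigl(\theta_n-\delta_i-\tau_j\bigr)},\qquad \tau_0\equiv 0,$$

where $\theta_n$ is the person location, $\delta_i$ the item location, and $\tau_j$ the category thresholds; "functioning as expected" typically includes ordered threshold estimates, $\tau_1<\tau_2<\cdots<\tau_M$.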
Sinharay, Sandip – Educational and Psychological Measurement, 2022
Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores and hence to incomplete data on mastery tests such as the AP and U.S. Medical Licensing examinations. Investigators are often interested in estimating the passing probabilities of examinees with incomplete data on mastery tests.…
Descriptors: Mastery Tests, Computer Assisted Testing, Probability, Test Wiseness
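One simple way to think about such a passing probability, sketched here under a Rasch model as an assumption rather than the author's estimator, is to estimate ability from the observed items and treat each missing item score as Bernoulli with its model-implied probability:

```python
# Sketch: P(pass) on a mastery test with missing item scores.
# Simplified Rasch illustration; not the method studied in the paper.
import numpy as np
from scipy.optimize import brentq

def rasch_p(theta, b):
    return 1.0 / (1.0 + np.exp(-(theta - b)))

def pass_prob(x_obs, b_obs, b_miss, cut):
    """P(number-correct >= cut) given observed responses x_obs."""
    theta = brentq(lambda t: np.sum(x_obs - rasch_p(t, b_obs)), -6, 6)
    dist = np.array([1.0])                    # P(k correct among missing)
    for p in rasch_p(theta, b_miss):          # exact convolution over items
        dist = np.convolve(dist, [1.0 - p, p])
    need = cut - int(x_obs.sum())             # correct answers still needed
    return float(dist[max(need, 0):].sum()) if need <= len(b_miss) else 0.0

b = np.linspace(-1.5, 1.5, 10)                # item difficulties
x_obs = np.array([1, 1, 1, 1, 1, 0, 0])       # 7 observed, 3 missing
print(pass_prob(x_obs, b[:7], b[7:], cut=7))
```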
James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024
In this study, a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that is appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…
Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size
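As a rough sketch of the quantity being rescaled: SIBTEST's beta-uni is a weighted between-group difference on the studied item for examinees matched on total score. The regression correction and the paper's actual rescaling are omitted below; the data, weights, and divisor are assumptions for illustration only.

```python
# Toy beta-uni plus one plausible (assumed) standardization.
import numpy as np

def beta_uni(ref_total, ref_item, foc_total, foc_item):
    levels = np.union1d(ref_total, foc_total)
    num, den = 0.0, 0.0
    for k in levels:
        r = ref_item[ref_total == k]
        f = foc_item[foc_total == k]
        if len(r) and len(f):
            w = len(r) + len(f)            # weight by examinees at level k
            num += w * (r.mean() - f.mean())
            den += w
    return num / den

rng = np.random.default_rng(1)
ref_t, foc_t = rng.integers(0, 21, 500), rng.integers(0, 21, 500)
ref_i = rng.binomial(1, 0.60, 500)         # reference group, studied item
foc_i = rng.binomial(1, 0.50, 500)         # focal group, studied item
b = beta_uni(ref_t, ref_i, foc_t, foc_i)
sd = np.concatenate([ref_i, foc_i]).std(ddof=1)
print(b, b / sd)                           # raw and "standardized" values
```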
Bramley, Tom; Crisp, Victoria – Assessment in Education: Principles, Policy & Practice, 2019
For many years, question choice has been used in some UK public examinations, with students free to choose which questions they answer from a selection (within certain parameters). Little research on question choice in examinations has been published in the UK in recent years. In this article we distinguish different scenarios in which choice…
Descriptors: Test Items, Test Construction, Difficulty Level, Foreign Countries
Rios, Joseph A.; Deng, Jiayi; Ihlenfeldt, Samuel D. – Educational Assessment, 2022
The present meta-analysis sought to quantify the average degree of aggregated test score distortion due to rapid guessing (RG). Included studies group-administered a low-stakes cognitive assessment, identified RG via response times, and reported the rate of examinees engaging in RG, the percentage of RG responses observed, and/or the degree of…
Descriptors: Guessing (Tests), Testing Problems, Scores, Item Response Theory
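The response-time identification the meta-analysis relies on is often a simple threshold rule; the sketch below uses a normative threshold (here 10% of each item's mean response time) with simulated data, both of which are illustrative assumptions rather than the included studies' specific criteria.

```python
# Flag rapid-guessing (RG) responses from response times.
import numpy as np

def rg_flags(rt, frac=0.10):
    """rt: (examinees x items) response times; True marks a suspected RG."""
    return rt < frac * rt.mean(axis=0)     # one threshold per item

rng = np.random.default_rng(7)
rt = rng.gamma(shape=4.0, scale=8.0, size=(1000, 40))   # effortful responses
mask = rng.random(rt.shape) < 0.05                      # inject 5% guesses
rt[mask] = rng.uniform(0.5, 2.0, size=mask.sum())

flags = rg_flags(rt)
print("RG response rate:", flags.mean())
print("Share of examinees with any RG:", flags.any(axis=1).mean())
```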
von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023
Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…
Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit
Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024
In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…
Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment
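To see why the treatment of missing responses matters for estimation, the sketch below contrasts two common options under a Rasch model: ignoring skipped items versus scoring them incorrect. The data are simulated, and operational ILSA rules (e.g., for not-reached items) are more elaborate.

```python
# Two treatments of skipped items and their effect on the ability estimate.
import numpy as np
from scipy.optimize import brentq

def mle_theta(x, b):
    """Rasch maximum-likelihood ability for responses x on difficulties b."""
    f = lambda t: np.sum(x - 1.0 / (1.0 + np.exp(-(t - b))))
    return brentq(f, -6, 6)

b = np.linspace(-2, 2, 20)                  # item difficulties
x = np.array([1]*10 + [0]*5 + [-1]*5)       # -1 marks skipped items

obs = x >= 0
theta_ignore = mle_theta(x[obs], b[obs])               # skips ignored
theta_incorrect = mle_theta(np.where(x < 0, 0, x), b)  # skips scored 0
print(theta_ignore, theta_incorrect)  # scoring skips 0 lowers the estimate
```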
Benton, Tom; Williamson, Joanna – Research Matters, 2022
Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…
Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
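In the change-detection framing the abstract invokes, an item's parameter is assumed stable until an unknown change point, and monitoring seeks a stopping rule that reacts quickly while controlling false alarms. One generic formulation (notation assumed here, not taken from the article) is

$$b_t=\begin{cases} b_0, & t<\tau,\\ b_0+\delta, & t\ge\tau, \end{cases}\qquad T=\inf\{\,t : S_t\ge h\,\},$$

where $b_t$ is the parameter at administration $t$, $\tau$ the unknown change point, $S_t$ a monitoring statistic computed from response data up to $t$, and $h$ a threshold trading detection delay against the false-alarm rate.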
Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022
Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…
Descriptors: College Students, Student Evaluation, Tests, Test Items
Uysal, Ibrahim; Sahin-Kürsad, Merve; Kiliç, Abdullah Faruk – Participatory Educational Research, 2022
The aim of the study was to examine whether the common items in mixed-format tests (e.g., multiple-choice and essay items) contain parameter drift in test equating processes performed with the common-item nonequivalent groups design. In this study, which was carried out using a Monte Carlo simulation with a fully crossed design, the factors of test…
Descriptors: Test Items, Test Format, Item Response Theory, Equated Scores
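One widely used screen for drifting common items, shown here only as a generic illustration (the study's simulation design and criteria may differ), is a robust-z rule on the difference in difficulty estimates between administrations:

```python
# Flag anchor items whose difficulty shifted atypically between calibrations.
import numpy as np

def robust_z(b_old, b_new, cutoff=2.7):
    d = b_new - b_old
    med = np.median(d)
    mad = 1.4826 * np.median(np.abs(d - med))  # robust SD estimate
    return np.abs((d - med) / mad) > cutoff    # True = suspected drift

b_old = np.array([-1.2, -0.5, 0.0, 0.4, 0.9, 1.5])
b_new = np.array([-1.1, -0.6, 0.1, 0.3, 1.9, 1.4])  # 5th item drifted
print(robust_z(b_old, b_new))   # flags only the drifted item
```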
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2019
Lord's (1980) equity theorem claims that observed-score equating is possible only when two test forms are perfectly reliable or strictly parallel. An analysis of its proof reveals the use of an incorrect statistical assumption. The assumption does not invalidate the theorem itself, though, which can be shown to follow directly from the discrete nature of…
Descriptors: Equated Scores, Testing Problems, Item Response Theory, Evaluation Methods
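Lord's equity criterion behind the theorem is commonly stated as requiring the equated score to be conditionally exchangeable with the target form's score:

$$F_{\varphi(X)\mid\theta}(s)=F_{Y\mid\theta}(s)\qquad\text{for all }\theta\text{ and }s,$$

where $X$ and $Y$ are observed scores on the two forms, $\varphi$ is the equating transformation, and $\theta$ the latent ability; the theorem asserts this can hold exactly only for perfectly reliable or strictly parallel forms.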
Abbakumov, Dmitry; Desmet, Piet; Van den Noortgate, Wim – Applied Measurement in Education, 2020
Formative assessments are an important component of massive open online courses (MOOCs), online courses with open access and unlimited student participation. Accurate conclusions about students' proficiency based on formative assessments, however, face several challenges: (a) students are typically allowed to make several attempts; and (b) student performance might…
Descriptors: Item Response Theory, Formative Evaluation, Online Courses, Response Style (Tests)
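One common way to accommodate repeated attempts in an IRT framework, sketched here as an assumption rather than the authors' exact specification, is an explanatory extension of the Rasch model with an attempt covariate:

$$\Pr\bigl(X_{pit}=1\mid\theta_p\bigr)=\frac{\exp\bigl(\theta_p-\beta_i+\delta\,(t-1)\bigr)}{1+\exp\bigl(\theta_p-\beta_i+\delta\,(t-1)\bigr)},$$

where $\theta_p$ is the proficiency of student $p$, $\beta_i$ the difficulty of item $i$, $t$ the attempt number, and $\delta$ the change in log-odds of success per additional attempt.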
Robitzsch, Alexander; Lüdtke, Oliver – Large-scale Assessments in Education, 2023
One major aim of international large-scale assessments (ILSA) like PISA is to monitor changes in student performance over time. To accomplish this task, a set of common items (i.e., link items) is repeatedly administered in each assessment. Linking methods based on item response theory (IRT) models are used to align the results from the different…
Descriptors: Educational Trends, Trend Analysis, International Assessment, Achievement Tests
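For intuition about IRT-based trend linking, here is a minimal Rasch mean-mean example with invented numbers; operational linking in ILSAs such as PISA is considerably more involved.

```python
# Mean-mean linking of two calibrations via common (link) items.
# In the Rasch case the linking transformation is a simple shift.
import numpy as np

b_cycle1 = np.array([-0.8, -0.2, 0.3, 1.1])  # link items, first cycle
b_cycle2 = np.array([-0.5, 0.1, 0.6, 1.4])   # same items, later cycle

shift = b_cycle1.mean() - b_cycle2.mean()    # mean-mean linking constant
b_cycle2_linked = b_cycle2 + shift           # later items on the old scale
theta_linked = 0.10 + shift                  # a later-cycle ability, linked
print(shift, b_cycle2_linked, theta_linked)
```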