Publication Date
In 2025: 1
Since 2024: 5
Since 2021 (last 5 years): 27
Since 2016 (last 10 years): 57
Since 2006 (last 20 years): 119
Descriptor
Item Response Theory: 145
Scores: 145
Computer Assisted Testing: 73
Test Items: 53
Adaptive Testing: 32
Scoring: 32
Comparative Analysis: 31
Testing: 31
Psychometrics: 27
Test Reliability: 25
Test Validity: 24
Author
Sinharay, Sandip: 5
Wise, Steven L.: 4
Foorman, Barbara R.: 3
Meijer, Rob R.: 3
Petscher, Yaacov: 3
Sykes, Robert C.: 3
Zwick, Rebecca: 3
Andrich, David: 2
Aryadoust, Vahid: 2
Brown, Anna: 2
Capar, Nilufer K.: 2
Education Level
Higher Education: 22
Postsecondary Education: 17
Elementary Education: 15
Secondary Education: 15
High Schools: 10
Junior High Schools: 10
Middle Schools: 10
Grade 6: 9
Grade 8: 9
Grade 4: 8
Grade 7: 8
Audience
Practitioners: 1
Researchers: 1
Students: 1
Location
China: 3
Germany: 3
Indonesia: 3
New Zealand: 3
Taiwan: 3
United States: 3
Australia: 2
Canada: 2
Finland: 2
Florida: 2
India: 2
Kreitchmann, Rodrigo S.; Sorrel, Miguel A.; Abad, Francisco J. – Educational and Psychological Measurement, 2023
Multidimensional forced-choice (FC) questionnaires have been consistently found to reduce the effects of socially desirable responding and faking in noncognitive assessments. Although FC has been considered problematic for providing ipsative scores under the classical test theory, item response theory (IRT) models enable the estimation of…
Descriptors: Measurement Techniques, Questionnaires, Social Desirability, Adaptive Testing
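A small illustration of why classical scoring of forced-choice blocks yields ipsative scores, as noted in the abstract above (the scale names and responses below are invented): because every block forces a choice among the same set of scales, each respondent's scale totals sum to the same constant, so the scores only support within-person comparisons.

# Illustrative only: classical count-based scoring of a fully forced-choice form.
# Each block asks which of three statements (one per scale A, B, C) is "most like me";
# the chosen scale earns 1 point.
responses = [
    {"A": 1, "B": 0, "C": 0},  # block 1: respondent endorsed the scale-A statement
    {"A": 0, "B": 1, "C": 0},  # block 2
    {"A": 0, "B": 1, "C": 0},  # block 3
    {"A": 0, "B": 0, "C": 1},  # block 4
]
totals = {s: sum(block[s] for block in responses) for s in ("A", "B", "C")}
print(totals)                 # {'A': 1, 'B': 2, 'C': 1}
print(sum(totals.values()))   # always equals the number of blocks (4): ipsative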
Lin, Yin; Brown, Anna; Williams, Paul – Educational and Psychological Measurement, 2023
Several forced-choice (FC) computerized adaptive tests (CATs) have emerged in the field of organizational psychology, all of them employing ideal-point items. However, although most items developed historically follow dominance response models, research on FC CAT using dominance items is limited. Existing research is heavily dominated by…
Descriptors: Measurement Techniques, Computer Assisted Testing, Adaptive Testing, Industrial Psychology
Wyse, Adam E.; McBride, James R. – Measurement: Interdisciplinary Research and Perspectives, 2022
A common practical challenge is how to assign ability estimates to all incorrect and all correct response patterns when using item response theory (IRT) models and maximum likelihood estimation (MLE) since ability estimates for these types of responses equal −∞ or +∞. This article uses a simulation study and data from an operational K-12…
Descriptors: Scores, Adaptive Testing, Computer Assisted Testing, Test Length
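A minimal sketch of why the MLE ability estimates mentioned above are infinite for all-correct patterns (the item parameters below are invented and a 2PL model is assumed purely for illustration): the log-likelihood keeps rising as theta increases, so no finite maximum exists; the all-incorrect case mirrors this toward negative infinity.

import math

# Hypothetical 2PL items: (discrimination a, difficulty b); illustrative values only.
items = [(1.0, -1.0), (1.2, 0.0), (0.8, 1.0)]

def p_correct(theta, a, b):
    # 2PL probability of a correct response at ability theta.
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def loglik_all_correct(theta):
    # Log-likelihood of an all-correct response pattern at ability theta.
    return sum(math.log(p_correct(theta, a, b)) for a, b in items)

for theta in (0.0, 2.0, 4.0, 8.0):
    print(theta, round(loglik_all_correct(theta), 4))
# The printed log-likelihood increases monotonically with theta, so the MLE diverges to +infinity.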
Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2023
Though a substantial amount of research exists on imputing missing scores in educational assessments, there is little research on cases where responses or scores to an item are missing for all test takers. In this paper, we tackled the problem of imputing missing scores for tests for which the responses to an item are missing for all test takers.…
Descriptors: Scores, Test Items, Accuracy, Psychometrics
James Soland – Journal of Research on Educational Effectiveness, 2024
When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…
Descriptors: Item Response Theory, Testing, Test Validity, Intervention
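As a worked example of the difference-in-difference comparison described above (all numbers hypothetical): the estimate is the treated group's pre-to-post change minus the comparison group's pre-to-post change, interpreted causally under a parallel-trends assumption.

# Hypothetical group mean outcomes before and after the intervention.
treated_pre, treated_post = 50.0, 58.0
control_pre, control_post = 49.0, 52.0

# Difference-in-differences: treated change minus control change.
did = (treated_post - treated_pre) - (control_post - control_pre)
print(did)  # 8.0 - 3.0 = 5.0 points attributed to treatment under parallel trends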
Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025
Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…
Descriptors: Value Added Models, Tests, Testing, Scoring
Sarah Alahmadi; Christine E. DeMars – Applied Measurement in Education, 2024
Large-scale educational assessments are sometimes considered low-stakes, increasing the possibility of confounding true performance level with low motivation. These concerns are amplified in remote testing conditions. To remove the effects of low effort levels in responses observed in remote low-stakes testing, several motivation filtering methods…
Descriptors: Multiple Choice Tests, Item Response Theory, College Students, Scores
Uto, Masaki; Aomi, Itsuki; Tsutsumi, Emiko; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2023
In automated essay scoring (AES), essays are automatically graded without human raters. Many AES models based on various manually designed features or various architectures of deep neural networks (DNNs) have been proposed over the past few decades. Each AES model has unique advantages and characteristics. Therefore, rather than using a single-AES…
Descriptors: Prediction, Scores, Computer Assisted Testing, Scoring
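A minimal sketch of the ensemble idea the abstract gestures toward: combine scores from several AES models rather than relying on any single one. The model names, predictions, and weights below are invented, and the paper's own combination method is not reproduced here; a real system would estimate the weights rather than fix them by hand.

# Hypothetical predicted scores for one essay from three different AES models.
predictions = {"feature_based": 3.0, "cnn": 4.0, "transformer": 4.0}

# Illustrative fixed weights; in practice these would be learned, not hand-set.
weights = {"feature_based": 0.2, "cnn": 0.3, "transformer": 0.5}

ensemble_score = sum(weights[m] * predictions[m] for m in predictions)
print(round(ensemble_score, 2))  # 3.8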
Ranger, Jochen; Brauer, Kay – Journal of Educational and Behavioral Statistics, 2022
The generalized S-X² test is a test of item fit for items with a polytomous response format. The test is based on a comparison of the observed and expected number of responses in strata defined by the test score. In this article, we make four contributions. We demonstrate that the performance of the generalized S-X² test…
Descriptors: Goodness of Fit, Test Items, Statistical Analysis, Item Response Theory
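For orientation, a minimal sketch of the dichotomous version of this item-fit statistic (all group counts and proportions below are invented): within each summed-score stratum, the observed proportion correct is compared with the model-expected proportion, and the squared, standardized differences are accumulated over strata. The generalized test referenced above extends this idea to polytomous response categories.

# Hypothetical summed-score groups for one dichotomous item: group size N_k,
# observed proportion correct O_k, and model-expected proportion E_k (all invented).
groups = [
    (40, 0.30, 0.28),
    (55, 0.45, 0.50),
    (60, 0.66, 0.63),
    (35, 0.80, 0.82),
]

s_x2 = sum(n * (o - e) ** 2 / (e * (1 - e)) for n, o, e in groups)
print(round(s_x2, 3))
# The statistic is referred to a chi-square reference distribution; large values indicate misfit.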
Rios, Joseph A.; Deng, Jiayi; Ihlenfeldt, Samuel D. – Educational Assessment, 2022
The present meta-analysis sought to quantify the average degree of aggregated test score distortion due to rapid guessing (RG). Included studies group-administered a low-stakes cognitive assessment, identified RG via response times, and reported the rate of examinees engaging in RG, the percentage of RG responses observed, and/or the degree of…
Descriptors: Guessing (Tests), Testing Problems, Scores, Item Response Theory
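A minimal sketch of the response-time flagging idea used to identify RG (the fixed 3-second cutoff is purely illustrative; operational studies typically derive item-specific thresholds from response-time distributions):

# Hypothetical per-item response times in seconds for one examinee.
response_times = [45.0, 2.1, 38.0, 1.5, 52.0]
RG_THRESHOLD = 3.0  # illustrative fixed cutoff; real thresholds are usually item-specific

rapid_guesses = [t < RG_THRESHOLD for t in response_times]
rg_rate = sum(rapid_guesses) / len(response_times)
print(rg_rate)  # 0.4 -> 40% of this examinee's responses flagged as rapid guesses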
Markus T. Jansen; Ralf Schulze – Educational and Psychological Measurement, 2024
Thurstonian forced-choice modeling is considered to be a powerful new tool to estimate item and person parameters while simultaneously testing the model fit. This assessment approach is associated with the aim of reducing faking and other response tendencies that plague traditional self-report trait assessments. As a result of major recent…
Descriptors: Factor Analysis, Models, Item Analysis, Evaluation Methods
Casabianca, Jodi M.; Donoghue, John R.; Shin, Hyo Jeong; Chao, Szu-Fu; Choi, Ikkyu – Journal of Educational Measurement, 2023
Using item-response theory to model rater effects provides an alternative solution for rater monitoring and diagnosis, compared to using standard performance metrics. In order to fit such models, the ratings data must be sufficiently connected in order to estimate rater effects. Due to popular rating designs used in large-scale testing scenarios,…
Descriptors: Item Response Theory, Alternative Assessment, Evaluators, Research Problems
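A minimal sketch of the connectedness requirement mentioned above: view raters and responses as nodes of a bipartite graph with an edge per rating, and check whether the design forms a single connected component (raters in separate components cannot be placed on a common severity scale). The rating design below is invented.

from collections import defaultdict, deque

# Hypothetical rating design: (rater, response) pairs; illustrative only.
ratings = [("r1", "e1"), ("r1", "e2"), ("r2", "e2"), ("r2", "e3"), ("r3", "e4")]

graph = defaultdict(set)
for rater, essay in ratings:
    graph[("rater", rater)].add(("essay", essay))
    graph[("essay", essay)].add(("rater", rater))

# Breadth-first search from an arbitrary node to find its connected component.
start = next(iter(graph))
seen, queue = {start}, deque([start])
while queue:
    node = queue.popleft()
    for neighbor in graph[node]:
        if neighbor not in seen:
            seen.add(neighbor)
            queue.append(neighbor)

print(len(seen) == len(graph))  # False: rater r3 and essay e4 are disconnected from the rest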
Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024
In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…
Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment
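A small numeric illustration (data invented) of why the treatment of missing responses matters for item difficulty estimation: scoring omitted responses as incorrect versus excluding them yields different proportion-correct values, and analogous shifts occur for IRT difficulty parameters.

# Hypothetical responses to one item: 1 = correct, 0 = incorrect, None = omitted/not reached.
responses = [1, 1, 0, None, None, 1, 0, None, 1, 1]

answered = [r for r in responses if r is not None]
p_omits_as_wrong = sum(answered) / len(responses)   # omits scored as incorrect
p_omits_excluded = sum(answered) / len(answered)    # omits treated as not administered

print(round(p_omits_as_wrong, 3))   # 0.5
print(round(p_omits_excluded, 3))   # 0.714 -> the item looks easier when omits are excluded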
Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022
When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…
Descriptors: Item Response Theory, Test Construction, Scoring, Testing
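A minimal sketch of the Time A/Time B comparison described above: rescore a sample of Time A papers with Time B raters and compute the proportion of exact score agreement (all scores below are invented; the article's proposed alternative to this simple proportion is not reproduced here).

# Hypothetical scores for the same sampled papers: original Time A scores vs. Time B rescores.
time_a = [3, 2, 4, 1, 3, 2, 4, 3]
time_b = [3, 2, 3, 1, 3, 1, 4, 3]

exact_agreement = sum(a == b for a, b in zip(time_a, time_b)) / len(time_a)
print(exact_agreement)  # 0.75 -> lower agreement with Time A may signal rater drift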
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
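A minimal sketch of sequential monitoring in the spirit described above; this is a generic one-sided CUSUM on a reused item's proportion correct across administrations, not the authors' procedure, and all numbers are invented.

# Hypothetical proportion correct for one reused item across successive administrations.
p_correct = [0.62, 0.60, 0.63, 0.61, 0.72, 0.74, 0.75]
baseline, slack, threshold = 0.62, 0.02, 0.10  # illustrative CUSUM settings

cusum = 0.0
for t, p in enumerate(p_correct, start=1):
    cusum = max(0.0, cusum + (p - baseline - slack))  # accumulate upward drift beyond the slack
    if cusum > threshold:
        print(f"Possible abrupt change flagged at administration {t}")
        break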