ERIC - Search Results

Publication Date

In 2025	12
Since 2024	46
Since 2021 (last 5 years)	215
Since 2016 (last 10 years)	452
Since 2006 (last 20 years)	782

Descriptor

Scores	1127
Test Items	1127
Foreign Countries	285
Item Response Theory	272
Test Construction	225
Difficulty Level	209
Item Analysis	204
Comparative Analysis	196
Test Validity	179
Correlation	169
Test Reliability	169
Statistical Analysis	153
Multiple Choice Tests	150
Test Format	145
Psychometrics	137
Language Tests	131
Mathematics Tests	131
Computer Assisted Testing	130
Second Language Learning	126
English (Second Language)	123
Test Bias	117
Achievement Tests	108
College Students	88
Higher Education	87
Models	85
More ▼

Education Level

Higher Education	221
Postsecondary Education	179
Secondary Education	140
Elementary Education	101
High Schools	68
Middle Schools	61
Grade 8	42
Junior High Schools	41
Elementary Secondary Education	36
Grade 4	30
Intermediate Grades	30
Grade 5	21
Grade 6	18
Grade 7	18
Early Childhood Education	16
Grade 9	15
Grade 3	14
Primary Education	12
Grade 11	10
Grade 12	10
Kindergarten	5
Preschool Education	5
Adult Education	4
Grade 10	4
Grade 1	3
More ▼

Audience

Researchers	23
Practitioners	16
Teachers	11
Administrators	3
Community	2
Policymakers	2
Counselors	1
Parents	1
Students	1

Location

Canada	28
Turkey	26
Japan	18
Iran	17
United States	16
Australia	15
China	11
Germany	10
United Kingdom	10
United Kingdom (England)	10
Taiwan	9
Indonesia	8
Massachusetts	8
New York	8
South Korea	8
California	7
Netherlands	7
Finland	6
Florida	6
Israel	6
Ohio	6
Oklahoma	5
Pennsylvania	5
Sweden	5
Texas	5
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	5
Race to the Top	2
Comprehensive Education…	1
Head Start	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1
Does not meet standards	1

Showing 1 to 15 of 1,127 results Save | Export

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Direct link

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

Seeking the Real Reliability: Why the Traditional Estimators of Reliability Usually Fail in Achievement Testing and Why the Deflation-Corrected Coefficients Could Be Better Options

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023

Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…

Descriptors: Test Reliability, Achievement Tests, Computation, Test Items

Added Value of Subscores for Tests with Polytomous Items

Peer reviewed

Direct link

Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025

Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…

Descriptors: Scores, Test Theory, Test Items, Testing

Analysis of Mixed-Format Assessments Using Measurement Models and Topic Modeling

Peer reviewed

Direct link

Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025

It is common to find mixed-format data results from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring, and the use of suitable measurement models to estimate latent abilities. Past research in educational…

Descriptors: Responses, Test Items, Test Format, Grade 8

Under the Weather? The Effects of Temperature on Student Test Performance. EdWorkingPaper No. 24-910

Download full text

Deven Carlson; Adam Shepardson – Annenberg Institute for School Reform at Brown University, 2024

As students are exposed to extreme temperatures with ever-increasing frequency, it is important to understand how such exposure affects student learning. In this paper we draw upon detailed student achievement data, combined with high-resolution weather records, to paint a clear portrait of the effect of temperature on student learning across a…

Descriptors: Weather, Climate, Heat, Academic Achievement

Using Item Scores and Distractors in Person-Fit Assessment

Peer reviewed

Direct link

Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2023

In order to detect a wide range of aberrant behaviors, it can be useful to incorporate information beyond the dichotomous item scores. In this paper, we extend the l[subscript z] and l*[subscript z] person-fit statistics so that unusual behavior in item scores and unusual behavior in item distractors can be used as indicators of aberrance. Through…

Descriptors: Test Items, Scores, Goodness of Fit, Statistics

Examination of the Aggregate Scoring Method in a Judgment Concordance Test

Peer reviewed
PDF on ERIC

Download full text

Deschênes, Marie-France; Dionne, Éric; Dorion, Michelle; Grondin, Julie – Practical Assessment, Research & Evaluation, 2023

The use of the aggregate scoring method for scoring concordance tests requires the weighting of test items to be derived from the performance of a group of experts who take the test under the same conditions as the examinees. However, the average score of experts constituting the reference panel remains a critical issue in the use of these tests.…

Descriptors: Scoring, Tests, Evaluation Methods, Test Items

Methods for Imputing Scores When All Responses Are Missing for One or More Polytomous Items: Accuracy and Impact on Psychometric Property. Research Report. ETS RR-23-07

Peer reviewed
PDF on ERIC

Download full text

Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2023

Though a substantial amount of research exists on imputing missing scores in educational assessments, there is little research on cases where responses or scores to an item are missing for all test takers. In this paper, we tackled the problem of imputing missing scores for tests for which the responses to an item are missing for all test takers.…

Descriptors: Scores, Test Items, Accuracy, Psychometrics

Student-Group Item Parameter Drift Impact on Individual and Aggregate Observed Scores

Direct link

Hess, Jessica – ProQuest LLC, 2023

This study was conducted to further research into the impact of student-group item parameter drift (SIPD) --referred to as subpopulation item parameter drift in previous research-- on ability estimates and proficiency classification accuracy when occurring in the discrimination parameter of a 2-PL item response theory (IRT) model. Using Monte…

Descriptors: Test Items, Groups, Ability, Item Response Theory

Modeling Directional Testlet Effects on Multiple Open-Ended Questions

Peer reviewed

Direct link

Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025

Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…

Descriptors: Models, Test Items, Educational Assessment, Scores

Constructing a Robust Score Scale from IRT Scores with Informed Boundaries

Peer reviewed

Direct link

Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022

In operational testing, item response theory (IRT) models for dichotomous responses are popular for measuring a single latent construct [theta], such as cognitive ability in a content domain. Estimates of [theta], also called IRT scores or [theta hat], can be computed using estimators based on the likelihood function, such as maximum likelihood…

Descriptors: Scores, Item Response Theory, Test Items, Test Format

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

Latent Variable Forests for Latent Variable Score Estimation

Peer reviewed

Direct link

Franz Classe; Christoph Kern – Educational and Psychological Measurement, 2024

We develop a "latent variable forest" (LV Forest) algorithm for the estimation of latent variable scores with one or more latent variables. LV Forest estimates unbiased latent variable scores based on "confirmatory factor analysis" (CFA) models with ordinal and/or numerical response variables. Through parametric model…

Descriptors: Algorithms, Item Response Theory, Artificial Intelligence, Factor Analysis

Practical Considerations in Item Calibration with Small Samples under Multistage Test Design: A Case Study. Research Report. ETS RR-24-03

Peer reviewed
PDF on ERIC

Download full text

Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixong Gu – ETS Research Report Series, 2024

The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…

Descriptors: Test Items, Test Construction, Sample Size, Scaling

Parameters and Models of Item Response Theory (IRT): A Review of Literature

Peer reviewed

Direct link

Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023

Introduction: Item response theory (IRT) has received much attention in validation of assessment instrument because it allows the estimation of students' ability from any set of the items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…

Descriptors: Item Response Theory, Models, Test Items, Difficulty Level

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 76

Educational and Psychological…	50
ProQuest LLC	47
Journal of Educational…	43
ETS Research Report Series	38
Applied Measurement in…	36
Educational Measurement:…	25
Language Testing	23
Applied Psychological…	21
International Journal of…	19
Educational Assessment	18
Online Submission	16
Journal of Educational and…	15
Grantee Submission	13
Practical Assessment,…	13
Language Assessment Quarterly	12
College Board	11
Measurement:…	9
International Journal of…	8
Physical Review Physics…	8
Psychometrika	8
Assessment & Evaluation in…	7
College Entrance Examination…	7
Journal of Experimental…	7
Education and Information…	6
Educational Testing Service	6
More ▼

Meijer, Rob R.	12
Sijtsma, Klaas	10
Haberman, Shelby J.	9
Sinharay, Sandip	9
Dorans, Neil J.	8
Bridgeman, Brent	7
Lee, Yi-Hsuan	7
Liu, Ou Lydia	7
Sireci, Stephen G.	6
Sykes, Robert C.	6
Engelhard, George, Jr.	5
Hambleton, Ronald K.	5
Livingston, Samuel A.	5
Thompson, Bruce	5
Wainer, Howard	5
Wise, Steven L.	5
Ackerman, Terry	4
Baghaei, Purya	4
Bennett, Randy Elliot	4
Bulut, Okan	4
Cawthon, Stephanie W.	4
Clauser, Brian E.	4
Cohen, Allan S.	4
Dimitrov, Dimiter M.	4
More ▼

Journal Articles	798
Reports - Research	781
Reports - Evaluative	192
Speeches/Meeting Papers	114
Reports - Descriptive	57
Tests/Questionnaires	54
Dissertations/Theses -…	48
Numerical/Quantitative Data	32
Guides - Non-Classroom	18
Information Analyses	17
Opinion Papers	15
Collected Works - Proceedings	3
Guides - General	3
Non-Print Media	3
Reference Materials - General	3
Books	2
Collected Works - General	2
Collected Works - Serials	2
Guides - Classroom - Teacher	2
Dissertations/Theses -…	1
Guides - Classroom - Learner	1
Legal/Legislative/Regulatory…	1
Reports - General	1
More ▼

Program for International…	34
SAT (College Admission Test)	33
Test of English as a Foreign…	28
National Assessment of…	22
Trends in International…	18
ACT Assessment	15
Graduate Record Examinations	15
Peabody Picture Vocabulary…	8
Test of English for…	8
Progress in International…	7
Advanced Placement…	6
Raven Progressive Matrices	6
International English…	4
Iowa Tests of Basic Skills	4
Measures of Academic Progress	4
Gates MacGinitie Reading Tests	3
General Educational…	3
Stanford Achievement Tests	3
Beck Depression Inventory	2
California Achievement Tests	2
Center for Epidemiologic…	2
Flesch Kincaid Grade Level…	2
Graduate Management Admission…	2
Minnesota Multiphasic…	2
Myers Briggs Type Indicator	2
More ▼