ERIC - Search Results

Publication Date

In 2025	112
Since 2024	316
Since 2021 (last 5 years)	1124

Descriptor

Test Items	1124
Foreign Countries	498
Test Construction	299
Item Response Theory	293
Test Validity	253
Item Analysis	246
Test Reliability	228
Difficulty Level	227
Scores	177
Multiple Choice Tests	157
Computer Assisted Testing	152
Language Tests	151
English (Second Language)	140
Second Language Learning	139
Psychometrics	129
Science Tests	129
Comparative Analysis	128
Mathematics Tests	120
Undergraduate Students	117
Test Format	115
Accuracy	114
Achievement Tests	103
Elementary School Students	101
Secondary School Students	100
Correlation	93
More ▼

Publication Type

Reports - Research	1124
Journal Articles	1041
Tests/Questionnaires	61
Speeches/Meeting Papers	37
Information Analyses	8
Numerical/Quantitative Data	5
Collected Works - General	1

Education Level

Higher Education	311
Postsecondary Education	311
Secondary Education	252
Elementary Education	179
Middle Schools	116
Junior High Schools	93
High Schools	77
Intermediate Grades	60
Early Childhood Education	48
Primary Education	42
Grade 8	38
Elementary Secondary Education	33
Grade 4	32
Grade 3	26
Grade 5	26
Grade 7	17
Grade 2	15
Grade 6	15
Grade 9	10
Grade 12	9
Grade 1	8
Grade 10	8
Adult Education	6
Kindergarten	5
Preschool Education	5
More ▼

Audience

Practitioners	3
Counselors	1

Location

Turkey	77
Indonesia	39
China	35
Germany	31
Iran	23
Japan	21
United States	16
Canada	15
United Kingdom	15
Malaysia	13
South Africa	12
Australia	11
Europe	11
South Korea	11
Taiwan	11
Italy	10
Saudi Arabia	10
India	9
Netherlands	9
Nigeria	8
Thailand	8
Philippines	7
United Kingdom (England)	7
New Zealand	6
Norway	6
More ▼

Laws, Policies, & Programs

Head Start

What Works Clearinghouse Rating

Showing 1 to 15 of 1,124 results Save | Export

A Comparison of Anchor Selection Strategies for DIF Analysis

Peer reviewed

Direct link

Haeju Lee; Kyung Yong Kim – Journal of Educational Measurement, 2025

When no prior information of differential item functioning (DIF) exists for items in a test, either the rank-based or iterative purification procedure might be preferred. The rank-based purification selects anchor items based on a preliminary DIF test. For a preliminary DIF test, likelihood ratio test (LRT) based approaches (e.g.,…

Descriptors: Test Items, Equated Scores, Test Bias, Accuracy

Simultaneous Linear Equating for Scenarios with Optional Test Versions or across Multiple Alternative Anchors

Peer reviewed
PDF on ERIC

Download full text

Tom Benton – Practical Assessment, Research & Evaluation, 2025

This paper proposes an extension of linear equating that may be useful in one of two fairly common assessment scenarios. One is where different students have taken different combinations of test forms. This might occur, for example, where students have some free choice over the exam papers they take within a particular qualification. In this…

Descriptors: Equated Scores, Test Format, Test Items, Computation

Another Look at Yen's Q3: Is 0.2 an Appropriate Cut-Off?

Peer reviewed

Direct link

Kelsey Nason; Christine DeMars – Journal of Educational Measurement, 2025

This study examined the widely used threshold of 0.2 for Yen's Q3, an index for violations of local independence. Specifically, a simulation was conducted to investigate whether Q3 values were related to the magnitude of bias in estimates of reliability, item parameters, and examinee ability. Results showed that Q3 values below the typical cut-off…

Descriptors: Item Response Theory, Statistical Bias, Test Reliability, Test Items

Measuring Item Influence for Diagnostic Classification Models

Peer reviewed

Direct link

Daniel P. Jurich; Matthew J. Madison – Educational Assessment, 2023

Diagnostic classification models (DCMs) are psychometric models that provide probabilistic classifications of examinees on a set of discrete latent attributes. When analyzing or constructing assessments scored by DCMs, understanding how each item influences attribute classifications can clarify the meaning of the measured constructs, facilitate…

Descriptors: Test Items, Models, Classification, Influences

Generalizability Theory Approach to Analyzing Automated-Item Generated Test Forms

Peer reviewed

Direct link

Stella Y. Kim; Sungyeun Kim – Educational Measurement: Issues and Practice, 2025

This study presents several multivariate Generalizability theory designs for analyzing automatic item-generated (AIG) based test forms. The study used real data to illustrate the analysis procedure and discuss practical considerations. We collected the data from two groups of students, each group receiving a different form generated by AIG. A…

Descriptors: Generalizability Theory, Automation, Test Items, Students

The Frequency, Type, and Function of Visual Displays in Upper Elementary Standardized Science Tests

Peer reviewed

Direct link

Daibao Guo; Katherine Landau Wright; Lianne Josbacher; Eun Hye Son – Elementary School Journal, 2025

Limited research has explored the use of visual displays (ViDis) in science tests, making it challenging to know how these tests align with classroom instruction and what skills students need to be successful on these tests. Therefore, the current study aims to describe the use of ViDis in upper elementary grade standardized science tests. We…

Descriptors: Standardized Tests, Science Tests, Elementary Education, Science Education

Integration of Historical Data for the Analysis of Multiple Assessment Studies

Peer reviewed

Direct link

Marcoulides, Katerina M. – Measurement: Interdisciplinary Research and Perspectives, 2023

Integrative data analyses have recently been shown to be an effective tool for researchers interested in synthesizing datasets from multiple studies in order to draw statistical or substantive conclusions. The actual process of integrating the different datasets depends on the availability of some common measures or items reflecting the same…

Descriptors: Data Analysis, Synthesis, Test Items, Simulation

Item Parameter Estimation of the 2PL IRT Model with Fixed Ability Estimates: Choices of Ability Estimation Methods and Priors on Slopes

Peer reviewed
PDF on ERIC

Download full text

Jianbin Fu; TsungHan Ho; Xuan Tan – Practical Assessment, Research & Evaluation, 2025

Item parameter estimation using an item response theory (IRT) model with fixed ability estimates is useful in equating with small samples on anchor items. The current study explores the impact of three ability estimation methods (weighted likelihood estimation [WLE], maximum a posteriori [MAP], and posterior ability distribution estimation [PST])…

Descriptors: Item Response Theory, Test Items, Computation, Equated Scores

Functional Approaches for Modeling Unfolding Data

Peer reviewed

Direct link

Engelhard, George – Educational and Psychological Measurement, 2023

The purpose of this study is to introduce a functional approach for modeling unfolding response data. Functional data analysis (FDA) has been used for examining cumulative item response data, but a functional approach has not been systematically used with unfolding response processes. A brief overview of FDA is presented and illustrated within the…

Descriptors: Data Analysis, Models, Responses, Test Items

Scoring Running Records: Complexities and Affordances

Peer reviewed

Direct link

Rodgers, Emily; D'Agostino, Jerome V.; Berenbon, Rebecca; Johnson, Tracy; Winkler, Christa – Journal of Early Childhood Literacy, 2023

Running Records are thought to be an excellent formative assessment tool because they generate results that educators can use to make their teaching more responsive. Despite the technical nature of scoring Running Records and the kinds of important decisions that are attached to their analysis, few studies have investigated assessor accuracy. We…

Descriptors: Formative Evaluation, Scoring, Accuracy, Difficulty Level

Are the Steps on Likert Scales Equidistant? Responses on Visual Analog Scales Allow Estimating Their Distances

Peer reviewed

Direct link

Miguel A. García-Pérez – Educational and Psychological Measurement, 2024

A recurring question regarding Likert items is whether the discrete steps that this response format allows represent constant increments along the underlying continuum. This question appears unsolvable because Likert responses carry no direct information to this effect. Yet, any item administered in Likert format can identically be administered…

Descriptors: Likert Scales, Test Construction, Test Items, Item Analysis

EQGG: Automatic Question Group Generation

Peer reviewed

Direct link

Po-Chun Huang; Ying-Hong Chan; Ching-Yu Yang; Hung-Yuan Chen; Yao-Chung Fan – IEEE Transactions on Learning Technologies, 2024

Question generation (QG) task plays a crucial role in adaptive learning. While significant QG performance advancements are reported, the existing QG studies are still far from practical usage. One point that needs strengthening is to consider the generation of question group, which remains untouched. For forming a question group, intrafactors…

Descriptors: Automation, Test Items, Computer Assisted Testing, Test Construction

Optimal Calibration of Items for Multidimensional Achievement Tests

Peer reviewed

Direct link

Mahmood Ul Hassan; Frank Miller – Journal of Educational Measurement, 2024

Multidimensional achievement tests are recently gaining more importance in educational and psychological measurements. For example, multidimensional diagnostic tests can help students to determine which particular domain of knowledge they need to improve for better performance. To estimate the characteristics of candidate items (calibration) for…

Descriptors: Multidimensional Scaling, Achievement Tests, Test Items, Test Construction

Embedding Embedded Standard Setting: An Application of Cross-Classified Item Response Theory. CRESST Report 876

Download full text

Yun-Kyung Kim; Li Cai – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2025

This paper introduces an application of cross-classified item response theory (IRT) modeling to an assessment utilizing the embedded standard setting (ESS) method (Lewis & Cook). The cross-classified IRT model is used to treat both item and person effects as random, where the item effects are regressed on the target performance levels (target…

Descriptors: Standard Setting (Scoring), Item Response Theory, Test Items, Difficulty Level

Investigating Approaches to Controlling Item Position Effects in Computerized Adaptive Tests

Peer reviewed

Direct link

Ye Ma; Deborah J. Harris – Educational Measurement: Issues and Practice, 2025

Item position effect (IPE) refers to situations where an item performs differently when it is administered in different positions on a test. The majority of previous research studies have focused on investigating IPE under linear testing. There is a lack of IPE research under adaptive testing. In addition, the existence of IPE might violate Item…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Test Items

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 75

Educational and Psychological…	54
International Journal of…	49
Grantee Submission	34
Journal of Educational and…	32
Journal of Educational…	31
Applied Measurement in…	28
Language Testing	23
Educational Measurement:…	22
International Journal of…	22
Online Submission	22
Education and Information…	17
Journal of Psychoeducational…	17
Language Testing in Asia	17
Physical Review Physics…	16
Practical Assessment,…	15
SAGE Open	15
Large-scale Assessments in…	14
Measurement:…	14
ETS Research Report Series	13
Language Assessment Quarterly	11
Educational Assessment	10
International Journal of…	10
Pegem Journal of Education…	9
Field Methods	8
Interactive Learning…	8
More ▼

Joshua B. Gilbert	6
Luke W. Miratrix	6
Wollack, James A.	6
Lee, Won-Chan	5
Robitzsch, Alexander	5
Benjamin W. Domingue	4
Bulut, Okan	4
Chun Wang	4
Guo, Hongwen	4
Katherine S. Binder	4
Kroehne, Ulf	4
Kuan-Yu Jin	4
Scott P. Ardoin	4
Wind, Stefanie A.	4
Allan S. Cohen	3
Arikan, Serkan	3
Aryadoust, Vahid	3
Aybek, Eren Can	3
Baghaei, Purya	3
Boone, William J.	3
Braeken, Johan	3
Choe, Edison M.	3
Goldhammer, Frank	3
Gorney, Kylie	3
Guo, Wenjing	3
More ▼

Program for International…	47
Trends in International…	19
National Assessment of…	11
Test of English as a Foreign…	10
International English…	8
Measures of Academic Progress	8
ACT Assessment	7
Test of English for…	6
Program for the International…	5
Progress in International…	5
Big Five Inventory	4
Force Concept Inventory	4
Peabody Picture Vocabulary…	4
Gates MacGinitie Reading Tests	3
Social Skills Improvement…	3
Digit Span Test	2
Flesch Kincaid Grade Level…	2
Raven Progressive Matrices	2
Remote Associates Test	2
SAT (College Admission Test)	2
Woodcock Johnson Tests of…	2
Ages and Stages Questionnaires	1
Autism Diagnostic Observation…	1
Behavioral and Emotional…	1
Career Thoughts Inventory	1
More ▼