Search results: Journal of Educational Measurement (56 journal articles)
Madison, Matthew J.; Wind, Stefanie A.; Maas, Lientje; Yamaguchi, Kazuhiro; Haab, Sergio – Journal of Educational Measurement, 2024
Diagnostic classification models (DCMs) are psychometric models designed to classify examinees according to their proficiency or nonproficiency on specified latent characteristics. These models are well suited to providing diagnostic, actionable feedback in support of interim and formative assessment efforts. Several DCMs have been developed…
Descriptors: Diagnostic Tests, Classification, Models, Psychometrics
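To make the classification idea concrete, here is a minimal sketch, with an invented two-attribute setup and invented item parameters (none of it from the article), of how a DCM assigns an examinee to a latent attribute profile by Bayes' rule:

    import numpy as np

    # Class-conditional probabilities of a correct response: one row per
    # attribute profile (2 attributes -> 4 profiles), one column per item.
    # All numbers are hypothetical.
    P = np.array([
        [0.2, 0.2, 0.2],   # profile (0, 0)
        [0.8, 0.2, 0.5],   # profile (1, 0)
        [0.2, 0.8, 0.5],   # profile (0, 1)
        [0.8, 0.8, 0.9],   # profile (1, 1)
    ])
    prior = np.full(4, 0.25)     # uniform prior over the four profiles
    x = np.array([1, 0, 1])      # one examinee's item responses

    # Likelihood of the response vector under each profile, then the posterior.
    likelihood = (P ** x * (1 - P) ** (1 - x)).prod(axis=1)
    posterior = prior * likelihood / (prior * likelihood).sum()
    print(posterior.round(3))    # classify into the highest-posterior profile

The diagnostic value mentioned in the abstract comes from reporting the profile itself, that is, which attributes are mastered, rather than a single score.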
Jin, Kuan-Yu; Siu, Wai-Lok; Huang, Xiaoting – Journal of Educational Measurement, 2022
Multiple-choice (MC) items are widely used in educational tests. Distractor analysis, an important procedure for checking the utility of the response options within an MC item, can be readily implemented in the framework of item response theory (IRT). Although random guessing is a common behavior among test-takers answering MC items, none of the…
Descriptors: Guessing (Tests), Multiple Choice Tests, Item Response Theory, Attention
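For context, IRT-based distractor analysis is often built on Bock's nominal response model, in which the probability that an examinee with ability \theta selects option k of an m-option item is

    P(U = k \mid \theta) = \frac{\exp(a_k \theta + c_k)}{\sum_{h=1}^{m} \exp(a_h \theta + c_h)}

with slope a_k and intercept c_k per option. The article's extension for random guessing is not shown in this snippet; the formula is only the standard starting point.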
Kim, Yongnam – Journal of Educational Measurement, 2020
Does reviewing previous answers during multiple-choice exams help examinees increase their final score? This article formalizes the question using the potential outcomes framework, a rigorous framework for causal inference. Viewing examinees' reviewing status as a treatment and their final score as an outcome, the article first explains the challenges of…
Descriptors: Review (Reexamination), Multiple Choice Tests, Scores, Identification
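In potential outcomes notation, with Y(1) an examinee's final score after reviewing and Y(0) the score without reviewing, the causal estimand is the average treatment effect

    \tau = E[Y(1) - Y(0)]

and the identification challenge the abstract alludes to is that only one of the two potential scores is ever observed for any given examinee.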
van Laar, Saskia; Braeken, Johan – Journal of Educational Measurement, 2022
The low-stakes character of international large-scale educational assessments implies that a participating student might at times provide unrelated answers, as if they were not even reading the items and instead chose a response option at random throughout. Depending on the severity of this invalid response behavior, interpretations of the assessment…
Descriptors: Achievement Tests, Elementary Secondary Education, International Assessment, Foreign Countries
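A quick simulation (test layout and numbers invented, not from the article) shows why fully random responding matters: a disengaged responder's expected score sits near chance level rather than reflecting proficiency.

    import numpy as np

    rng = np.random.default_rng(0)
    n_items, n_options = 40, 4                      # hypothetical test layout
    key = rng.integers(n_options, size=n_items)     # answer key

    # A fully disengaged student picks an option uniformly at random per item.
    answers = rng.integers(n_options, size=n_items)
    print((answers == key).mean())   # about 1/n_options = 0.25 in expectation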
Chen, Chia-Wen; Andersson, Björn; Zhu, Jinxin – Journal of Educational Measurement, 2023
The certainty of response index (CRI) measures respondents' confidence level when answering an item. In conjunction with the answers to the items, previous studies have used descriptive statistics and arbitrary thresholds to identify student knowledge profiles from the CRIs. However, this approach overlooks the measurement error of the observed…
Descriptors: Item Response Theory, Factor Analysis, Psychometrics, Test Items
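The threshold-based approach the abstract criticizes can be sketched as follows; the 2.5 cutoff on a 0-5 rating scale and the state labels are illustrative of the arbitrary conventions at issue, not taken from this article.

    def knowledge_state(correct: bool, cri: float, cutoff: float = 2.5) -> str:
        """Label one response from its correctness plus a 0-5 confidence rating."""
        if correct and cri >= cutoff:
            return "knowledge"
        if correct:
            return "lucky guess"
        if cri >= cutoff:
            return "misconception"
        return "lack of knowledge"

    print(knowledge_state(True, 4.0))    # -> "knowledge"
    print(knowledge_state(False, 4.5))   # -> "misconception"

Because the observed CRI is treated as error-free, borderline ratings flip labels; that is the measurement-error problem the authors raise.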
Chen, Yi-Hsin; Senk, Sharon L.; Thompson, Denisse R.; Voogt, Kevin – Journal of Educational Measurement, 2019
The van Hiele theory and van Hiele Geometry Test have been extensively used in mathematics assessments across countries. The purpose of this study is to use classical test theory (CTT) and cognitive diagnostic modeling (CDM) frameworks to examine psychometric properties of the van Hiele Geometry Test and to compare how various classification…
Descriptors: Geometry, Mathematics Tests, Test Theory, Psychometrics
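A minimal way to compare the classifications the two frameworks produce is raw agreement plus Cohen's kappa; the level assignments below are made up for illustration and do not come from the study.

    import numpy as np

    # Hypothetical van Hiele level assignments for eight examinees
    ctt_levels = np.array([1, 2, 2, 3, 1, 2, 3, 3])   # from CTT cut scores
    cdm_levels = np.array([1, 2, 3, 3, 1, 2, 2, 3])   # from CDM mastery estimates

    p_o = (ctt_levels == cdm_levels).mean()           # observed agreement
    levels = np.unique(np.concatenate([ctt_levels, cdm_levels]))
    p_e = sum((ctt_levels == lv).mean() * (cdm_levels == lv).mean()
              for lv in levels)                       # chance agreement
    kappa = (p_o - p_e) / (1 - p_e)
    print(f"agreement={p_o:.2f}, kappa={kappa:.2f}")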
Shear, Benjamin R. – Journal of Educational Measurement, 2023
Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…
Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
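One standard screen for this kind of item-level group difference is the Mantel-Haenszel common odds ratio, computed across matched total-score strata; the counts below are hypothetical, and the article's own method is not shown in this snippet.

    # Each stratum: (right_ref, wrong_ref, right_focal, wrong_focal),
    # where strata are groups matched on total score. Counts are invented.
    strata = [
        (30, 10, 25, 15),
        (40, 20, 35, 25),
        (20, 30, 15, 35),
    ]

    num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    print(num / den)   # Mantel-Haenszel odds ratio; 1.0 means no DIF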
Lu, Jing; Wang, Chun – Journal of Educational Measurement, 2020
Item nonresponses are prevalent in standardized testing. They occur either when students fail to reach the end of a test, because of a time limit or quitting, or when students choose to omit some items strategically. Item nonresponses are often nonrandom, and hence the missing data mechanism needs to be properly modeled. In this paper, we…
Descriptors: Item Response Theory, Test Items, Standardized Tests, Responses
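A common way to let omissions be nonignorable, sketched here generically since the paper's exact specification is not in this snippet, is to pair the response model with a parallel model for the response indicator D_ij, driven by its own latent propensity:

    P(X_{ij} = 1 \mid \theta_i) = \frac{\exp(\theta_i - b_j)}{1 + \exp(\theta_i - b_j)},
    \qquad
    P(D_{ij} = 1 \mid \xi_i) = \frac{\exp(\xi_i - \beta_j)}{1 + \exp(\xi_i - \beta_j)}

Letting the ability \theta_i and the response propensity \xi_i correlate is what makes the missingness mechanism nonrandom rather than ignorable.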
Falk, Carl F.; Cai, Li – Journal of Educational Measurement, 2016
We present a logistic function of a monotonic polynomial with a lower asymptote, allowing additional flexibility beyond the three-parameter logistic model. We develop a maximum marginal likelihood-based approach to estimate the item parameters. The new item response model is demonstrated on math assessment data from a state, and a computationally…
Descriptors: Item Response Theory, Guessing (Tests), Mathematics Tests, Simulation
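The model named in the abstract has the form

    P(X = 1 \mid \theta) = c + (1 - c) \, \frac{1}{1 + \exp\{-m(\theta)\}}

where c is the lower asymptote (the guessing floor) and m(\theta) is a polynomial constrained to be monotonically increasing in \theta; taking m(\theta) = a(\theta - b) recovers the three-parameter logistic model as a special case.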
Terzi, Ragip; Suh, Youngsuk – Journal of Educational Measurement, 2015
An odds ratio approach (ORA) under the framework of a nested logit model was proposed for evaluating differential distractor functioning (DDF) in multiple-choice items and was compared with an existing ORA developed under the nominal response model. The performance of the two ORAs in detecting DDF was investigated through an extensive…
Descriptors: Test Bias, Multiple Choice Tests, Test Items, Comparative Analysis
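A generic odds-ratio DDF statistic, shown here only to fix ideas since the article's exact conditioning under the nested logit model is not in this snippet, compares how incorrect responders in two groups are drawn to a given distractor k:

    OR_k = \frac{p_k^{R} / (1 - p_k^{R})}{p_k^{F} / (1 - p_k^{F})}

where p_k^R and p_k^F are the proportions of incorrect responders in the reference and focal groups who selected distractor k; values far from 1 flag that distractor for DDF.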
Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Journal of Educational Measurement, 2016
This article introduces the Diagnostic Profiles (DP) standard setting method for setting a performance standard on a test developed from a cognitive diagnostic model (CDM), the outcome of which is a profile of mastered and not-mastered skills or attributes rather than a single test score. In the DP method, the key judgment task for panelists is a…
Descriptors: Models, Standard Setting, Profiles, Diagnostic Tests
Huang, Hung-Yu; Wang, Wen-Chung – Journal of Educational Measurement, 2014
The DINA (deterministic inputs, noisy "and" gate) model has been widely used in cognitive diagnosis tests and in the process of test development. Slip and guess parameters are included in the DINA model's function for the item responses. This study aimed to extend the DINA model by using a random-effects approach to allow…
Descriptors: Models, Guessing (Tests), Probability, Ability
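For reference, the DINA item response function combines the slip and guess parameters mentioned in the abstract with an all-or-nothing attribute requirement:

    P(X_{ij} = 1 \mid \alpha_i) = (1 - s_j)^{\eta_{ij}} \, g_j^{1 - \eta_{ij}},
    \qquad
    \eta_{ij} = \prod_k \alpha_{ik}^{q_{jk}}

Here \eta_{ij} = 1 only when examinee i has every attribute the Q-matrix requires for item j, s_j is the slip probability, and g_j the guess probability; the random-effects extension described in the abstract relaxes the assumption that these are fixed item constants.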
Chen, Jinsong; de la Torre, Jimmy; Zhang, Zao – Journal of Educational Measurement, 2013
As with any psychometric model, the validity of inferences from cognitive diagnosis models (CDMs) determines the extent to which these models can be useful. For inferences from CDMs to be valid, it is crucial that the fit of the model to the data be ascertained. Through a simulation study, this work investigated the sensitivity of various fit…
Descriptors: Models, Psychometrics, Goodness of Fit, Statistical Analysis
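Fit statistics of this kind are typically residual-based: an observed data summary is compared with its model-implied counterpart. One common example, given here as a generic illustration rather than the article's specific list, is the absolute residual of item-pair correlations,

    r_{jj'} = \left| \mathrm{Corr}(X_j, X_{j'}) - \widehat{\mathrm{Corr}}(X_j, X_{j'}) \right|

where the second term is computed from the fitted CDM; large residuals signal misfit for that item pair.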
Sachse, Karoline A.; Roppelt, Alexander; Haag, Nicole – Journal of Educational Measurement, 2016
Trend estimation in international comparative large-scale assessments relies on measurement invariance between countries. However, cross-national differential item functioning (DIF) has been repeatedly documented. We ran a simulation study using national item parameters, which required trends to be computed separately for each country, to compare…
Descriptors: Comparative Analysis, Measurement, Test Bias, Simulation
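The basic quantity at stake is the country-level trend between two assessment cycles,

    \hat{\Delta}_c = \hat{\mu}_c^{(t_2)} - \hat{\mu}_c^{(t_1)}

where the cycle means \hat{\mu}_c can be estimated under common international item parameters or, as in the simulation described, under national item parameters; cross-national DIF is what drives the two choices apart.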
Oh, Hyeonjoo; Moses, Tim – Journal of Educational Measurement, 2012
This study investigated differences between two approaches to chained equipercentile (CE) equating (one- and bi-direction CE equating) in nearly equal groups and relatively unequal groups. In one-direction CE equating, the new form is linked to the anchor in one sample of examinees and the anchor is linked to the reference form in the other…
Descriptors: Equated Scores, Statistical Analysis, Comparative Analysis, Differences
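In chained equipercentile equating, the new form X is carried to the reference form Y through the anchor A by composing two equipercentile links estimated in the two samples:

    e_Y(x) = F_{Y,2}^{-1}\!\left( F_{A,2}\left( F_{A,1}^{-1}\left( F_{X,1}(x) \right) \right) \right)

where F_{X,1} and F_{A,1} are score distributions from the first sample and F_{A,2}, F_{Y,2} from the second. This shows the one-direction chain; the bi-direction variant the study compares differs in how the anchor links are estimated, a detail not given in this snippet.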