ERIC - Search Results

Publication Date

In 2025	1
Since 2024	5
Since 2021 (last 5 years)	12
Since 2016 (last 10 years)	22
Since 2006 (last 20 years)	29

Descriptor

Achievement Tests	113
Test Validity	24
Test Items	23
Item Response Theory	22
Foreign Countries	20
Scores	20
Test Construction	19
International Assessment	18
Testing Problems	18
Models	17
Secondary School Students	17
Standardized Tests	17
Test Reliability	16
Elementary Education	13
Latent Trait Theory	13
Comparative Analysis	12
Elementary Secondary Education	12
Item Analysis	12
Mathematical Models	12
Reading Tests	11
Test Results	11
Equated Scores	10
Mathematics Tests	10
Simulation	10
Statistical Analysis	9
More ▼

Source

Journal of Educational…

113

Publication Type

Journal Articles	92
Reports - Research	65
Reports - Evaluative	15
Information Analyses	5
Opinion Papers	4
Reports - Descriptive	4
Book/Product Reviews	2
Speeches/Meeting Papers	2
Reports - General	1

Education Level

Secondary Education	17
Elementary Secondary Education	2
Elementary Education	1
Grade 4	1
Grade 8	1
High Schools	1
Higher Education	1
Intermediate Grades	1
Postsecondary Education	1

Audience

Researchers	4
Practitioners	2

Location

Belgium	1
Netherlands	1
Turkey	1

Laws, Policies, & Programs

What Works Clearinghouse Rating

Journal of Educational Measurement X

Showing 1 to 15 of 113 results Save | Export

Optimal Calibration of Items for Multidimensional Achievement Tests

Peer reviewed

Direct link

Mahmood Ul Hassan; Frank Miller – Journal of Educational Measurement, 2024

Multidimensional achievement tests are recently gaining more importance in educational and psychological measurements. For example, multidimensional diagnostic tests can help students to determine which particular domain of knowledge they need to improve for better performance. To estimate the characteristics of candidate items (calibration) for…

Descriptors: Multidimensional Scaling, Achievement Tests, Test Items, Test Construction

MSAEM Estimation for Confirmatory Multidimensional Four-Parameter Normal Ogive Models

Peer reviewed

Direct link

Jia Liu; Xiangbin Meng; Gongjun Xu; Wei Gao; Ningzhong Shi – Journal of Educational Measurement, 2024

In this paper, we develop a mixed stochastic approximation expectation-maximization (MSAEM) algorithm coupled with a Gibbs sampler to compute the marginalized maximum a posteriori estimate (MMAPE) of a confirmatory multidimensional four-parameter normal ogive (M4PNO) model. The proposed MSAEM algorithm not only has the computational advantages of…

Descriptors: Algorithms, Achievement Tests, Foreign Countries, International Assessment

Variation in Respondent Speed and Its Implications: Evidence from an Adaptive Testing Scenario

Peer reviewed

Direct link

Benjamin W. Domingue; Klint Kanopka; Ben Stenhaug; James Soland; Megan Kuhfeld; Steve Wise; Chris Piech – Journal of Educational Measurement, 2021

The more frequent collection of response time data is leading to an increased need for an understanding of how such data can be included in measurement models. Models for response time have been advanced, but relatively limited large-scale empirical investigations have been conducted. We take advantage of a large data set from the adaptive NWEA…

Descriptors: Achievement Tests, Reaction Time, Reading Tests, Accuracy

Using Retest Data to Evaluate and Improve Effort-Moderated Scoring

Peer reviewed

Direct link

Wise, Steven L.; Kuhfeld, Megan R. – Journal of Educational Measurement, 2021

There has been a growing research interest in the identification and management of disengaged test taking, which poses a validity threat that is particularly prevalent with low-stakes tests. This study investigated effort-moderated (E-M) scoring, in which item responses classified as rapid guesses are identified and excluded from scoring. Using…

Descriptors: Scoring, Data Use, Response Style (Tests), Guessing (Tests)

Incorporating Test-Taking Engagement into Multistage Adaptive Testing Design for Large-Scale Assessments

Peer reviewed

Direct link

Okan Bulut; Guher Gorgun; Hacer Karamese – Journal of Educational Measurement, 2025

The use of multistage adaptive testing (MST) has gradually increased in large-scale testing programs as MST achieves a balanced compromise between linear test design and item-level adaptive testing. MST works on the premise that each examinee gives their best effort when attempting the items, and their responses truly reflect what they know or can…

Descriptors: Response Style (Tests), Testing Problems, Testing Accommodations, Measurement

DIF Detection for Multiple Groups: Comparing Three-Level GLMMs and Multiple-Group IRT Models

Peer reviewed

Direct link

Carmen Köhler; Lale Khorramdel; Artur Pokropek; Johannes Hartig – Journal of Educational Measurement, 2024

For assessment scales applied to different groups (e.g., students from different states; patients in different countries), multigroup differential item functioning (MG-DIF) needs to be evaluated in order to ensure that respondents with the same trait level but from different groups have equal response probabilities on a particular item. The…

Descriptors: Measures (Individuals), Test Bias, Models, Item Response Theory

Differences in Time Usage as a Competing Hypothesis for Observed Group Differences in Accuracy with an Application to Observed Gender Differences in PISA Data

Peer reviewed

Direct link

Radhika Kapoor; Erin Fahle; Klint Kanopka; David Klinowski; Ana Trindade Ribeiro; Benjamin W. Domingue – Journal of Educational Measurement, 2024

Group differences in test scores are a key metric in education policy. Response time offers novel opportunities for understanding these differences, especially in low-stakes settings. Here, we describe how observed group differences in test accuracy can be attributed to group differences in latent response speed or group differences in latent…

Descriptors: Foreign Countries, Secondary School Students, Achievement Tests, International Assessment

Random Responders in the TIMSS 2015 Student Questionnaire: A Threat to Validity?

Peer reviewed

Direct link

van Laar, Saskia; Braeken, Johan – Journal of Educational Measurement, 2022

The low-stakes character of international large-scale educational assessments implies that a participating student might at times provide unrelated answers as if s/he was not even reading the items and choosing a response option randomly throughout. Depending on the severity of this invalid response behavior, interpretations of the assessment…

Descriptors: Achievement Tests, Elementary Secondary Education, International Assessment, Foreign Countries

Linking via Pseudo-Equivalent Group Design: Methodological Considerations and an Application to the PISA and PIACC Assessments

Peer reviewed

Direct link

Pokropek, Artur; Borgonovi, Francesca – Journal of Educational Measurement, 2020

This article presents the pseudo-equivalent group approach and discusses how it can enhance the quality of linking in the presence of nonequivalent groups. The pseudo-equivalent group approach allows to achieve pseudo-equivalence using propensity score reweighting techniques. We use it to perform linking to establish scale concordance between two…

Descriptors: Foreign Countries, Secondary School Students, Achievement Tests, International Assessment

On Joining a Signal Detection Choice Model with Response Time Models

Peer reviewed

Direct link

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2021

In a signal detection theory (SDT) approach to multiple choice exams, examinees are viewed as choosing, for each item, the alternative that is perceived as being the most plausible, with perceived plausibility depending in part on whether or not an item is known. The SDT model is a process model and provides measures of item difficulty, item…

Descriptors: Perception, Bias, Theories, Test Items

Multiple-Group Joint Modeling of Item Responses, Response Times, and Action Counts with the Conway-Maxwell-Poisson Distribution

Peer reviewed

Direct link

Qiao, Xin; Jiao, Hong; He, Qiwei – Journal of Educational Measurement, 2023

Multiple group modeling is one of the methods to address the measurement noninvariance issue. Traditional studies on multiple group modeling have mainly focused on item responses. In computer-based assessments, joint modeling of response times and action counts with item responses helps estimate the latent speed and action levels in addition to…

Descriptors: Multivariate Analysis, Models, Item Response Theory, Statistical Distributions

Gender Bias in Test Item Formats: Evidence from PISA 2009, 2012, and 2015 Math and Reading Tests

Peer reviewed

Direct link

Shear, Benjamin R. – Journal of Educational Measurement, 2023

Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…

Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests

Explanatory Cognitive Diagnostic Modeling Incorporating Response Times

Peer reviewed

Direct link

Qiao, Xin; Jiao, Hong – Journal of Educational Measurement, 2021

This study proposes explanatory cognitive diagnostic model (CDM) jointly incorporating responses and response times (RTs) with the inclusion of item covariates related to both item responses and RTs. The joint modeling of item responses and RTs intends to provide more information for cognitive diagnosis while item covariates can be used to predict…

Descriptors: Cognitive Measurement, Models, Reaction Time, Test Items

A More Flexible Bayesian Multilevel Bifactor Item Response Theory Model

Peer reviewed

Direct link

Fujimoto, Ken A. – Journal of Educational Measurement, 2020

Multilevel bifactor item response theory (IRT) models are commonly used to account for features of the data that are related to the sampling and measurement processes used to gather those data. These models conventionally make assumptions about the portions of the data structure that represent these features. Unfortunately, when data violate these…

Descriptors: Bayesian Statistics, Item Response Theory, Achievement Tests, Secondary School Students

A Response Time Process Model for Not-Reached and Omitted Items

Peer reviewed

Direct link

Lu, Jing; Wang, Chun – Journal of Educational Measurement, 2020

Item nonresponses are prevalent in standardized testing. They happen either when students fail to reach the end of a test due to a time limit or quitting, or when students choose to omit some items strategically. Oftentimes, item nonresponses are nonrandom, and hence, the missing data mechanism needs to be properly modeled. In this paper, we…

Descriptors: Item Response Theory, Test Items, Standardized Tests, Responses

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Linn, Robert L.	4
Yen, Wendy M.	4
Phillips, S. E.	3
Wise, Steven L.	3
Benjamin W. Domingue	2
Birenbaum, Menucha	2
Choi, Seung W.	2
Debeer, Dries	2
Forsyth, Robert A.	2
Hoover, H. D.	2
Janssen, Rianne	2
Jiao, Hong	2
Kim, Dong-In	2
Klint Kanopka	2
Loyd, Brenda H.	2
Mehrens, William A.	2
Qiao, Xin	2
Rutkowski, Leslie	2
Sinharay, Sandip	2
Thissen, David	2
Wan, Ping	2
Williams, Valerie S. L.	2
Airasian, Peter W.	1
Ambrosino, Robert J.	1
More ▼

Program for International…	17
Iowa Tests of Basic Skills	5
National Assessment of…	4
California Achievement Tests	3
Metropolitan Achievement Tests	3
SAT (College Admission Test)	3
Comprehensive Tests of Basic…	2
Indiana Statewide Testing for…	2
North Carolina End of Course…	2
Stanford Achievement Tests	2
Trends in International…	2
General Educational…	1
Iowa Tests of Educational…	1
Kaufman Assessment Battery…	1
McCarthy Scales of Childrens…	1
Measures of Academic Progress	1
Metropolitan Readiness Tests	1
National Longitudinal Study…	1
Preschool Inventory	1
Program for the International…	1
Progress in International…	1
SRA Achievement Series	1
Sequential Tests of…	1
State Trait Anxiety Inventory	1
Test of Standard Written…	1
More ▼