ERIC - Search Results

Showing 1 to 15 of 16 results

Generalizing beyond the Test: Permutation-Based Profile Analysis for Explaining DIF Using Item Features

Peer reviewed

Bolsinova, Maria; Tijmstra, Jesper; Rutkowski, Leslie; Rutkowski, David – Journal of Educational and Behavioral Statistics, 2024

Profile analysis is one of the main tools for studying whether differential item functioning can be related to specific features of test items. While relevant, profile analysis in its current form has two restrictions that limit its usefulness in practice: It assumes that all test items have equal discrimination parameters, and it does not test…

Descriptors: Test Items, Item Analysis, Generalizability Theory, Achievement Tests

Diagnosing Primary Students' Reading Progression: Is Cognitive Diagnostic Computerized Adaptive Testing the Way Forward?

Peer reviewed

Li, Yan; Huang, Chao; Liu, Jia – Journal of Educational and Behavioral Statistics, 2023

Cognitive diagnostic computerized adaptive testing (CD-CAT) is a cutting-edge technology in educational measurement that aims to provide feedback on examinees' strengths and weaknesses while increasing test accuracy and efficiency. To date, most CD-CAT studies have made methodological progress under simulated conditions, but little has…

Descriptors: Computer Assisted Testing, Cognitive Tests, Diagnostic Tests, Reading Tests

Mean Comparisons of Many Groups in the Presence of DIF: An Evaluation of Linking and Concurrent Scaling Approaches

Peer reviewed

Robitzsch, Alexander; Lüdtke, Oliver – Journal of Educational and Behavioral Statistics, 2022

One of the primary goals of international large-scale assessments in education is the comparison of country means in student achievement. This article introduces a framework for discussing differential item functioning (DIF) for such mean comparisons. We compare three different linking methods: concurrent scaling based on full invariance,…

Descriptors: Test Bias, International Assessment, Scaling, Comparative Analysis

Testing the Within-State Distribution in Mixture Models for Responses and Response Times

Peer reviewed

Kuijpers, Renske E.; Visser, Ingmar; Molenaar, Dylan – Journal of Educational and Behavioral Statistics, 2021

Mixture models have been developed to enable detection of within-subject differences in responses and response times to psychometric test items. To enable mixture modeling of both responses and response times, a distributional assumption is needed for the within-state response time distribution. Since violations of the assumed response time…

Descriptors: Test Items, Responses, Reaction Time, Models

A Class of Cognitive Diagnosis Models for Polytomous Data

Peer reviewed

Gao, Xuliang; Ma, Wenchao; Wang, Daxun; Cai, Yan; Tu, Dongbo – Journal of Educational and Behavioral Statistics, 2021

This article proposes a class of cognitive diagnosis models (CDMs) for polytomously scored items with different link functions. Many existing polytomous CDMs can be considered as special cases of the proposed class of polytomous CDMs. Simulation studies were carried out to investigate the feasibility of the proposed CDMs and the performance of…

Descriptors: Cognitive Measurement, Models, Test Items, Scoring

Detecting Noneffortful Responses Based on a Residual Method Using an Iterative Purification Process

Peer reviewed

Liu, Yue; Liu, Hongyun – Journal of Educational and Behavioral Statistics, 2021

The prevalence and serious consequences of noneffortful responses from unmotivated examinees are well-known in educational measurement. In this study, we propose to apply an iterative purification process based on a response time residual method with fixed item parameter estimates to detect noneffortful responses. The proposed method is compared…

Descriptors: Response Style (Tests), Reaction Time, Test Items, Accuracy

Kernel Equating Using Propensity Scores for Nonequivalent Groups

Peer reviewed

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2019

When equating two test forms, the equated scores will be biased if the test groups differ in ability. To adjust for the ability imbalance between nonequivalent groups, a set of common items is often used. When no common items are available, it has been suggested to use covariates correlated with the test scores instead. In this article, we reduce…

Descriptors: Equated Scores, Test Items, Probability, College Entrance Examinations

A Bayesian Item Response Model for Examining Item Position Effects in Complex Survey Data

Peer reviewed

Trendtel, Matthias; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021

A multidimensional Bayesian item response model is proposed for modeling item position effects. The first dimension corresponds to the ability that is to be measured; the second dimension represents a factor that allows for individual differences in item position effects called persistence. This model allows for nonlinear item position effects on…

Descriptors: Bayesian Statistics, Item Response Theory, Test Items, Test Format

Category-Level Model Selection for the Sequential G-DINA Model

Peer reviewed

Ma, Wenchao; de la Torre, Jimmy – Journal of Educational and Behavioral Statistics, 2019

Solving a constructed-response item usually requires successfully performing a sequence of tasks. Each task could involve different attributes, and those required attributes may be "condensed" in various ways to produce the responses. The sequential generalized deterministic input noisy "and" gate model is a general cognitive…

Descriptors: Test Items, Cognitive Measurement, Models, Hypothesis Testing

Testing Latent Variable Distribution Fit in IRT Using Posterior Residuals

Peer reviewed

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2021

This research proposes a new statistic for testing latent variable distribution fit for unidimensional item response theory (IRT) models. If the typical assumption of normality is violated, then item parameter estimates will be biased, and dependent quantities such as IRT score estimates will be adversely affected. The proposed statistic compares…

Descriptors: Item Response Theory, Simulation, Scores, Comparative Analysis

Absolute and Relative Measures of Instructional Sensitivity

Peer reviewed

Naumann, Alexander; Hartig, Johannes; Hochweber, Jan – Journal of Educational and Behavioral Statistics, 2017

Valid inferences on teaching drawn from students' test scores require that tests are sensitive to the instruction students received in class. Accordingly, measures of the test items' instructional sensitivity provide empirical support for validity claims about inferences on instruction. In the present study, we first introduce the concepts of…

Descriptors: Test Items, Item Response Theory, Instructional Effectiveness, Psychometrics

A Strategy for Replacing Sum Scoring

Peer reviewed

Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017

This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…

Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics

Toward Education Quality Improvement in China: A Brief Overview of the National Assessment of Education Quality

Peer reviewed

Jiang, Yu; Zhang, Jiahui; Xin, Tao – Journal of Educational and Behavioral Statistics, 2019

This article is an overview of the National Assessment of Education Quality (NAEQ) of China in reading, mathematics, sciences, arts, physical education, and moral education at Grades 4 and 8. After a review of the background and history of NAEQ, we present the assessment framework with students' holistic development at the core and the design for…

Descriptors: Foreign Countries, Educational Quality, Educational Improvement, National Competency Tests

Item Response Data Analysis Using Stata Item Response Theory Package

Peer reviewed

Yang, Ji Seung; Zheng, Xiaying – Journal of Educational and Behavioral Statistics, 2018

The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from Programme of International Student Assessment, we review the IRT package from…

Descriptors: Item Response Theory, Item Analysis, Computer Software, Statistical Analysis

Profiles in Research: Fumiko Samejima

Peer reviewed

Wainer, Howard; Robinson, Daniel H. – Journal of Educational and Behavioral Statistics, 2007

Fumiko Samejima is best known for her pioneering work in polytomous response item response theory (IRT), yielding the eponymous model that has been used broadly for more than 30 years. In this interview, Samejima, on the verge of retiring from her faculty position at the University of Tennessee, discusses her life and career. She also describes…

Descriptors: Foreign Countries, Psychometrics, Item Response Theory, Test Items
