ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	6

Descriptor

Item Response Theory	7
Psychometrics	7
Testing Problems	7
Test Items	5
Evaluation Methods	4
Evaluation Research	4
Diagnostic Tests	3
Educational Assessment	3
Evaluation Problems	3
Measurement	3
Simulation	3
Test Construction	3
Cognitive Tests	2
Test Validity	2
Algorithms	1
Bayesian Statistics	1
College Students	1
Computer Assisted Testing	1
Correlation	1
Decision Making	1
Difficulty Level	1
Education Majors	1
Educational Testing	1
Effect Size	1
Elementary Secondary Education	1
More ▼

Source

Journal of Educational…	3
Educational and Psychological…	1
Electronic Journal of Science…	1
Journal of Educational and…	1
Measurement:…	1

Author

Chen, Yunxiao	1
Cui, Ying	1
Engelhard, George, Jr.	1
Fugate, Joshua Z.	1
González-Espada, Wilson J.	1
Karelitz, Tzur M.	1
Knell, Janie L.	1
Lee, Yi-Hsuan	1
Leighton, Jacqueline P.	1
Lewis, Charles	1
Li, Xiaoou	1
Robitzsch, Alexander	1
Rupp, Andre A.	1
Sullivan, Rubye K.	1
Wainer, Howard	1
Wilhoite, Andrea P.	1
de La Torre, Jimmy	1
More ▼

Publication Type

Journal Articles	7
Reports - Evaluative	3
Reports - Research	2
Opinion Papers	1
Reports - Descriptive	1

Education Level

Elementary Secondary Education	1
Higher Education	1

Audience

Location

Kentucky

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection

Peer reviewed

Direct link

Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022

In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…

Descriptors: Standardized Tests, Test Items, Test Validity, Scores

Using Item Response Theory to Improve Locally-Constructed Multiple Choice Tests: Measuring Knowledge Gains and Curricular Effectiveness

Peer reviewed
PDF on ERIC

Download full text

Knell, Janie L.; Wilhoite, Andrea P.; Fugate, Joshua Z.; González-Espada, Wilson J. – Electronic Journal of Science Education, 2015

Current science education reform efforts emphasize teaching K-12 science using hands-on, inquiry activities. For maximum learning and probability of implementation among inservice teachers, these strategies must be modeled in college science courses for preservice teachers. About a decade ago, Morehead State University revised their science…

Descriptors: Item Response Theory, Multiple Choice Tests, Test Construction, Psychometrics

Impact of Diagnosticity on the Adequacy of Models for Cognitive Diagnosis under a Linear Attribute Structure: A Simulation Study

Peer reviewed

Direct link

de La Torre, Jimmy; Karelitz, Tzur M. – Journal of Educational Measurement, 2009

Compared to unidimensional item response models (IRMs), cognitive diagnostic models (CDMs) based on latent classes represent examinees' knowledge and item requirements using discrete structures. This study systematically examines the viability of retrofitting CDMs to IRM-based data with a linear attribute structure. The study utilizes a procedure…

Descriptors: Simulation, Item Response Theory, Psychometrics, Evaluation Methods

Impact of Missing Data on the Detection of Differential Item Functioning: The Case of Mantel-Haenszel and Logistic Regression Analysis

Peer reviewed

Direct link

Robitzsch, Alexander; Rupp, Andre A. – Educational and Psychological Measurement, 2009

This article describes the results of a simulation study to investigate the impact of missing data on the detection of differential item functioning (DIF). Specifically, it investigates how four methods for dealing with missing data (listwise deletion, zero imputation, two-way imputation, response function imputation) interact with two methods of…

Descriptors: Test Bias, Simulation, Interaction, Effect Size

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

Toward a Psychometrics for Testlets.

Peer reviewed

Wainer, Howard; Lewis, Charles – Journal of Educational Measurement, 1990

Three different applications of the testlet concept are presented, and the psychometric models most suitable for each application are described. Difficulties that testlets can help overcome include (1) context effects; (2) item ordering; and (3) content balancing. Implications for test construction are discussed. (SLD)

Descriptors: Algorithms, Computer Assisted Testing, Elementary Secondary Education, Item Response Theory

Re-Conceptualizing Validity within the Context of a New Measure of Mathematical Knowledge for Teaching

Peer reviewed

Direct link

Engelhard, George, Jr.; Sullivan, Rubye K. – Measurement: Interdisciplinary Research and Perspectives, 2007

In this journal issue, the authors of the focus articles have provided a suite of very stimulating and thoughtful articles. The overarching purpose of this research is to explore the application of principles derived from the view of validity proposed by Kane (2004) to their research on issues related to the measurement of mathematical knowledge…

Descriptors: Test Validity, Psychometrics, Test Construction, Evaluation Research