ERIC - Search Results

Publication Date

In 2025	3
Since 2024	6
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	21
Since 2006 (last 20 years)	68

Descriptor

Item Response Theory	77
Psychometrics	77
Simulation	77
Test Items	35
Models	34
Evaluation Methods	20
Computation	16
Measurement	16
Error of Measurement	14
Comparative Analysis	13
Scores	13
Data Analysis	12
Measurement Techniques	12
Goodness of Fit	10
Statistical Analysis	10
Evaluation Research	9
Maximum Likelihood Statistics	9
Computer Assisted Testing	8
Difficulty Level	8
Educational Assessment	8
Test Construction	8
Achievement Tests	7
Sample Size	7
Accuracy	6
Bayesian Statistics	6
More ▼

Publication Type

Journal Articles	62
Reports - Research	43
Reports - Evaluative	24
Dissertations/Theses -…	7
Reports - Descriptive	2
Speeches/Meeting Papers	2
Information Analyses	1

Education Level

Elementary Secondary Education	3
Elementary Education	2
Postsecondary Education	2
Adult Education	1
Grade 4	1
Higher Education	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Researchers

Location

Canada	1
Denmark	1
Florida	1

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	2
Armed Services Vocational…	1
Behavioral Risk Factor…	1
Big Five Inventory	1
Florida Comprehensive…	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 77 results Save | Export

The Accuracy of Estimating Parameters of Multiple-Choice Test Items, Following Item-Response Theory: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025

Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…

Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items

Exploring the Effects of Collapsing Rating Scale Categories in Polytomous Item Response Theory Analyses: An Illustration and Simulation Study

Peer reviewed

Direct link

Chia-Lin Tsai; Stefanie Wind; Samantha Estrada – Measurement: Interdisciplinary Research and Perspectives, 2025

Researchers who work with ordinal rating scales sometimes encounter situations where the scale categories do not function in the intended or expected way. For example, participants' use of scale categories may result in an empirical difficulty ordering for the categories that does not match what was intended. Likewise, the level of distinction…

Descriptors: Rating Scales, Item Response Theory, Psychometrics, Self Efficacy

Planning Missing Data Designs for Human Ratings in Creativity Research: A Practical Guide

Peer reviewed

Direct link

Boris Forthmann; Benjamin Goecke; Roger E. Beaty – Creativity Research Journal, 2025

Human ratings are ubiquitous in creativity research. Yet, the process of rating responses to creativity tasks -- typically several hundred or thousands of responses, per rater -- is often time-consuming and expensive. Planned missing data designs, where raters only rate a subset of the total number of responses, have been recently proposed as one…

Descriptors: Creativity, Research, Researchers, Research Methodology

Multi-Group Regularized Gaussian Variational Estimation: Fast Detection of DIF

Peer reviewed

Direct link

Weicong Lyu; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Data harmonization is an emerging approach to strategically combining data from multiple independent studies, enabling addressing new research questions that are not answerable by a single contributing study. A fundamental psychometric challenge for data harmonization is to create commensurate measures for the constructs of interest across…

Descriptors: Data Analysis, Test Items, Psychometrics, Item Response Theory

A Note on Improving Variational Estimation for Multidimensional Item Response Theory

Peer reviewed

Direct link

Chenchen Ma; Jing Ouyang; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Survey instruments and assessments are frequently used in many domains of social science. When the constructs that these assessments try to measure become multifaceted, multidimensional item response theory (MIRT) provides a unified framework and convenient statistical tool for item analysis, calibration, and scoring. However, the computational…

Descriptors: Algorithms, Item Response Theory, Scoring, Accuracy

Application of Change Point Analysis of Response Time Data to Detect Test Speededness

Peer reviewed

Direct link

Cheng, Ying; Shao, Can – Educational and Psychological Measurement, 2022

Computer-based and web-based testing have become increasingly popular in recent years. Their popularity has dramatically expanded the availability of response time data. Compared to the conventional item response data that are often dichotomous or polytomous, response time has the advantage of being continuous and can be collected in an…

Descriptors: Reaction Time, Test Wiseness, Computer Assisted Testing, Simulation

Modified Item-Fit Indices for Dichotomous IRT Models with Missing Data

Peer reviewed
PDF on ERIC

Download full text

Direct link

Xue Zhang; Chun Wang – Grantee Submission, 2022

Item-level fit analysis not only serves as a complementary check to global fit analysis, it is also essential in scale development because the fit results will guide item revision and/or deletion (Liu & Maydeu-Olivares, 2014). During data collection, missing response data may likely happen due to various reasons. Chi-square-based item fit…

Descriptors: Goodness of Fit, Item Response Theory, Scores, Test Length

Monotonicity as a Nonparametric Approach to Evaluating Rater Fit in Performance Assessments

Peer reviewed

Direct link

Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2020

Rater fit analyses provide insight into the degree to which rater judgments correspond to expected properties, as defined within a measurement framework. Parametric models such as the Rasch model provide a useful framework for evaluating rating quality; however, these models are not appropriate for all assessment contexts. The purpose of this…

Descriptors: Evaluators, Goodness of Fit, Simulation, Psychometrics

The Bayesian Multilevel Trifactor Item Response Theory Model

Peer reviewed

Direct link

Fujimoto, Ken A. – Educational and Psychological Measurement, 2019

Advancements in item response theory (IRT) have led to models for dual dependence, which control for cluster and method effects during a psychometric analysis. Currently, however, this class of models does not include one that controls for when the method effects stem from two method sources in which one source functions differently across the…

Descriptors: Bayesian Statistics, Item Response Theory, Psychometrics, Models

The Psychometric Modeling of Scientific Reasoning: A Review and Recommendations for Future Avenues

Peer reviewed

Direct link

Edelsbrunner, Peter A.; Dablander, Fabian – Educational Psychology Review, 2019

Psychometric modeling has become a frequently used statistical tool in research on scientific reasoning. We review psychometric modeling practices in this field, including model choice, model testing, and researchers' inferences based on their psychometric practices. A review of 11 empirical research studies reveals that the predominant…

Descriptors: Psychometrics, Science Process Skills, Item Response Theory, Educational Assessment

Using Existing Data to Inform Development of New Item Types. Research Report. ETS RR-20-01

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Ling, Guangming; Frankel, Lois – ETS Research Report Series, 2020

With advances in technology, researchers and test developers are developing new item types to measure complex skills like problem solving and critical thinking. Analyzing such items is often challenging because of their complicated response patterns, and thus it is important to develop psychometric methods for practitioners and researchers to…

Descriptors: Test Construction, Test Items, Item Analysis, Psychometrics

Impact of Item Parameter Drift on Rasch Scale Stability in Small Samples over Multiple Administrations

Peer reviewed

Direct link

Kopp, Jason P.; Jones, Andrew T. – Applied Measurement in Education, 2020

Traditional psychometric guidelines suggest that at least several hundred respondents are needed to obtain accurate parameter estimates under the Rasch model. However, recent research indicates that Rasch equating results in accurate parameter estimates with sample sizes as small as 25. Item parameter drift under the Rasch model has been…

Descriptors: Item Response Theory, Psychometrics, Sample Size, Sampling

Variational Estimation for Multidimensional Generalized Partial Credit Model

Peer reviewed

Direct link

Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…

Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics

Reliably Assessing Growth with Longitudinal Diagnostic Classification Models

Peer reviewed

Direct link

Madison, Matthew J. – Educational Measurement: Issues and Practice, 2019

Recent advances have enabled diagnostic classification models (DCMs) to accommodate longitudinal data. These longitudinal DCMs were developed to study how examinees change, or transition, between different attribute mastery statuses over time. This study examines using longitudinal DCMs as an approach to assessing growth and serves three purposes:…

Descriptors: Longitudinal Studies, Item Response Theory, Psychometrics, Criterion Referenced Tests

Computerized Adaptive Testing in Early Education: Exploring the Impact of Item Position Effects on Ability Estimation

Peer reviewed

Direct link

Albano, Anthony D.; Cai, Liuhan; Lease, Erin M.; McConnell, Scott R. – Journal of Educational Measurement, 2019

Studies have shown that item difficulty can vary significantly based on the context of an item within a test form. In particular, item position may be associated with practice and fatigue effects that influence item parameter estimation. The purpose of this research was to examine the relevance of item position specifically for assessments used in…

Descriptors: Test Items, Computer Assisted Testing, Item Analysis, Difficulty Level

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Psychometrika	14
Educational and Psychological…	8
ProQuest LLC	7
Applied Psychological…	6
Journal of Educational…	6
Grantee Submission	5
Applied Measurement in…	3
International Journal of…	3
Measurement:…	3
Educational Measurement:…	2
Journal of Educational and…	2
Advances in Health Sciences…	1
American Journal of…	1
Asia Pacific Education Review	1
Creativity Research Journal	1
ETS Research Report Series	1
Educational Assessment	1
Educational Process:…	1
Educational Psychology Review	1
Educational Research and…	1
Journal of Applied Measurement	1
Journal of Applied Testing…	1
Multivariate Behavioral…	1
Psicologica: International…	1
Studies in Educational…	1
More ▼

Chun Wang	4
Gongjun Xu	3
Bolt, Daniel M.	2
Ceulemans, Eva	2
Mislevy, Robert J.	2
Penfield, Randall D.	2
Roberts, James S.	2
Robitzsch, Alexander	2
Rupp, Andre A.	2
Sijtsma, Klaas	2
Van Mechelen, Iven	2
Wilson, Mark	2
Zumbo, Bruno D.	2
Aiman Mohammad Freihat	1
Albano, Anthony D.	1
Andrich, David	1
Anselmi, Pasquale	1
Antal, Judit	1
Bartolucci, Francesco	1
Benjamin Goecke	1
Bergeron, Jennifer M.	1
Boris Forthmann	1
Brown, Richard S.	1
Busing, Frank M. T. A.	1
Cai, Li	1
More ▼