ERIC - Search Results

Publication Date

In 2026	0
Since 2025	3
Since 2022 (last 5 years)	24
Since 2017 (last 10 years)	55
Since 2007 (last 20 years)	101

Descriptor

Bayesian Statistics	172
Test Items	172
Item Response Theory	90
Models	47
Adaptive Testing	40
Simulation	37
Computer Assisted Testing	36
Computation	35
Maximum Likelihood Statistics	35
Monte Carlo Methods	33
Comparative Analysis	31
Difficulty Level	31
Test Construction	28
Item Analysis	27
Estimation (Mathematics)	26
Statistical Analysis	25
Accuracy	23
Markov Processes	23
Foreign Countries	21
Scores	19
Test Bias	19
Ability	17
Goodness of Fit	17
Achievement Tests	16
Mathematics Tests	16
More ▼

Publication Type

Journal Articles	118
Reports - Research	102
Reports - Evaluative	44
Speeches/Meeting Papers	14
Dissertations/Theses -…	10
Reports - Descriptive	10
Information Analyses	4
Numerical/Quantitative Data	4
Collected Works - Proceedings	1
Collected Works - Serials	1
Opinion Papers	1
More ▼

Education Level

Higher Education	15
Secondary Education	10
Postsecondary Education	8
Grade 8	5
Elementary Education	4
Elementary Secondary Education	4
Middle Schools	4
Grade 4	3
Intermediate Grades	3
Junior High Schools	3
Early Childhood Education	2
High Schools	2
Preschool Education	2
Grade 12	1
Grade 5	1
Grade 7	1
Grade 9	1
Kindergarten	1
Primary Education	1
More ▼

Audience

Practitioners	1
Researchers	1

Location

Taiwan	3
Canada	2
Germany	2
Netherlands	2
Nigeria	2
Saudi Arabia	2
Africa	1
Botswana	1
Chile	1
Georgia Republic	1
Germany (Berlin)	1
Ghana	1
Malaysia	1
North Carolina (Charlotte)	1
Norway	1
Philippines	1
Poland	1
Russia	1
Singapore	1
South Africa	1
Switzerland	1
Thailand	1
Turkey	1
United States	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	5
Trends in International…	4
National Assessment of…	3
Comprehensive Tests of Basic…	2
Graduate Record Examinations	2
ACT Assessment	1
Armed Services Vocational…	1
COMPASS (Computer Assisted…	1
California Achievement Tests	1
Law School Admission Test	1
MacArthur Communicative…	1
Michigan Test of English…	1
Progress in International…	1
School and College Ability…	1
Wechsler Adult Intelligence…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 172 results Save | Export

Item Parameter Estimation of the 2PL IRT Model with Fixed Ability Estimates: Choices of Ability Estimation Methods and Priors on Slopes

Peer reviewed
PDF on ERIC

Download full text

Jianbin Fu; TsungHan Ho; Xuan Tan – Practical Assessment, Research & Evaluation, 2025

Item parameter estimation using an item response theory (IRT) model with fixed ability estimates is useful in equating with small samples on anchor items. The current study explores the impact of three ability estimation methods (weighted likelihood estimation [WLE], maximum a posteriori [MAP], and posterior ability distribution estimation [PST])…

Descriptors: Item Response Theory, Test Items, Computation, Equated Scores

Exploration of Latent Structure in Test Revision and Review Log Data

Peer reviewed

Direct link

Zhang, Susu; Li, Anqi; Wang, Shiyu – Educational Measurement: Issues and Practice, 2023

In computer-based tests allowing revision and reviews, examinees' sequence of visits and answer changes to questions can be recorded. The variable-length revision log data introduce new complexities to the collected data but, at the same time, provide additional information on examinees' test-taking behavior, which can inform test development and…

Descriptors: Computer Assisted Testing, Test Construction, Test Wiseness, Test Items

Extending an Identified Four-Parameter IRT Model: The Confirmatory Set-4PNO Model

Peer reviewed

Direct link

Justin L. Kern – Journal of Educational and Behavioral Statistics, 2024

Given the frequent presence of slipping and guessing in item responses, models for the inclusion of their effects are highly important. Unfortunately, the most common model for their inclusion, the four-parameter item response theory model, potentially has severe deficiencies related to its possible unidentifiability. With this issue in mind, the…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Generalization

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Direct link

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

Bayesian Logistic Regression: A New Method to Calibrate Pretest Items in Multistage Adaptive Testing

Peer reviewed

Direct link

TsungHan Ho – Applied Measurement in Education, 2023

An operational multistage adaptive test (MST) requires the development of a large item bank and the effort to continuously replenish the item bank due to concerns about test security and validity over the long term. New items should be pretested and linked to the item bank before being used operationally. The linking item volume fluctuations in…

Descriptors: Bayesian Statistics, Regression (Statistics), Test Items, Pretesting

Comparison of Item Response Theory Ability and Item Parameters According to Classical and Bayesian Estimation Methods

Peer reviewed
PDF on ERIC

Download full text

Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024

This research aims to compare the ability and item parameter estimations of Item Response Theory according to Maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on the changes in the priori distribution type, sample size, test length, and logistics model, the ability and item…

Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation

Detecting Differential Item Functioning Using Posterior Predictive Model Checking: A Comparison of Discrepancy Statistics

Peer reviewed

Direct link

Joo, Seang-Hwane; Lee, Philseok – Journal of Educational Measurement, 2022

Abstract This study proposes a new Bayesian differential item functioning (DIF) detection method using posterior predictive model checking (PPMC). Item fit measures including infit, outfit, observed score distribution (OSD), and Q1 were considered as discrepancy statistics for the PPMC DIF methods. The performance of the PPMC DIF method was…

Descriptors: Test Items, Bayesian Statistics, Monte Carlo Methods, Prediction

Detecting Preknowledge Cheating via Innovative Measures: A Mixture Hierarchical Model for Jointly Modeling Item Responses, Response Times, and Visual Fixation Counts

Peer reviewed

Direct link

Man, Kaiwen; Harring, Jeffrey R. – Educational and Psychological Measurement, 2023

Preknowledge cheating jeopardizes the validity of inferences based on test results. Many methods have been developed to detect preknowledge cheating by jointly analyzing item responses and response times. Gaze fixations, an essential eye-tracker measure, can be utilized to help detect aberrant testing behavior with improved accuracy beyond using…

Descriptors: Cheating, Reaction Time, Test Items, Responses

On Bank Assembly and Block Selection in Multidimensional Forced-Choice Adaptive Assessments

Peer reviewed

Direct link

Kreitchmann, Rodrigo S.; Sorrel, Miguel A.; Abad, Francisco J. – Educational and Psychological Measurement, 2023

Multidimensional forced-choice (FC) questionnaires have been consistently found to reduce the effects of socially desirable responding and faking in noncognitive assessments. Although FC has been considered problematic for providing ipsative scores under the classical test theory, item response theory (IRT) models enable the estimation of…

Descriptors: Measurement Techniques, Questionnaires, Social Desirability, Adaptive Testing

Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection

Peer reviewed

Direct link

Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022

In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…

Descriptors: Standardized Tests, Test Items, Test Validity, Scores

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Direct link

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

Learning to Love LLMs for Answer Interpretation: Chain-of-Thought Prompting and the AMMORE Dataset

Peer reviewed
PDF on ERIC

Download full text

Owen Henkel; Hannah Horne-Robinson; Maria Dyshel; Greg Thompson; Ralph Abboud; Nabil Al Nahin Ch; Baptiste Moreau-Pernet; Kirk Vanacore – Journal of Learning Analytics, 2025

This paper introduces AMMORE, a new dataset of 53,000 math open-response question-answer pairs from Rori, a mathematics learning platform used by middle and high school students in several African countries. Using this dataset, we conducted two experiments to evaluate the use of large language models (LLM) for grading particularly challenging…

Descriptors: Learning Analytics, Learning Management Systems, Mathematics Instruction, Middle School Students

Using Machine Learning to Predict Bloom's Taxonomy Level for Certification Exam Items

Peer reviewed

Direct link

Mead, Alan D.; Zhou, Chenxuan – Journal of Applied Testing Technology, 2022

This study fit a Naïve Bayesian classifier to the words of exam items to predict the Bloom's taxonomy level of the items. We addressed five research questions, showing that reasonably good prediction of Bloom's level was possible, but accuracy varies across levels. In our study, performance for Level 2 was poor (Level 2 items were misclassified…

Descriptors: Artificial Intelligence, Prediction, Taxonomy, Natural Language Processing

Test Fraud: Practical Applications and Operational Considerations for the Detection of Item Preknowledge and Compromised Content with Real Data

Direct link

Ross, Linette P. – ProQuest LLC, 2022

One of the most serious forms of cheating occurs when examinees have item preknowledge and prior access to secure test material before taking an exam for the purpose of obtaining an inflated test score. Examinees that cheat and have prior knowledge of test content before testing may have an unfair advantage over examinees that do not cheat. Item…

Descriptors: Testing, Deception, Cheating, Identification

An Evaluation of Fit Indices Used in Model Selection of Dichotomous Mixture IRT Models

Peer reviewed

Direct link

Sedat Sen; Allan S. Cohen – Educational and Psychological Measurement, 2024

A Monte Carlo simulation study was conducted to compare fit indices used for detecting the correct latent class in three dichotomous mixture item response theory (IRT) models. Ten indices were considered: Akaike's information criterion (AIC), the corrected AIC (AICc), Bayesian information criterion (BIC), consistent AIC (CAIC), Draper's…

Descriptors: Goodness of Fit, Item Response Theory, Sample Size, Classification

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12

Educational and Psychological…	23
Applied Psychological…	21
Journal of Educational and…	17
Journal of Educational…	11
ProQuest LLC	10
Psychometrika	9
Applied Measurement in…	7
Grantee Submission	5
ETS Research Report Series	4
Educational Measurement:…	3
Assessment & Evaluation in…	2
International Journal of…	2
Practical Assessment,…	2
Alberta Journal of…	1
Computers & Education	1
EURASIA Journal of…	1
Early Education and…	1
Education and Information…	1
Educational Research and…	1
Educational Technology &…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Working Group…	1
Journal of Applied Testing…	1
More ▼

Mislevy, Robert J.	6
van der Linden, Wim J.	6
Sinharay, Sandip	5
Zwick, Rebecca	5
Lewis, Charles	4
Man, Kaiwen	4
Chang, Hua-Hua	3
Glas, Cees A. W.	3
Harring, Jeffrey R.	3
Huang, Hung-Yu	3
Kim, Seock-Ho	3
Reckase, Mark D.	3
Revuelta, Javier	3
Tao, Jian	3
Vos, Hans J.	3
Wang, Chun	3
Weiss, David J.	3
Berger, Martijn P. F.	2
Bradlow, Eric T.	2
Chun Wang	2
De Boeck, Paul	2
Dodd, Barbara G.	2
Fox, Jean-Paul	2
Hambleton, Ronald K.	2
More ▼