Showing all 10 results
Peer reviewed
Lang, Joseph B. – Journal of Educational and Behavioral Statistics, 2023
This article is concerned with the statistical detection of copying on multiple-choice exams. As an alternative to existing permutation- and model-based copy-detection approaches, a simple randomization p-value (RP) test is proposed. The RP test, which is based on an intuitive match-score statistic, makes no assumptions about the distribution of…
Descriptors: Identification, Cheating, Multiple Choice Tests, Item Response Theory
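A minimal sketch of the general idea, not the article's exact procedure: count the matching responses between a suspected copier and a source, then compare that match score to a reference distribution built by randomly permuting the suspect's responses. The permutation scheme and the add-one p-value correction here are illustrative assumptions.

```python
import numpy as np

def match_score(a, b):
    """Number of items on which two response vectors agree."""
    return int(np.sum(np.asarray(a) == np.asarray(b)))

def randomization_p_value(suspect, source, n_perm=10_000, seed=0):
    """Hypothetical randomization test: permute the suspect's responses
    across items to build a reference distribution for the match score.
    (The RP test in the article may randomize differently.)"""
    rng = np.random.default_rng(seed)
    observed = match_score(suspect, source)
    suspect = np.asarray(suspect)
    count = 0
    for _ in range(n_perm):
        if match_score(rng.permutation(suspect), source) >= observed:
            count += 1
    # add-one correction keeps the p-value strictly positive
    return (count + 1) / (n_perm + 1)

# toy usage: 20 items, response options coded 0-3, partial copying
rng = np.random.default_rng(1)
source = rng.integers(0, 4, size=20)
suspect = source.copy()
suspect[:5] = rng.integers(0, 4, size=5)
print(randomization_p_value(suspect, source))
```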
Peer reviewed
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2022
Takers of educational tests often receive proficiency levels instead of or in addition to scaled scores. For example, proficiency levels are reported for the Advanced Placement (AP®) and U.S. Medical Licensing examinations. Technical difficulties and other unforeseen events occasionally lead to missing item scores and hence to incomplete data on…
Descriptors: Computation, Data Analysis, Educational Testing, Accuracy
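For context, proficiency-level reporting classifies a scaled score against fixed cut scores. A toy sketch with hypothetical cuts and labels (actual AP and USMLE standards are set through standard-setting studies and differ from these):

```python
import numpy as np

# hypothetical cut scores and labels, for illustration only
cuts = [450, 550, 650]
levels = ["below basic", "basic", "proficient", "advanced"]

def proficiency_level(scaled_score):
    """Classify a scaled score into a proficiency level."""
    return levels[int(np.digitize(scaled_score, cuts))]

print(proficiency_level(612))  # -> "proficient"
```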
Peer reviewed
Lee, Daniel Y.; Harring, Jeffrey R. – Journal of Educational and Behavioral Statistics, 2023
A Monte Carlo simulation was performed to compare methods for handling missing data in growth mixture models. The methods considered in the current study were (a) a fully Bayesian approach using a Gibbs sampler, (b) full information maximum likelihood using the expectation-maximization algorithm, (c) multiple imputation, (d) a two-stage multiple…
Descriptors: Monte Carlo Methods, Research Problems, Statistical Inference, Bayesian Statistics
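The multiple-imputation variants compared in the study all end by pooling estimates across completed data sets. A self-contained sketch of Rubin's pooling rules, with made-up imputation-specific estimates (not results from the article):

```python
import numpy as np

def pool_rubin(estimates, variances):
    """Pool m imputation-specific estimates via Rubin's rules.
    estimates, variances: length-m arrays of point estimates and
    their squared standard errors from each completed data set."""
    estimates = np.asarray(estimates, float)
    variances = np.asarray(variances, float)
    m = len(estimates)
    q_bar = estimates.mean()            # pooled point estimate
    u_bar = variances.mean()            # within-imputation variance
    b = estimates.var(ddof=1)           # between-imputation variance
    t = u_bar + (1 + 1 / m) * b         # total variance
    return q_bar, np.sqrt(t)

est, se = pool_rubin([0.52, 0.48, 0.55, 0.50, 0.49],
                     [0.010, 0.012, 0.011, 0.009, 0.010])
print(f"pooled estimate {est:.3f}, SE {se:.3f}")
```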
Peer reviewed
Köhler, Carmen; Robitzsch, Alexander; Hartig, Johannes – Journal of Educational and Behavioral Statistics, 2020
Testing whether items fit the assumptions of an item response theory model is an important step in evaluating a test. In the literature, numerous item fit statistics exist, many of which show severe limitations. The current study investigates the root mean squared deviation (RMSD) item fit statistic, which is used for evaluating item fit in…
Descriptors: Test Items, Goodness of Fit, Statistics, Bias
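The RMSD statistic summarizes the discrepancy between an item's observed and model-implied response functions, weighted by the ability distribution. A rough sketch under that generic definition; operational versions (e.g., in large-scale assessments) differ in how the "observed" curve is estimated:

```python
import numpy as np

def rmsd_item_fit(theta_grid, density, p_observed, p_model):
    """RMSD between observed and model-implied item response
    functions, weighted by the ability density at quadrature points."""
    w = np.asarray(density) / np.sum(density)
    diff = np.asarray(p_observed) - np.asarray(p_model)
    return float(np.sqrt(np.sum(w * diff ** 2)))

# toy example: a fitted 2PL curve vs. a "pseudo-observed" curve
theta = np.linspace(-4, 4, 41)
dens = np.exp(-0.5 * theta ** 2)                    # N(0,1) weights
p_model = 1 / (1 + np.exp(-1.2 * (theta - 0.3)))    # fitted item
p_obs = 1 / (1 + np.exp(-0.9 * (theta - 0.1)))      # misfitting data
print(rmsd_item_fit(theta, dens, p_obs, p_model))
```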
Peer reviewed
Joshua B. Gilbert; Luke W. Miratrix; Mridul Joshi; Benjamin W. Domingue – Journal of Educational and Behavioral Statistics, 2025
Analyzing heterogeneous treatment effects (HTEs) plays a crucial role in understanding the impacts of educational interventions. A standard practice for HTE analysis is to examine interactions between treatment status and preintervention participant characteristics, such as pretest scores, to identify how different groups respond to treatment.…
Descriptors: Causal Models, Item Response Theory, Statistical Inference, Psychometrics
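The standard interaction practice the abstract describes fits in a few lines: regress the outcome on treatment, the pretest, and their product, and read the heterogeneity off the interaction coefficient. Data and coefficients below are simulated for illustration, not taken from the article.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# simulate a treatment effect that grows with the pretest score
rng = np.random.default_rng(0)
n = 500
pretest = rng.normal(0, 1, n)
treat = rng.integers(0, 2, n)
post = (0.5 * treat + 0.8 * pretest
        + 0.3 * treat * pretest + rng.normal(0, 1, n))
df = pd.DataFrame({"post": post, "treat": treat, "pretest": pretest})

# treat:pretest captures the heterogeneous treatment effect
fit = smf.ols("post ~ treat * pretest", data=df).fit()
print(fit.params[["treat", "treat:pretest"]])
```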
Peer reviewed
Kang, Hyeon-Ah; Zheng, Yi; Chang, Hua-Hua – Journal of Educational and Behavioral Statistics, 2020
With the widespread use of computers in modern assessment, online calibration has become increasingly popular as a way of replenishing an item pool. The present study discusses online calibration strategies for a joint model of responses and response times. The study proposes likelihood inference methods for item parameter estimation and evaluates…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Response Theory, Reaction Time
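A sketch of what joint calibration of a single pretest item might look like under a van der Linden-style hierarchical model: a two-parameter logistic model for responses plus a lognormal model for response times, with person parameters treated as known from the operational test. The model and estimation details are assumptions for illustration, not necessarily those of the study.

```python
import numpy as np
from scipy.optimize import minimize

def joint_negll(params, x, logt, theta, tau):
    """Negative log-likelihood for one pretest item: 2PL for the
    responses, lognormal for the response times, with person
    parameters (theta, tau) treated as known."""
    a, b, alpha, beta = params
    p = 1 / (1 + np.exp(-a * (theta - b)))
    ll_resp = np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))
    mu = beta - tau                  # time intensity minus person speed
    ll_time = np.sum(np.log(alpha) - 0.5 * np.log(2 * np.pi)
                     - 0.5 * (alpha * (logt - mu)) ** 2)
    return -(ll_resp + ll_time)

# simulate 300 examinees seeing the pretest item
rng = np.random.default_rng(3)
n = 300
theta = rng.normal(size=n)              # abilities, known operationally
tau = rng.normal(scale=0.3, size=n)     # speed parameters
a0, b0, alpha0, beta0 = 1.2, 0.4, 1.5, 0.8
x = (rng.random(n) < 1 / (1 + np.exp(-a0 * (theta - b0)))).astype(float)
logt = rng.normal(beta0 - tau, 1 / alpha0)

fit = minimize(joint_negll, x0=[1.0, 0.0, 1.0, 0.0],
               args=(x, logt, theta, tau),
               bounds=[(0.2, 3), (-3, 3), (0.2, 3), (-3, 3)])
print(fit.x)  # recovered (a, b, alpha, beta)
```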
Oranje, Andreas; Kolstad, Andrew – Journal of Educational and Behavioral Statistics, 2019
The design and psychometric methodology of the National Assessment of Educational Progress (NAEP) are constantly evolving to meet the changing interests and demands stemming from a rapidly shifting educational landscape. NAEP has been built on strong research foundations that include conducting extensive evaluations and comparisons before new…
Descriptors: National Competency Tests, Psychometrics, Statistical Analysis, Computation
Peer reviewed
Wainer, Howard – Journal of Educational and Behavioral Statistics, 2016
The usual role of a discussant is to clarify and correct the paper being discussed, but in this case, the author, Howard Wainer, generally agrees with everything David Thissen says in his essay, "Bad Questions: An Essay Involving Item Response Theory." This essay expands on David Thissen's statement that there are typically two principal…
Descriptors: Item Response Theory, Educational Assessment, Sample Size, Statistical Inference
Peer reviewed
Liu, Yang; Yang, Ji Seung – Journal of Educational and Behavioral Statistics, 2018
The uncertainty arising from item parameter estimation is often not negligible and must be accounted for when calculating latent variable (LV) scores in item response theory (IRT). It is particularly so when the calibration sample size is limited and/or the calibration IRT model is complex. In the current work, we treat two-stage IRT scoring as a…
Descriptors: Intervals, Scores, Item Response Theory, Bayesian Statistics
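One simple way to see why calibration error matters for scoring: redraw the item parameters from their estimated sampling distribution and recompute the ability estimate for each draw. The sketch below does this for a 2PL with hypothetical parameters and standard errors; it illustrates the problem rather than the interval construction the article develops.

```python
import numpy as np
from scipy.optimize import minimize_scalar

def mle_theta(resp, a, b):
    """ML ability estimate under a 2PL with fixed item parameters."""
    def negll(theta):
        p = 1 / (1 + np.exp(-a * (theta - b)))
        return -np.sum(resp * np.log(p) + (1 - resp) * np.log(1 - p))
    return minimize_scalar(negll, bounds=(-4, 4), method="bounded").x

# hypothetical calibrated parameters and their standard errors
a_hat = np.array([1.0, 1.4, 0.8, 1.2])
a_se = np.full(4, 0.15)
b_hat = np.array([-0.5, 0.0, 0.7, 1.2])
b_se = np.full(4, 0.20)
resp = np.array([1, 1, 0, 0])

rng = np.random.default_rng(0)
draws = [mle_theta(resp, rng.normal(a_hat, a_se), rng.normal(b_hat, b_se))
         for _ in range(500)]
# spread across parameter draws reflects calibration uncertainty alone
print(np.percentile(draws, [2.5, 97.5]))
```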
Peer reviewed
Fox, J.-P.; Wyrick, Cheryl – Journal of Educational and Behavioral Statistics, 2008
The randomized response technique randomizes individual item responses (the true item responses) before they are observed, so that only so-called randomized item responses are seen by the researcher. A relationship is specified between the randomized item response data and the true item response data. True item response data are modeled with a (non)linear…
Descriptors: Item Response Theory, Models, Markov Processes, Monte Carlo Methods
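For illustration, a forced-response design, one common randomized response scheme, links the observed and true "yes" rates through a simple moment equation, which the sketch below inverts. The design parameters here are hypothetical.

```python
import numpy as np

def estimate_true_proportion(responses, p_truth, p_forced_yes):
    """Moment estimator for the true 'yes' rate under a forced-response
    design: with probability p_truth the respondent answers truthfully,
    otherwise answers 'yes' with probability p_forced_yes, so that
        P(observed yes) = p_truth * pi + (1 - p_truth) * p_forced_yes."""
    lam = np.mean(responses)
    return (lam - (1 - p_truth) * p_forced_yes) / p_truth

# simulate 5,000 randomized responses with a true rate of 0.30
rng = np.random.default_rng(2)
pi_true, p_truth, p_forced = 0.30, 0.8, 0.5
truthful = rng.random(5000) < p_truth
true_ans = rng.random(5000) < pi_true
forced = rng.random(5000) < p_forced
obs = np.where(truthful, true_ans, forced)
print(estimate_true_proportion(obs, p_truth, p_forced))
```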