ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	7

Descriptor

Item Response Theory	12
Probability	12
Statistical Distributions	12
Computation	4
Equations (Mathematics)	4
Error of Measurement	4
Goodness of Fit	4
Maximum Likelihood Statistics	4
Scores	4
Simulation	4
Ability	3
Bayesian Statistics	3
Mathematical Models	3
Models	3
Comparative Analysis	2
Estimation (Mathematics)	2
Measurement Techniques	2
Statistical Analysis	2
Test Construction	2
Test Items	2
Academic Achievement	1
Access to Information	1
Cheating	1
Classification	1
Educational Research	1

Source

Journal of Educational and…	3
Applied Psychological…	2
ETS Research Report Series	1
Grantee Submission	1
International Journal of…	1
National Center for Research…	1
Teaching Statistics: An…	1

Publication Type

Journal Articles	8
Reports - Evaluative	6
Reports - Research	5
Speeches/Meeting Papers	2
Reports - Descriptive	1

Assessments and Surveys

Work Keys (ACT)

Showing all 12 results

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

An Improved Inferential Procedure to Evaluate Item Discriminations in a Conditional Maximum Likelihood Framework

Peer reviewed

Clemens Draxler; Andreas Kurz; Can Gürer; Jan Philipp Nolte – Journal of Educational and Behavioral Statistics, 2024

A modified and improved inductive inferential approach to evaluate item discriminations in a conditional maximum likelihood and Rasch modeling framework is suggested. The new approach involves the derivation of four hypothesis tests. It implies a linear restriction of the assumed set of probability distributions in the classical approach that…

Descriptors: Inferences, Test Items, Item Analysis, Maximum Likelihood Statistics

Investigating Constructed-Response Scoring over Time: The Effects of Study Design on Trend Rescore Statistics. Research Report. ETS RR-22-15

Peer reviewed

Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022

When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…

Descriptors: Item Response Theory, Test Construction, Scoring, Testing

Computation of the Response Similarity Index M4 in R under the Dichotomous and Nominal Item Response Models

Peer reviewed

Zopluoglu, Cengiz – International Journal of Assessment Tools in Education, 2019

Unusual response similarity among test takers may occur in testing data and be an indicator of potential test fraud (e.g., examinees copy responses from other examinees, send text messages or pre-arranged signals among themselves for the correct response, item pre-knowledge). One index to measure the degree of similarity between two response…

Descriptors: Item Response Theory, Computation, Cheating, Measurement Techniques

Summed Score Likelihood Based Indices for Testing Latent Variable Distribution Fit in Item Response Theory

Peer reviewed

Li, Zhen; Cai, Li – Grantee Submission, 2017

In standard item response theory (IRT) applications, the latent variable is typically assumed to be normally distributed. If the normality assumption is violated, the item parameter estimates can become biased. Summed score likelihood based statistics may be useful for testing latent variable distribution fit. We develop Satorra-Bentler type…

Descriptors: Scores, Goodness of Fit, Statistical Distributions, Item Response Theory

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the l_z statistic and the χ² statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

A New Statistic for Evaluating Item Response Theory Models for Ordinal Data. CRESST Report 839

Cai, Li; Monroe, Scott – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014

We propose a new limited-information goodness of fit test statistic C_2 for ordinal IRT models. The construction of the new statistic lies formally between the M_2 statistic of Maydeu-Olivares and Joe (2006), which utilizes first and second order marginal probabilities, and the M*_2 statistic of Cai and Hansen…

Descriptors: Item Response Theory, Models, Goodness of Fit, Probability

A Lower Bound for the Most Deviant Z Score

Peer reviewed

Hayes, Kevin – Teaching Statistics: An International Journal for Teachers, 2004

This article demonstrates that the lower bound for the most deviant Z score and the upper bound for the sample standard deviation are attained simultaneously.

Descriptors: Statistical Analysis, Scores, Item Response Theory, Probability

The Use of Prior Distributions in Marginalized Bayesian Item Parameter Estimation: A Didactic.

Peer reviewed

Harwell, Michael R.; Baker, Frank B. – Applied Psychological Measurement, 1991

Previous work on the mathematical and implementation details of the marginalized maximum likelihood estimation procedure is extended to encompass the marginalized Bayesian procedure for estimating item parameters of R. J. Mislevy (1986) and to communicate this procedure to users of the BILOG computer program. (SLD)

Descriptors: Bayesian Statistics, Equations (Mathematics), Estimation (Mathematics), Item Response Theory

Conditional Standard Errors, Reliability and Decision Consistency of Performance Levels Using Polytomous IRT.

Wang, Tianyou; And Others – 1996

M. J. Kolen, B. A. Hanson, and R. L. Brennan (1992) presented a procedure for assessing the conditional standard error of measurement (CSEM) of scale scores using a strong true-score model. They also investigated the ways of using nonlinear transformation from number-correct raw score to scale score to equalize the conditional standard error along…

Descriptors: Ability, Classification, Error of Measurement, Goodness of Fit

An Investigation of Hierarchical Bayes Procedures in Item Response Theory.

Kim, Seock-Ho; And Others – 1992

Hierarchical Bayes procedures were compared for estimating item and ability parameters in item response theory. Simulated data sets from the two-parameter logistic model were analyzed using three different hierarchical Bayes procedures: (1) the joint Bayesian with known hyperparameters (JB1); (2) the joint Bayesian with information hyperpriors…

Descriptors: Ability, Bayesian Statistics, Comparative Analysis, Equations (Mathematics)

A Conceptual Analysis of Differential Item Functioning in Terms of a Multidimensional Item Response Model.

Peer reviewed

Camilli, Gregory – Applied Psychological Measurement, 1992

A mathematical model is proposed to describe how group differences in distributions of abilities, which are distinct from the target ability, influence the probability of a correct item response. In the multidimensional approach, differential item functioning is considered a function of the educational histories of the examinees. (SLD)

Descriptors: Ability, Comparative Analysis, Equations (Mathematics), Factor Analysis

Author

Cai, Li	2
Andreas Kurz	1
Baker, Frank B.	1
Camilli, Gregory	1
Can Gürer	1
Clemens Draxler	1
Donoghue, John R.	1
Harwell, Michael R.	1
Hayes, Kevin	1
Hess, Melinda R.	1
Jan Philipp Nolte	1
Kim, Seock-Ho	1
Li, Zhen	1
McClellan, Catherine A.	1
Monroe, Scott	1
Sinharay, Sandip	1
Wallin, Gabriel	1
Wang, Tianyou	1
Wiberg, Marie	1
Zopluoglu, Cengiz	1