ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	20

Descriptor

Models	25
Item Response Theory	13
Goodness of Fit	12
Bayesian Statistics	7
Psychometrics	7
Statistical Analysis	7
Test Items	7
Measurement Techniques	5
Regression (Statistics)	5
Simulation	5
Computation	4
Correlation	4
Educational Testing	4
National Competency Tests	4
Scores	4
Scoring	4
Comparative Analysis	3
Diagnostic Tests	3
Educational Assessment	3
Grade 8	3
Item Analysis	3
Mathematics Tests	3
Methods	3
Reliability	3
Statistical Distributions	3
More ▼

Source

ETS Research Report Series	5
Journal of Educational and…	5
Applied Psychological…	2
Educational Measurement:…	2
Journal of Educational…	2
Psychometrika	2
Educational Testing Service	1
Educational and Psychological…	1
Grantee Submission	1
International Journal of…	1
Measurement:…	1
Multivariate Behavioral…	1
More ▼

Author

Sinharay, Sandip	25
Haberman, Shelby J.	7
Johnson, Matthew S.	5
von Davier, Matthias	4
Holland, Paul W.	2
Almond, Russell G.	1
Bejar, Isaac I.	1
Guo, Zhumei	1
Johnson, Matthew	1
Levy, Roy	1
Mislevy, Robert J.	1
Puhan, Gautam	1
Steinhauer, Eric W.	1
Stern, Hal S.	1
Sweeney, Sandra M.	1
Veldkamp, Bernard P.	1
Williamson, David M.	1
More ▼

Publication Type

Journal Articles	22
Reports - Research	17
Reports - Evaluative	4
Reports - Descriptive	3
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Middle Schools	4
Grade 8	3
Elementary Education	2
Grade 4	2
Higher Education	2
Junior High Schools	2
Secondary Education	2
Grade 12	1
High Schools	1
Postsecondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	3
Graduate Record Examinations	2

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

An Investigation of the Nature and Consequence of the Relationship between IRT Difficulty and Discrimination

Peer reviewed

Direct link

Sweeney, Sandra M.; Sinharay, Sandip; Johnson, Matthew S.; Steinhauer, Eric W. – Educational Measurement: Issues and Practice, 2022

The focus of this paper is on the empirical relationship between item difficulty and item discrimination. Two studies--an empirical investigation and a simulation study--were conducted to examine the association between item difficulty and item discrimination under classical test theory and item response theory (IRT), and the effects of the…

Descriptors: Correlation, Item Response Theory, Item Analysis, Difficulty Level

The Reliability of the Posterior Probability of Skill Attainment in Diagnostic Classification Models

Peer reviewed

Direct link

Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020

One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…

Descriptors: Reliability, Probability, Skill Development, Classification

A New Person-Fit Statistic for the Lognormal Model for Response Times

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational Measurement, 2018

Response-time models are of increasing interest in educational and psychological testing. This article focuses on the lognormal model for response times, which is one of the most popular response-time models, and suggests a simple person-fit statistic for the model. The distribution of the statistic under the null hypothesis of no misfit is proved…

Descriptors: Reaction Time, Educational Testing, Psychological Testing, Models

A New Person-Fit Statistic for the Lognormal Model for Response Times

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip – Grantee Submission, 2018

Response-time models are of increasing interest in educational and psychological testing. This paper focuses on the lognormal model for response times (van der Linden, 2006), which is one of the most popular response-time models, and suggests a simple person-fit statistic for the model. The distribution of the statistic under the null hypothesis…

Descriptors: Reaction Time, Educational Testing, Psychological Testing, Models

How Often Is the Misfit of Item Response Theory Models Practically Significant?

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – Educational Measurement: Issues and Practice, 2014

Standard 3.9 of the Standards for Educational and Psychological Testing ([, 1999]) demands evidence of model fit when item response theory (IRT) models are employed to data from tests. Hambleton and Han ([Hambleton, R. K., 2005]) and Sinharay ([Sinharay, S., 2005]) recommended the assessment of practical significance of misfit of IRT models, but…

Descriptors: Item Response Theory, Goodness of Fit, Models, Tests

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

Peer reviewed

Direct link

Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Multivariate Behavioral Research, 2010

Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…

Descriptors: Educational Testing, Scores, Reports, Psychometrics

The Application of the Cumulative Logistic Regression Model to Automated Essay Scoring

Peer reviewed

Direct link

Haberman, Shelby J.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2010

Most automated essay scoring programs use a linear regression model to predict an essay score from several essay features. This article applied a cumulative logit model instead of the linear regression model to automated essay scoring. Comparison of the performances of the linear regression model and the cumulative logit model was performed on a…

Descriptors: Scoring, Regression (Statistics), Essays, Computer Software

Reporting of Subscores Using Multidimensional Item Response Theory

Peer reviewed

Direct link

Haberman, Shelby J.; Sinharay, Sandip – Psychometrika, 2010

Recently, there has been increasing interest in reporting subscores. This paper examines reporting of subscores using multidimensional item response theory (MIRT) models (e.g., Reckase in "Appl. Psychol. Meas." 21:25-36, 1997; C.R. Rao and S. Sinharay (Eds), "Handbook of Statistics, vol. 26," pp. 607-642, North-Holland, Amsterdam, 2007; Beguin &…

Descriptors: Item Response Theory, Psychometrics, Statistical Analysis, Scores

Posterior Predictive Model Checking for Multidimensionality in Item Response Theory

Peer reviewed

Direct link

Levy, Roy; Mislevy, Robert J.; Sinharay, Sandip – Applied Psychological Measurement, 2009

If data exhibit multidimensionality, key conditional independence assumptions of unidimensional models do not hold. The current work pursues posterior predictive model checking, a flexible family of model-checking procedures, as a tool for criticizing models due to unaccounted for dimensions in the context of item response theory. Factors…

Descriptors: Item Response Theory, Models, Methods, Simulation

Assessing Fit of Latent Regression Models. Research Report. ETS RR-09-50

Peer reviewed
PDF on ERIC

Download full text

Sinharay, Sandip; Guo, Zhumei; von Davier, Matthias; Veldkamp, Bernard P. – ETS Research Report Series, 2009

The reporting methods used in large-scale educational assessments such as the National Assessment of Educational Progress (NAEP) rely on a "latent regression model". There is a lack of research on the assessment of fit of latent regression models. This paper suggests a simulation-based model-fit technique to assess the fit of such…

Descriptors: Regression (Statistics), Models, Goodness of Fit, National Competency Tests

Stochastic Approximation Methods for Latent Regression Item Response Models

Peer reviewed

Direct link

von Davier, Matthias; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2010

This article presents an application of a stochastic approximation expectation maximization (EM) algorithm using a Metropolis-Hastings (MH) sampler to estimate the parameters of an item response latent regression model. Latent regression item response models are extensions of item response theory (IRT) to a latent variable model with covariates…

Descriptors: Item Response Theory, Statistical Analysis, Regression (Statistics), Models

Use of Item Models in a Large-Scale Admissions Test: A Case Study

Peer reviewed

Direct link

Sinharay, Sandip; Johnson, Matthew S. – International Journal of Testing, 2008

"Item models" (LaDuca, Staples, Templeton, & Holzman, 1986) are classes from which it is possible to generate items that are equivalent/isomorphic to other items from the same model (e.g., Bejar, 1996, 2002). They have the potential to produce large numbers of high-quality items at reduced cost. This article introduces data from an…

Descriptors: College Entrance Examinations, Case Studies, Test Items, Models

Stochastic Approximation Methods for Latent Regression Item Response Models. Research Report. ETS RR-09-09

Download full text

von Davier, Matthias; Sinharay, Sandip – Educational Testing Service, 2009

This paper presents an application of a stochastic approximation EM-algorithm using a Metropolis-Hastings sampler to estimate the parameters of an item response latent regression model. Latent regression models are extensions of item response theory (IRT) to a 2-level latent variable model in which covariates serve as predictors of the…

Descriptors: Item Response Theory, Regression (Statistics), Models, Methods

Limits on Log Odds Ratios for Unidimensional Item Response Theory Models

Peer reviewed

Direct link

Haberman, Shelby J.; Holland, Paul W.; Sinharay, Sandip – Psychometrika, 2007

Bounds are established for log odds ratios (log cross-product ratios) involving pairs of items for item response models. First, expressions for bounds on log odds ratios are provided for one-dimensional item response models in general. Then, explicit bounds are obtained for the Rasch model and the two-parameter logistic (2PL) model. Results are…

Descriptors: Goodness of Fit, Item Response Theory, Research Methodology, Measurement Techniques

Previous Page | Next Page »

Pages: 1 | 2