ERIC - Search Results

Showing 1 to 15 of 59 results

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Wallmark, Joakim; Ramsay, James O.; Li, Juan; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item and a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Using Item Scores and Distractors to Detect Item Compromise and Preknowledge

Peer reviewed

Gorney, Kylie; Wollack, James A.; Sinharay, Sandip; Eckerly, Carol – Journal of Educational and Behavioral Statistics, 2023

Any time examinees have had access to items and/or answers prior to taking a test, the fairness of the test and validity of test score interpretations are threatened. Therefore, there is a high demand for procedures to detect both compromised items (CI) and examinees with preknowledge (EWP). In this article, we develop a procedure that uses item…

Descriptors: Scores, Test Validity, Test Items, Prior Learning

Using Cumulative Sum Control Chart to Detect Aberrant Responses in Educational Assessments

Peer reviewed

Wan, Siyu; Keller, Lisa A. – Practical Assessment, Research & Evaluation, 2023

Statistical process control (SPC) charts have been widely used in the field of educational measurement. The cumulative sum (CUSUM) is an established SPC method to detect aberrant responses for educational assessments. There are many studies that investigated the performance of CUSUM in different test settings. This paper describes the CUSUM…

Descriptors: Visual Aids, Educational Assessment, Evaluation Methods, Item Response Theory

The Effect of Person Misfit on Item Parameter Estimation and Classification Accuracy: A Simulation Study

Peer reviewed

Mousavi, Amin; Cui, Ying – Education Sciences, 2020

Often, important decisions regarding accountability and placement of students in performance categories are made on the basis of test scores; therefore, it is important to evaluate the validity of the inferences derived from test results. One of the threats to the validity of such inferences is aberrant responding. Several…

Descriptors: Student Evaluation, Educational Testing, Psychological Testing, Item Response Theory

Monte Carlo Simulation in Item Response Theory Applications Using SAS

Peer reviewed

Ames, Allison J.; Leventhal, Brian C.; Ezike, Nnamdi C. – Measurement: Interdisciplinary Research and Perspectives, 2020

Data simulation and Monte Carlo simulation studies are important skills for researchers and practitioners of educational and psychological measurement, but there are few resources on the topic specific to item response theory. Even fewer resources exist on the statistical software techniques to implement simulation studies. This article presents…

Descriptors: Monte Carlo Methods, Item Response Theory, Simulation, Computer Software

Sensitivity of the RMSD for Detecting Item-Level Misfit in Low-Performing Countries

Peer reviewed

Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020

Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…

Descriptors: Test Items, Goodness of Fit, Probability, Accuracy

Testing Latent Variable Distribution Fit in IRT Using Posterior Residuals

Peer reviewed

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2021

This research proposes a new statistic for testing latent variable distribution fit for unidimensional item response theory (IRT) models. If the typical assumption of normality is violated, then item parameter estimates will be biased, and dependent quantities such as IRT score estimates will be adversely affected. The proposed statistic compares…

Descriptors: Item Response Theory, Simulation, Scores, Comparative Analysis

Extension of Caution Indices to Mixed-Format Tests

Peer reviewed

Sinharay, Sandip – Grantee Submission, 2018

Tatsuoka (1984) suggested several extended caution indices and their standardized versions that have been used as person-fit statistics by researchers such as Drasgow, Levine, and McLaughlin (1987), Glas and Meijer (2003), and Molenaar and Hoijtink (1990). However, these indices are only defined for tests with dichotomous items. This paper extends…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Error Patterns

Do Adaptive Representations of the Item-Position Effect in APM Improve Model Fit? A Simulation Study

Peer reviewed

Zeller, Florian; Krampen, Dorothea; Reiß, Siegbert; Schweizer, Karl – Educational and Psychological Measurement, 2017

The item-position effect describes how an item's position within a test, that is, the number of previously completed items, affects the response to this item. Previously, this effect was represented by constraints reflecting simple courses, for example, a linear increase. Due to the inflexibility of these representations our aim was to examine…

Descriptors: Goodness of Fit, Simulation, Factor Analysis, Intelligence Tests

Polytomous Rasch Models in Counseling Assessment

Peer reviewed

Willse, John T. – Measurement and Evaluation in Counseling and Development, 2017

This article provides a brief introduction to the Rasch model. Motivation for using Rasch analyses is provided. Important Rasch model concepts and key aspects of result interpretation are introduced, with major points reinforced using a simulation demonstration. Concrete guidelines are provided regarding sample size and the evaluation of items.

Descriptors: Item Response Theory, Test Results, Test Interpretation, Simulation

Identifying Aberrant Responding: Use of Multiple Measures

Steinkamp, Susan Christa – ProQuest LLC, 2017

For test scores that rely on the accurate estimation of ability via an IRT model, their use and interpretation are dependent upon the assumption that the IRT model fits the data. Examinees who do not put forth full effort in answering test questions, have prior knowledge of test content, or do not approach a test with the intent of answering…

Descriptors: Test Items, Item Response Theory, Scores, Test Wiseness

Partially Compensatory Multidimensional Item Response Theory Models: Two Alternate Model Forms

Peer reviewed

DeMars, Christine E. – Educational and Psychological Measurement, 2016

Partially compensatory models may capture the cognitive skills needed to answer test items more realistically than compensatory models, but estimating the model parameters may be a challenge. Data were simulated to follow two different partially compensatory models, a model with an interaction term and a product model. The model parameters were…

Descriptors: Item Response Theory, Models, Thinking Skills, Test Items

Effect Size Measures for Differential Item Functioning in a Multidimensional IRT Model

Peer reviewed

Suh, Youngsuk – Journal of Educational Measurement, 2016

This study adapted an effect size measure used for studying differential item functioning (DIF) in unidimensional tests and extended the measure to multidimensional tests. Two effect size measures were considered in a multidimensional item response theory model: signed weighted P-difference and unsigned weighted P-difference. The performance of…

Descriptors: Effect Size, Goodness of Fit, Statistical Analysis, Statistical Significance

Item Response Data Analysis Using Stata Item Response Theory Package

Peer reviewed

Yang, Ji Seung; Zheng, Xiaying – Journal of Educational and Behavioral Statistics, 2018

The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from the Programme for International Student Assessment, we review the IRT package from…

Descriptors: Item Response Theory, Item Analysis, Computer Software, Statistical Analysis

Modeling Information Accumulation in Psychological Tests Using Item Response Times

Peer reviewed

Ranger, Jochen; Kuhn, Jörg-Tobias – Journal of Educational and Behavioral Statistics, 2015

In this article, a latent trait model is proposed for the response times in psychological tests. The latent trait model is based on the linear transformation model and subsumes popular models from survival analysis, like the proportional hazards model and the proportional odds model. The core of the model is the assumption that an unspecified monotone…

Descriptors: Psychological Testing, Reaction Time, Statistical Analysis, Models
