ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	21

Descriptor

Difficulty Level	34
Monte Carlo Methods	34
Test Items	34
Item Response Theory	24
Comparative Analysis	13
Models	9
Correlation	8
Error of Measurement	8
Accuracy	7
Computation	7
Item Analysis	7
Markov Processes	7
Mathematical Models	7
Sample Size	7
Simulation	7
Statistical Bias	7
Test Construction	7
Test Length	7
Bayesian Statistics	6
Factor Analysis	6
Maximum Likelihood Statistics	6
Test Bias	6
Mathematics Tests	5
Test Reliability	5
Adaptive Testing	4
More ▼

Source

Educational and Psychological…	9
Journal of Educational…	4
Applied Measurement in…	2
Applied Psychological…	2
Asia Pacific Education Review	1
ETS Research Report Series	1
Educational Measurement:…	1
European Journal of…	1
Hacettepe University Journal…	1
Journal of Educational and…	1
Practical Assessment,…	1
ProQuest LLC	1
More ▼

Publication Type

Reports - Research	27
Journal Articles	24
Speeches/Meeting Papers	7
Reports - Evaluative	6
Dissertations/Theses -…	1

Education Level

Higher Education	2
Postsecondary Education	2
Grade 8	1

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
ACT Assessment	1
SAT (College Admission Test)	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 34 results Save | Export

IRT Models for Learning with Item-Specific Learning Parameters

Peer reviewed

Direct link

Yu, Albert; Douglas, Jeffrey A. – Journal of Educational and Behavioral Statistics, 2023

We propose a new item response theory growth model with item-specific learning parameters, or ISLP, and two variations of this model. In the ISLP model, either items or blocks of items have their own learning parameters. This model may be used to improve the efficiency of learning in a formative assessment. We show ways that the ISLP model's…

Descriptors: Item Response Theory, Learning, Markov Processes, Monte Carlo Methods

A Special Case of Brennan's Index for Tests That Aim to Select a Limited Number of Students: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Arikan, Serkan; Aybek, Eren Can – Educational Measurement: Issues and Practice, 2022

Many scholars compared various item discrimination indices in real or simulated data. Item discrimination indices, such as item-total correlation, item-rest correlation, and IRT item discrimination parameter, provide information about individual differences among all participants. However, there are tests that aim to select a very limited number…

Descriptors: Monte Carlo Methods, Item Analysis, Correlation, Individual Differences

Differential Item Functioning Effect Size from the Multigroup Confirmatory Factor Analysis for a Meta-Analysis: A Simulation Study

Peer reviewed

Direct link

Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021

This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…

Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods

A Comparison of Estimation Techniques for IRT Models with Small Samples

Peer reviewed

Direct link

Finch, Holmes; French, Brian F. – Applied Measurement in Education, 2019

The usefulness of item response theory (IRT) models depends, in large part, on the accuracy of item and person parameter estimates. For the standard 3 parameter logistic model, for example, these parameters include the item parameters of difficulty, discrimination, and pseudo-chance, as well as the person ability parameter. Several factors impact…

Descriptors: Item Response Theory, Accuracy, Test Items, Difficulty Level

Assessing the Performance of Classical Test Theory Item Discrimination Estimators in Monte Carlo Simulations

Peer reviewed

Direct link

Bazaldua, Diego A. Luna; Lee, Young-Sun; Keller, Bryan; Fellers, Lauren – Asia Pacific Education Review, 2017

The performance of various classical test theory (CTT) item discrimination estimators has been compared in the literature using both empirical and simulated data, resulting in mixed results regarding the preference of some discrimination estimators over others. This study analyzes the performance of various item discrimination estimators in CTT:…

Descriptors: Test Items, Monte Carlo Methods, Item Response Theory, Correlation

Rasch Mixture Models for DIF Detection: A Comparison of Old and New Score Specifications

Peer reviewed

Direct link

Frick, Hannah; Strobl, Carolin; Zeileis, Achim – Educational and Psychological Measurement, 2015

Rasch mixture models can be a useful tool when checking the assumption of measurement invariance for a single Rasch model. They provide advantages compared to manifest differential item functioning (DIF) tests when the DIF groups are only weakly correlated with the manifest covariates available. Unlike in single Rasch models, estimation of Rasch…

Descriptors: Item Response Theory, Test Bias, Comparative Analysis, Scores

Reweighting Data in the Spirit of Tukey: Using Bayesian Posterior Probabilities as Rasch Residuals for Studying Misfit

Peer reviewed

Direct link

Dardick, William R.; Mislevy, Robert J. – Educational and Psychological Measurement, 2016

A new variant of the iterative "data = fit + residual" data-analytical approach described by Mosteller and Tukey is proposed and implemented in the context of item response theory psychometric models. Posterior probabilities from a Bayesian mixture model of a Rasch item response theory model and an unscalable latent class are expressed…

Descriptors: Bayesian Statistics, Probability, Data Analysis, Item Response Theory

Rasch Model Parameter Estimation in the Presence of a Nonnormal Latent Trait Using a Nonparametric Bayesian Approach

Peer reviewed

Direct link

Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016

Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…

Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics

Accuracy and Variability of Item Parameter Estimates from Marginal Maximum a Posteriori Estimation and Bayesian Inference via Gibbs Samplers

Direct link

Wu, Yi-Fang – ProQuest LLC, 2015

Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…

Descriptors: Item Response Theory, Test Items, Accuracy, Computation

Parameter Recovery and Classification Accuracy under Conditions of Testlet Dependency: A Comparison of the Traditional 2PL, Testlet, and Bi-Factor Models

Peer reviewed

Direct link

Koziol, Natalie A. – Applied Measurement in Education, 2016

Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite their advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…

Descriptors: Classification, Accuracy, Comparative Analysis, Models

Developing an Item Bank for Homogeneous Second Order Differential Equations by Calibrated Items

Peer reviewed
PDF on ERIC

Download full text

Aminifar, Elahe; Alipour, Mohammad – European Journal of Educational Sciences, 2014

Item bank is one of the main components of adaptive tests. In this research, a test was made in order to design and calibrate items for Homogeneous Second Order Differential Equations. The items were designed according to the goal-content's table of the subject and the Bloom's taxonomy learning domain. Validity and reliability of these items was…

Descriptors: Test Items, Calculus, Mathematics Tests, Mathematics Instruction

A Comparison of Uniform DIF Effect Size Estimators under the MIMIC and Rasch Models

Peer reviewed

Direct link

Jin, Ying; Myers, Nicholas D.; Ahn, Soyeon; Penfield, Randall D. – Educational and Psychological Measurement, 2013

The Rasch model, a member of a larger group of models within item response theory, is widely used in empirical studies. Detection of uniform differential item functioning (DIF) within the Rasch model typically employs null hypothesis testing with a concomitant consideration of effect size (e.g., signed area [SA]). Parametric equivalence between…

Descriptors: Test Bias, Effect Size, Item Response Theory, Comparative Analysis

A Method for Imputing Response Options for Missing Data on Multiple-Choice Assessments

Peer reviewed

Direct link

Wolkowitz, Amanda A.; Skorupski, William P. – Educational and Psychological Measurement, 2013

When missing values are present in item response data, there are a number of ways one might impute a correct or incorrect response to a multiple-choice item. There are significantly fewer methods for imputing the actual response option an examinee may have provided if he or she had not omitted the item either purposely or accidentally. This…

Descriptors: Multiple Choice Tests, Statistical Analysis, Models, Accuracy

l[subscript z] Person-Fit Index to Identify Misfit Students with Achievement Test Data

Peer reviewed

Direct link

Seo, Dong Gi; Weiss, David J. – Educational and Psychological Measurement, 2013

The usefulness of the l[subscript z] person-fit index was investigated with achievement test data from 20 exams given to more than 3,200 college students. Results for three methods of estimating ? showed that the distributions of l[subscript z] were not consistent with its theoretical distribution, resulting in general overfit to the item response…

Descriptors: Achievement Tests, College Students, Goodness of Fit, Item Response Theory

A Multilevel Testlet Model for Dual Local Dependence

Peer reviewed

Direct link

Jiao, Hong; Kamata, Akihito; Wang, Shudong; Jin, Ying – Journal of Educational Measurement, 2012

The applications of item response theory (IRT) models assume local item independence and that examinees are independent of each other. When a representative sample for psychometric analysis is selected using a cluster sampling method in a testlet-based assessment, both local item dependence and local person dependence are likely to be induced.…

Descriptors: Item Response Theory, Test Items, Markov Processes, Monte Carlo Methods

Previous Page | Next Page »

Pages: 1 | 2 | 3

Finch, Holmes	3
Ahn, Soyeon	2
Jiao, Hong	2
Jin, Ying	2
Kamata, Akihito	2
Wang, Shudong	2
Alipour, Mohammad	1
Allen, Nancy L.	1
Aminifar, Elahe	1
Arikan, Serkan	1
Atar, Burcu	1
Aybek, Eren Can	1
Bacon, Tina P.	1
Bazaldua, Diego A. Luna	1
Carlson, James E.	1
Cohen, Jon	1
Dardick, William R.	1
Donoghue, John R.	1
Douglas, Jeffrey A.	1
Edwards, Julianne M.	1
Fellers, Lauren	1
Fifield, Steve	1
Ford, Danielle	1
French, Brian F.	1
More ▼