Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 96 |
Descriptor
Item Response Theory | 68 |
Monte Carlo Methods | 48 |
Evaluation Methods | 46 |
Models | 38 |
Computation | 34 |
Simulation | 30 |
Test Items | 22 |
Comparative Analysis | 19 |
Markov Processes | 18 |
Correlation | 16 |
Computer Software | 14 |
Source
Applied Psychological Measurement | 96 |
Publication Type
Journal Articles | 96 |
Reports - Research | 43 |
Reports - Evaluative | 39 |
Reports - Descriptive | 12 |
Information Analyses | 2 |
Education Level
Higher Education | 10 |
Adult Education | 1 |
Elementary Education | 1 |
Grade 3 | 1 |
Grade 8 | 1 |
High Schools | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Audience
Researchers | 3 |
Practitioners | 2 |
Assessments and Surveys
Law School Admission Test | 2 |
Florida Comprehensive… | 1 |
Iowa Tests of Educational… | 1 |
SAT (College Admission Test) | 1 |
Padilla, Miguel A.; Divers, Jasmin; Newton, Matthew – Applied Psychological Measurement, 2012
Three different bootstrap methods for estimating confidence intervals (CIs) for coefficient alpha were investigated. In addition, the bootstrap methods were compared with the most promising coefficient alpha CI estimation methods reported in the literature. The CI methods were assessed through a Monte Carlo simulation utilizing conditions…
Descriptors: Intervals, Monte Carlo Methods, Computation, Sampling
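As a rough illustration of the kind of bootstrap CI examined above, here is a minimal sketch of the percentile bootstrap for coefficient alpha; the data layout, replication count, and confidence level are assumptions for illustration, not the authors' simulation conditions.

```python
import numpy as np

def cronbach_alpha(items):
    """Coefficient alpha from an (n_persons x n_items) score matrix."""
    items = np.asarray(items, dtype=float)
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1).sum()
    total_var = items.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_vars / total_var)

def bootstrap_alpha_ci(items, n_boot=2000, level=0.95, seed=0):
    """Percentile bootstrap CI for alpha: resample persons with replacement."""
    rng = np.random.default_rng(seed)
    items = np.asarray(items, dtype=float)
    n = items.shape[0]
    reps = np.array([
        cronbach_alpha(items[rng.integers(0, n, size=n)])
        for _ in range(n_boot)
    ])
    lo, hi = np.percentile(reps, [(1 - level) / 2 * 100, (1 + level) / 2 * 100])
    return lo, hi
```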
Hung, Lai-Fa – Applied Psychological Measurement, 2012
Rasch used a Poisson model to analyze errors and speed in reading tests. An important property of the Poisson distribution is that the mean and variance are equal. However, in social science research, it is very common for the variance to be greater than the mean (i.e., the data are overdispersed). This study embeds the Rasch model within an…
Descriptors: Social Science Research, Markov Processes, Reading Tests, Social Sciences
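The overdispersion issue the abstract raises (variance exceeding the mean) can be checked directly from count data. A minimal sketch, with simulated counts standing in for real reading-error data:

```python
import numpy as np

rng = np.random.default_rng(1)

# Illustrative count data, not from the study:
poisson_counts = rng.poisson(lam=4.0, size=2000)                 # mean == variance by construction
overdispersed = rng.negative_binomial(n=2, p=1 / 3, size=2000)   # mean 4, variance 12

for name, y in [("Poisson", poisson_counts), ("Negative binomial", overdispersed)]:
    m, v = y.mean(), y.var(ddof=1)
    # A dispersion index near 1 is consistent with Poisson; well above 1 signals overdispersion.
    print(f"{name:17s} mean={m:.2f} var={v:.2f} dispersion={v / m:.2f}")
```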
Seybert, Jacob; Stark, Stephen – Applied Psychological Measurement, 2012
A Monte Carlo study was conducted to examine the accuracy of differential item functioning (DIF) detection using the differential functioning of items and tests (DFIT) method. Specifically, the performance of DFIT was compared using "testwide" critical values suggested by Flowers, Oshima, and Raju, based on simulations involving large numbers of…
Descriptors: Test Bias, Monte Carlo Methods, Form Classes (Languages), Simulation
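For readers unfamiliar with DFIT, a core quantity is Raju's noncompensatory DIF index (NCDIF): the mean squared gap between an item's focal- and reference-calibrated response functions, averaged over the focal group. A minimal sketch under a 2PL model with made-up parameters; the testwide critical values examined in the study are not reproduced here.

```python
import numpy as np

def p_2pl(theta, a, b):
    """2PL item response function."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def ncdif(theta_focal, a_ref, b_ref, a_foc, b_foc):
    """Mean squared difference between the focal- and reference-calibrated
    IRFs for one item, averaged over focal-group thetas."""
    diff = p_2pl(theta_focal, a_foc, b_foc) - p_2pl(theta_focal, a_ref, b_ref)
    return np.mean(diff ** 2)

# Illustrative values (assumed, not from the study): one item calibrated in both groups.
theta_focal = np.random.default_rng(2).normal(size=5000)
print(ncdif(theta_focal, a_ref=1.2, b_ref=0.0, a_foc=1.2, b_foc=0.4))
```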
Houts, Carrie R.; Edwards, Michael C. – Applied Psychological Measurement, 2013
The violation of the assumption of local independence when applying item response theory (IRT) models has been shown to have a negative impact on all estimates obtained from the given model. Numerous indices and statistics have been proposed to aid analysts in the detection of local dependence (LD). A Monte Carlo study was conducted to evaluate…
Descriptors: Item Response Theory, Psychological Evaluation, Data, Statistical Analysis
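One widely used local dependence index of the kind evaluated in such studies is Yen's Q3, the correlation matrix of item residuals after fitting an IRT model; whether it is among the statistics compared in this particular article is not stated in the snippet. A minimal sketch under an assumed 2PL calibration:

```python
import numpy as np

def p_2pl(theta, a, b):
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def q3_matrix(responses, theta_hat, a, b):
    """Yen's Q3: correlations among item residuals (observed - model-expected).
    Pairs with residual correlations far from the off-diagonal average are
    candidates for local dependence."""
    a, b = np.asarray(a, float), np.asarray(b, float)
    expected = p_2pl(theta_hat[:, None], a[None, :], b[None, :])  # persons x items
    residuals = np.asarray(responses, float) - expected
    return np.corrcoef(residuals, rowvar=False)
```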
Diao, Qi; van der Linden, Wim J. – Applied Psychological Measurement, 2013
Automated test assembly uses the methodology of mixed integer programming to select an optimal set of items from an item bank. Automated test-form generation uses the same methodology to optimally order the items and format the test form. From an optimization point of view, production of fully formatted test forms directly from the item pool using…
Descriptors: Automation, Test Construction, Test Format, Item Banks
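A toy version of the item-selection core of mixed integer programming test assembly, sketched with the PuLP solver; the pool size, information values, content labels, and constraints are invented for illustration, and the test-form formatting extension described in the article is not shown.

```python
from pulp import LpProblem, LpMaximize, LpVariable, lpSum

n_items = 200
# Fake Fisher information at a target theta and fake content labels (illustrative only).
info = [0.1 + 0.9 * ((7 * i) % 13) / 13 for i in range(n_items)]
content = ["algebra" if i % 2 else "geometry" for i in range(n_items)]

x = [LpVariable(f"x_{i}", cat="Binary") for i in range(n_items)]  # 1 if item i is selected
model = LpProblem("test_assembly", LpMaximize)
model += lpSum(info[i] * x[i] for i in range(n_items))            # objective: maximize information
model += lpSum(x) == 40                                           # fixed test length
model += lpSum(x[i] for i in range(n_items) if content[i] == "algebra") >= 15
model += lpSum(x[i] for i in range(n_items) if content[i] == "geometry") >= 15
model.solve()

selected = [i for i in range(n_items) if x[i].value() == 1]
```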
DeCarlo, Lawrence T. – Applied Psychological Measurement, 2012
In the typical application of a cognitive diagnosis model, the Q-matrix, which reflects the theory with respect to the skills indicated by the items, is assumed to be known. However, the Q-matrix is usually determined by expert judgment, and so there can be uncertainty about some of its elements. Here it is shown that this uncertainty can be…
Descriptors: Bayesian Statistics, Item Response Theory, Simulation, Models
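The role of the Q-matrix is easiest to see in the DINA model, a common cognitive diagnosis model: an examinee answers correctly with probability 1 - slip when mastering every skill the Q-matrix assigns to the item, and with the guessing probability otherwise. A minimal sketch with the Q-matrix treated as known; modeling uncertainty in its entries, the article's contribution, is not shown.

```python
import numpy as np

def dina_prob(alpha, q, guess, slip):
    """DINA item response probabilities.
    alpha: (n_persons x n_skills) 0/1 mastery profiles
    q:     (n_items x n_skills) Q-matrix, 1 if the item requires the skill
    guess, slip: (n_items,) guessing and slip parameters."""
    alpha, q = np.asarray(alpha), np.asarray(q)
    # eta = 1 only if the person masters every skill the item requires
    eta = np.all(alpha[:, None, :] >= q[None, :, :], axis=2).astype(float)
    return (1 - np.asarray(slip)) * eta + np.asarray(guess) * (1 - eta)
```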
Mao, Xiuzhen; Xin, Tao – Applied Psychological Measurement, 2013
The Monte Carlo approach, which has previously been implemented in traditional computerized adaptive testing (CAT), is applied here to cognitive diagnostic CAT to test its ability to address multiple content constraints. The performance of the Monte Carlo approach is compared with that of the modified maximum global…
Descriptors: Monte Carlo Methods, Cognitive Tests, Diagnostic Tests, Computer Assisted Testing
Brossman, Bradley G.; Lee, Won-Chan – Applied Psychological Measurement, 2013
The purpose of this research was to develop observed score and true score equating procedures to be used in conjunction with the multidimensional item response theory (MIRT) framework. Three equating procedures--two observed score procedures and one true score procedure--were created and described in detail. One observed score procedure was…
Descriptors: Equated Scores, True Scores, Item Response Theory, Mathematics Tests
Dai, Yunyun – Applied Psychological Measurement, 2013
Mixtures of item response theory (IRT) models have been proposed as a technique to explore response patterns in test data related to cognitive strategies, instructional sensitivity, and differential item functioning (DIF). Estimation proves challenging due to difficulties in identification and questions of effect size needed to recover underlying…
Descriptors: Item Response Theory, Test Bias, Computation, Bayesian Statistics
Smits, Iris A. M.; Timmerman, Marieke E.; Meijer, Rob R. – Applied Psychological Measurement, 2012
The assessment of the number of dimensions and the dimensionality structure of questionnaire data is important in scale evaluation. In this study, the authors evaluate two dimensionality assessment procedures in the context of Mokken scale analysis (MSA), using a so-called fixed lower bound. The comparative simulation study, covering various…
Descriptors: Simulation, Measures (Individuals), Program Effectiveness, Item Response Theory
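For context, Mokken scale analysis rests on the scalability coefficient H, which is compared against a fixed lower bound (commonly around 0.3) when forming scales. A rough sketch for dichotomous items; the specific dimensionality assessment procedures compared in the study are more involved than this single coefficient.

```python
import numpy as np

def scalability_h(items):
    """Mokken's H for dichotomous items: ratio of summed inter-item covariances
    to their maximum possible values given the item popularities."""
    items = np.asarray(items, dtype=float)
    p = items.mean(axis=0)
    cov = np.cov(items, rowvar=False, bias=True)
    num, den = 0.0, 0.0
    k = items.shape[1]
    for j in range(k):
        for l in range(j + 1, k):
            num += cov[j, l]
            den += min(p[j], p[l]) - p[j] * p[l]   # max covariance for two Bernoulli margins
    return num / den

# Items are then judged against a fixed lower bound, e.g. scalability_h(data) >= 0.3.
```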
St-Onge, Christina; Valois, Pierre; Abdous, Belkacem; Germain, Stephane – Applied Psychological Measurement, 2011
Using a Monte Carlo experimental design, this research examined the relationship between answer patterns' aberrance rates and person-fit statistics (PFS) accuracy. It was observed that as the aberrance rate increased, the detection rates of PFS also increased until, in some situations, a peak was reached and then the detection rates of PFS…
Descriptors: Monte Carlo Methods, Accuracy, Goodness of Fit, Statistics
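A common person-fit statistic of the kind examined in such studies is the standardized log-likelihood statistic l_z; the snippet does not say which PFS the authors compared, so this is context rather than their method. A minimal sketch given model probabilities at an estimated theta:

```python
import numpy as np

def lz_statistic(u, p):
    """Standardized log-likelihood person-fit statistic l_z for one 0/1 response
    vector u, given model probabilities p at the person's theta estimate."""
    u, p = np.asarray(u, float), np.asarray(p, float)
    l0 = np.sum(u * np.log(p) + (1 - u) * np.log(1 - p))
    e_l0 = np.sum(p * np.log(p) + (1 - p) * np.log(1 - p))
    v_l0 = np.sum(p * (1 - p) * np.log(p / (1 - p)) ** 2)
    return (l0 - e_l0) / np.sqrt(v_l0)

# Large negative values flag aberrant (misfitting) response patterns.
```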
van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas – Applied Psychological Measurement, 2011
This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…
Descriptors: Simulation, Reliability, Measurement, Psychology
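Two of the classical single-administration estimates named above, coefficient alpha and Guttman's lambda-2, can be computed directly from the inter-item covariance matrix; the latent class reliability coefficient introduced in the article is not shown. A minimal sketch:

```python
import numpy as np

def alpha_and_lambda2(items):
    """Coefficient alpha and Guttman's lambda-2 from an (n_persons x k_items) matrix."""
    s = np.cov(np.asarray(items, float), rowvar=False)   # inter-item covariance matrix
    k = s.shape[0]
    total_var = s.sum()                                  # variance of the sum score
    off_diag_sum = total_var - np.trace(s)               # sum of inter-item covariances
    alpha = k / (k - 1) * off_diag_sum / total_var
    off_sq = (s ** 2).sum() - (np.diag(s) ** 2).sum()    # sum of squared off-diagonal entries
    lambda2 = (off_diag_sum + np.sqrt(k / (k - 1) * off_sq)) / total_var
    return alpha, lambda2
```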
He, Yong; Cui, Zhongmin; Fang, Yu; Chen, Hanwei – Applied Psychological Measurement, 2013
Common test items play an important role in equating alternate test forms under the common item nonequivalent groups design. When the item response theory (IRT) method is applied in equating, inconsistent item parameter estimates among common items can lead to large bias in equated scores. It is prudent to evaluate inconsistency in parameter…
Descriptors: Regression (Statistics), Item Response Theory, Test Items, Equated Scores
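A simple screen in the spirit of the article, though not necessarily the authors' procedure: relate the common items' difficulty estimates from the two calibrations by least squares and flag items with large standardized residuals. The cutoff is an arbitrary assumption.

```python
import numpy as np

def flag_inconsistent_items(b_old, b_new, z_crit=2.5):
    """Flag common items whose difficulty estimates disagree across forms:
    fit a least-squares line relating the two calibrations and flag items
    with large standardized residuals."""
    b_old, b_new = np.asarray(b_old, float), np.asarray(b_new, float)
    slope, intercept = np.polyfit(b_old, b_new, deg=1)
    resid = b_new - (slope * b_old + intercept)
    z = (resid - resid.mean()) / resid.std(ddof=1)
    return np.where(np.abs(z) > z_crit)[0]
```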
Belov, Dmitry I. – Applied Psychological Measurement, 2011
This article presents the Variable Match Index (VM-Index), a new statistic for detecting answer copying. The power of the VM-Index relies on two-dimensional conditioning as well as the structure of the test. The asymptotic distribution of the VM-Index is analyzed by reduction to Poisson trials. A computational study comparing the VM-Index with the…
Descriptors: Cheating, Journal Articles, Computation, Comparative Analysis
Johnson, Timothy R. – Applied Psychological Measurement, 2013
One of the distinctions between classical test theory and item response theory is that the former focuses on sum scores and their relationship to true scores, whereas the latter concerns item responses and their relationship to latent scores. Although item response theory is often viewed as the richer of the two theories, sum scores are still…
Descriptors: Item Response Theory, Scores, Computation, Bayesian Statistics
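The link between the two score types is the test characteristic curve: under an IRT model, the expected sum score at a given latent score is the sum of the item response probabilities. A minimal sketch under a 2PL model with illustrative parameters (assumed, not from the article):

```python
import numpy as np

def expected_sum_score(theta, a, b):
    """Test characteristic curve: model-implied expected sum score at each theta
    for 2PL items with discriminations a and difficulties b."""
    theta = np.atleast_1d(np.asarray(theta, float))
    a, b = np.asarray(a, float), np.asarray(b, float)
    p = 1.0 / (1.0 + np.exp(-a[None, :] * (theta[:, None] - b[None, :])))
    return p.sum(axis=1)

a = np.array([1.0, 1.4, 0.8, 1.2])
b = np.array([-1.0, 0.0, 0.5, 1.2])
print(expected_sum_score([-1.0, 0.0, 1.0], a, b))
```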