Padilla, Miguel A.; Divers, Jasmin; Newton, Matthew – Applied Psychological Measurement, 2012
Three different bootstrap methods for estimating confidence intervals (CIs) for coefficient alpha were investigated. In addition, the bootstrap methods were compared with the most promising coefficient alpha CI estimation methods reported in the literature. The CI methods were assessed through a Monte Carlo simulation utilizing conditions…
Descriptors: Intervals, Monte Carlo Methods, Computation, Sampling
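As an illustration of the percentile bootstrap referred to in this abstract, a minimal Python sketch follows; the item-score matrix, replication count, and simulated Likert data are assumptions for the example, not the authors' simulation design.

import numpy as np

def coefficient_alpha(x):
    """Cronbach's alpha for an examinees-by-items score matrix."""
    k = x.shape[1]
    item_vars = x.var(axis=0, ddof=1).sum()
    total_var = x.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1 - item_vars / total_var)

def bootstrap_alpha_ci(x, n_boot=2000, level=0.95, seed=0):
    """Nonparametric percentile bootstrap CI for coefficient alpha."""
    rng = np.random.default_rng(seed)
    n = x.shape[0]
    alphas = np.array([
        coefficient_alpha(x[rng.integers(0, n, size=n)])
        for _ in range(n_boot)
    ])
    lo, hi = np.percentile(alphas, [100 * (1 - level) / 2,
                                    100 * (1 + level) / 2])
    return lo, hi

# Illustrative 5-item Likert-type data (simulated, not from the article).
rng = np.random.default_rng(1)
true_score = rng.normal(size=(300, 1))
items = np.clip(np.round(true_score + rng.normal(size=(300, 5)) + 3), 1, 5)
print(coefficient_alpha(items), bootstrap_alpha_ci(items))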
Seybert, Jacob; Stark, Stephen – Applied Psychological Measurement, 2012
A Monte Carlo study was conducted to examine the accuracy of differential item functioning (DIF) detection using the differential functioning of items and tests (DFIT) method. Specifically, the performance of DFIT was compared using "testwide" critical values suggested by Flowers, Oshima, and Raju, based on simulations involving large numbers of…
Descriptors: Test Bias, Monte Carlo Methods, Form Classes (Languages), Simulation
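For context on the DFIT framework, a minimal sketch of its item-level statistic NCDIF for a 2PL item is shown below; the focal-group ability draws and item parameters are invented, and the testwide critical values the study compares are not reproduced.

import numpy as np

def p_2pl(theta, a, b):
    """2PL item response function."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def ncdif(theta_focal, a_ref, b_ref, a_foc, b_foc):
    """Noncompensatory DIF: mean squared difference in expected item scores
    for focal-group examinees under focal vs. reference calibrations."""
    d = p_2pl(theta_focal, a_foc, b_foc) - p_2pl(theta_focal, a_ref, b_ref)
    return np.mean(d ** 2)

# Hypothetical focal-group abilities and item parameter estimates.
rng = np.random.default_rng(0)
theta_focal = rng.normal(size=5000)
print(ncdif(theta_focal, a_ref=1.2, b_ref=0.0, a_foc=1.2, b_foc=0.4))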
Brossman, Bradley G.; Lee, Won-Chan – Applied Psychological Measurement, 2013
The purpose of this research was to develop observed score and true score equating procedures to be used in conjunction with the multidimensional item response theory (MIRT) framework. Three equating procedures--two observed score procedures and one true score procedure--were created and described in detail. One observed score procedure was…
Descriptors: Equated Scores, True Scores, Item Response Theory, Mathematics Tests
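The unidimensional IRT true score equating that the MIRT procedures extend can be sketched briefly: invert the test characteristic curve of one form to obtain theta, then evaluate the other form's curve at that theta. The 2PL item parameters below are invented for illustration; the multidimensional machinery of the article is not reproduced.

import numpy as np
from scipy.optimize import brentq

def tcc(theta, a, b):
    """Test characteristic curve: expected number-correct score under a 2PL."""
    return np.sum(1.0 / (1.0 + np.exp(-a * (theta - b))))

def true_score_equate(score_x, ax, bx, ay, by):
    """Map a true score on form X to the corresponding true score on form Y."""
    # Solve TCC_X(theta) = score_x, then evaluate TCC_Y at that theta.
    theta = brentq(lambda t: tcc(t, ax, bx) - score_x, -6, 6)
    return tcc(theta, ay, by)

# Hypothetical 2PL parameters for two 10-item forms.
rng = np.random.default_rng(2)
ax, bx = rng.uniform(0.8, 1.6, 10), rng.normal(0, 1, 10)
ay, by = rng.uniform(0.8, 1.6, 10), rng.normal(0.2, 1, 10)
print(true_score_equate(6.0, ax, bx, ay, by))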
He, Yong; Cui, Zhongmin; Fang, Yu; Chen, Hanwei – Applied Psychological Measurement, 2013
Common test items play an important role in equating alternate test forms under the common item nonequivalent groups design. When the item response theory (IRT) method is applied in equating, inconsistent item parameter estimates among common items can lead to large bias in equated scores. It is prudent to evaluate inconsistency in parameter…
Descriptors: Regression (Statistics), Item Response Theory, Test Items, Equated Scores
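One simple screen for the kind of common-item inconsistency the abstract mentions, offered only as an illustration and not as the authors' evaluation procedure, is to regress the anchors' difficulty estimates from one administration on those from the other and flag large standardized residuals.

import numpy as np

def flag_inconsistent_anchors(b_old, b_new, z_cut=2.0):
    """Regress new-form difficulty estimates on old-form estimates for the
    common items and flag those with large standardized residuals."""
    slope, intercept = np.polyfit(b_old, b_new, 1)
    resid = b_new - (slope * b_old + intercept)
    z = (resid - resid.mean()) / resid.std(ddof=1)
    return np.where(np.abs(z) > z_cut)[0]

# Hypothetical difficulty estimates for 15 common items,
# with item 7 drifting between administrations.
rng = np.random.default_rng(3)
b_old = rng.normal(0, 1, 15)
b_new = b_old + rng.normal(0, 0.05, 15)
b_new[7] += 0.8
print(flag_inconsistent_anchors(b_old, b_new))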
Belov, Dmitry I. – Applied Psychological Measurement, 2011
This article presents the Variable Match Index (VM-Index), a new statistic for detecting answer copying. The power of the VM-Index relies on two-dimensional conditioning as well as the structure of the test. The asymptotic distribution of the VM-Index is analyzed by reduction to Poisson trials. A computational study comparing the VM-Index with the…
Descriptors: Cheating, Journal Articles, Computation, Comparative Analysis
de la Torre, Jimmy; Song, Hao; Hong, Yuan – Applied Psychological Measurement, 2011
Lack of sufficient reliability is the primary impediment for generating and reporting subtest scores. Several current methods of subscore estimation do so either by incorporating the correlational structure among the subtest abilities or by using the examinee's performance on the overall test. This article conducted a systematic comparison of four…
Descriptors: Item Response Theory, Scoring, Methods, Comparative Analysis
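For context, the simplest classical subscore estimator is Kelley's regressed score, which shrinks each observed subscore toward the group mean in proportion to its reliability; the sketch below is illustrative and is not necessarily one of the four methods compared in the article, and the reliability and scores are assumed values.

import numpy as np

def kelley_subscore(observed, reliability):
    """Kelley's regressed estimate of the true subscore: shrink each
    observed subscore toward the group mean by the subscore reliability."""
    observed = np.asarray(observed, dtype=float)
    return reliability * observed + (1.0 - reliability) * observed.mean()

# Hypothetical subtest scores with an assumed reliability of 0.55.
print(kelley_subscore([2, 4, 6, 8, 5, 3], reliability=0.55))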
Finch, W. Holmes – Applied Psychological Measurement, 2012
Increasingly, researchers interested in identifying potentially biased test items are encouraged to use a confirmatory, rather than exploratory, approach. One such method for confirmatory testing is rooted in differential bundle functioning (DBF), where hypotheses regarding potential differential item functioning (DIF) for sets of items (bundles)…
Descriptors: Test Bias, Test Items, Statistical Analysis, Models
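A rough sketch of the idea behind DBF testing follows: compare reference and focal examinees' bundle scores within total-score strata. This is a simplified, SIBTEST-like comparison without the regression correction, and the groups, bundle membership, and data are assumed for illustration.

import numpy as np

def bundle_dbf(responses, group, bundle_items):
    """Crude DBF index: weighted average of the reference-minus-focal
    difference in bundle scores, taken within total-score strata."""
    total = responses.sum(axis=1)
    bundle = responses[:, bundle_items].sum(axis=1)
    diffs, weights = [], []
    for s in np.unique(total):
        in_stratum = total == s
        ref = bundle[in_stratum & (group == 0)]
        foc = bundle[in_stratum & (group == 1)]
        if len(ref) and len(foc):
            diffs.append(ref.mean() - foc.mean())
            weights.append(in_stratum.sum())
    return np.average(diffs, weights=weights)

# Hypothetical 0/1 responses: 400 examinees, 20 items, bundle = items 15-19.
rng = np.random.default_rng(4)
responses = (rng.random((400, 20)) < 0.6).astype(int)
group = np.repeat([0, 1], 200)
print(bundle_dbf(responses, group, bundle_items=list(range(15, 20))))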
Moses, Tim; Deng, Weiling; Zhang, Yu-Li – Applied Psychological Measurement, 2011
Nonequivalent groups with anchor test (NEAT) equating functions that use a single anchor can have accuracy problems when the groups are extremely different and/or when the anchor weakly correlates with the tests being equated. Proposals have been made to address these issues by incorporating more than one anchor into NEAT equating functions. These…
Descriptors: Equated Scores, Tests, Comparative Analysis, Correlation
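The single-anchor baseline that such proposals build on can be illustrated with chained linear equating under the NEAT design: link X to the anchor A in population 1, then A to Y in population 2. The summary statistics below are assumed, and the multiple-anchor extensions studied in the article are not shown.

def chained_linear_neat(x, mx1, sx1, ma1, sa1, ma2, sa2, my2, sy2):
    """Chained linear equating for the NEAT design with one anchor A."""
    a_equiv = ma1 + (sa1 / sx1) * (x - mx1)        # X -> A (population 1)
    return my2 + (sy2 / sa2) * (a_equiv - ma2)     # A -> Y (population 2)

# Hypothetical means/SDs for forms X, Y and anchor A in the two populations.
print(chained_linear_neat(x=30,
                          mx1=28.0, sx1=6.0, ma1=14.0, sa1=3.0,
                          ma2=13.0, sa2=3.2, my2=27.0, sy2=6.5))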
Kieftenbeld, Vincent; Natesan, Prathiba – Applied Psychological Measurement, 2012
Markov chain Monte Carlo (MCMC) methods enable a fully Bayesian approach to parameter estimation of item response models. In this simulation study, the authors compared the recovery of graded response model parameters using marginal maximum likelihood (MML) and Gibbs sampling (MCMC) under various latent trait distributions, test lengths, and…
Descriptors: Test Length, Markov Processes, Item Response Theory, Monte Carlo Methods
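Both the MML and Gibbs-sampling estimators in such a comparison work from the same graded response model likelihood; a compact sketch of the category probabilities, with made-up item parameters, is given below.

import numpy as np

def grm_probs(theta, a, thresholds):
    """Samejima graded response model: probability of each ordered category
    for one item, P(X = k) = P*(k) - P*(k + 1), with logistic boundary curves."""
    thresholds = np.asarray(thresholds, dtype=float)
    # Cumulative probabilities of responding in category k or higher.
    p_star = 1.0 / (1.0 + np.exp(-a * (theta - thresholds)))
    p_star = np.concatenate(([1.0], p_star, [0.0]))
    return p_star[:-1] - p_star[1:]

# Hypothetical 5-category item with discrimination 1.5.
print(grm_probs(theta=0.3, a=1.5, thresholds=[-1.5, -0.5, 0.5, 1.5]))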
Kim, Doyoung; De Ayala, R. J.; Ferdous, Abdullah A.; Nering, Michael L. – Applied Psychological Measurement, 2011
To realize the benefits of item response theory (IRT), one must have model-data fit. One facet of a model-data fit investigation involves assessing the tenability of the conditional item independence (CII) assumption. In this Monte Carlo study, the comparative performance of 10 indices for identifying conditional item dependence is assessed. The…
Descriptors: Item Response Theory, Monte Carlo Methods, Error of Measurement, Statistical Analysis
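One widely used index of conditional item dependence is Yen's Q3, the correlation of model residuals for an item pair; whether Q3 is among the 10 indices studied is not stated in the snippet, so the sketch below, assuming a fitted 2PL, is purely illustrative.

import numpy as np

def yen_q3(responses, theta, a, b, item_j, item_k):
    """Yen's Q3: correlation between examinees' residuals (observed minus
    model-expected score) on two items, given fitted 2PL parameters."""
    def resid(i):
        p = 1.0 / (1.0 + np.exp(-a[i] * (theta - b[i])))
        return responses[:, i] - p
    return np.corrcoef(resid(item_j), resid(item_k))[0, 1]

# Hypothetical data: 500 examinees, 10 items, parameters from a prior calibration.
rng = np.random.default_rng(5)
theta = rng.normal(size=500)
a, b = rng.uniform(0.8, 1.6, 10), rng.normal(0, 1, 10)
p = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b)))
responses = (rng.random((500, 10)) < p).astype(int)
print(yen_q3(responses, theta, a, b, item_j=0, item_k=1))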
Woods, Carol M. – Applied Psychological Measurement, 2008
In Ramsay-curve item response theory (RC-IRT), the latent variable distribution is estimated simultaneously with the item parameters. In extant Monte Carlo evaluations of RC-IRT, the item response function (IRF) used to fit the data is the same one used to generate the data. The present simulation study examines RC-IRT when the IRF is imperfectly…
Descriptors: Simulation, Item Response Theory, Monte Carlo Methods, Comparative Analysis
Hennig, Christian; Mullensiefen, Daniel; Bargmann, Jens – Applied Psychological Measurement, 2010
The authors propose a method to compare the influence of a treatment on different properties within subjects. The properties are measured by several Likert-type-scaled items. The results show that many existing approaches, such as repeated measurement analysis of variance on sum and mean scores, a linear partial credit model, and a graded response…
Descriptors: Simulation, Pretests Posttests, Regression (Statistics), Comparative Analysis
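The sum-score baseline named in this abstract reduces, in the two-occasion case, to a paired comparison of pre- and post-treatment totals; the sketch below uses simulated Likert responses with an assumed item count and effect, and does not implement the authors' proposed method.

import numpy as np
from scipy.stats import ttest_rel

# Simulated 5-point Likert responses for 60 subjects on 6 items,
# before and after a hypothetical treatment that shifts responses upward.
rng = np.random.default_rng(8)
pre = rng.integers(1, 6, size=(60, 6))
post = np.clip(pre + rng.integers(0, 2, size=(60, 6)), 1, 5)

# With two occasions, repeated-measures ANOVA on sum scores is equivalent
# to a paired t-test on the totals.
t_stat, p_value = ttest_rel(post.sum(axis=1), pre.sum(axis=1))
print(t_stat, p_value)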
Riley, Barth B.; Dennis, Michael L.; Conrad, Kendon J. – Applied Psychological Measurement, 2010
This simulation study sought to compare four different computerized adaptive testing (CAT) content-balancing procedures designed for use in a multidimensional assessment with respect to measurement precision, symptom severity classification, validity of clinical diagnostic recommendations, and sensitivity to atypical responding. The four…
Descriptors: Simulation, Computer Assisted Testing, Adaptive Testing, Comparative Analysis
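A simple version of the kind of content-balancing rule such studies compare selects the most informative unused item from the content area currently furthest below its target proportion; this generic constrained maximum-information sketch is not any of the four procedures in the article, and the pool, targets, and parameters are assumed.

import numpy as np

def fisher_info_2pl(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a ** 2 * p * (1 - p)

def select_next_item(theta, a, b, area, administered, target_prop):
    """Constrained maximum-information selection: search only the content
    area whose administered proportion is furthest below its target.
    Assumes at least one item has already been administered."""
    administered = np.asarray(administered, dtype=int)
    given_areas = area[administered]
    deficits = {c: target_prop[c] - np.mean(given_areas == c) for c in target_prop}
    needed_area = max(deficits, key=deficits.get)
    info = fisher_info_2pl(theta, a, b)
    candidates = np.setdiff1d(np.where(area == needed_area)[0], administered)
    return candidates[np.argmax(info[candidates])]

# Hypothetical 30-item pool split over three content areas.
rng = np.random.default_rng(6)
a, b = rng.uniform(0.8, 1.8, 30), rng.normal(0, 1, 30)
area = np.repeat([0, 1, 2], 10)
print(select_next_item(theta=0.0, a=a, b=b, area=area,
                       administered=[2, 11], target_prop={0: 0.4, 1: 0.3, 2: 0.3}))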
St-Onge, Christina; Valois, Pierre; Abdous, Belkacem; Germain, Stephane – Applied Psychological Measurement, 2009
To date, there have been no studies comparing parametric and nonparametric Item Characteristic Curve (ICC) estimation methods on the effectiveness of Person-Fit Statistics (PFS). The primary aim of this study was to determine if the use of ICCs estimated by nonparametric methods would increase the accuracy of item response theory-based PFS for…
Descriptors: Sample Size, Monte Carlo Methods, Nonparametric Statistics, Item Response Theory
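IRT-based person-fit statistics of this kind are functions of the ICC values; a minimal sketch of the standardized log-likelihood statistic lz follows, with the probabilities assumed to come from a parametric ICC. The nonparametric variant examined in the article would substitute kernel-smoothed curves for these probabilities; the responses below are invented.

import numpy as np

def lz_person_fit(x, p):
    """Standardized log-likelihood person-fit statistic lz for one examinee.
    x: 0/1 response vector; p: model-implied probabilities of a correct
    response at the examinee's estimated ability (the ICC values)."""
    x, p = np.asarray(x, float), np.asarray(p, float)
    l0 = np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))
    expected = np.sum(p * np.log(p) + (1 - p) * np.log(1 - p))
    variance = np.sum(p * (1 - p) * np.log(p / (1 - p)) ** 2)
    return (l0 - expected) / np.sqrt(variance)

# Hypothetical responses and ICC probabilities for a 10-item test.
p = np.array([0.9, 0.85, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.15])
x = np.array([1, 1, 1, 1, 0, 1, 0, 0, 0, 0])
print(lz_person_fit(x, p))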
Hurtz, Gregory M.; Jones, J. Patrick; Jones, Christian N. – Applied Psychological Measurement, 2008
This study compares the efficacy of different strategies for translating item-level, proportion-correct standard-setting judgments into a theta-metric test cutoff score for use with item response theory (IRT) scoring, using Monte Carlo methods. Simulated Angoff-type ratings, consisting of 1,000 independent 75 Item x 13 Rater matrices, were…
Descriptors: Monte Carlo Methods, Measures (Individuals), Item Response Theory, Standard Setting
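One common translation strategy of the kind such studies examine maps the summed Angoff proportion-correct judgments onto the theta metric by inverting the test characteristic curve; the ratings and 2PL parameters below are invented, and the specific strategies compared in the article are not reproduced.

import numpy as np
from scipy.optimize import brentq

def theta_cutoff_from_angoff(ratings, a, b):
    """Map Angoff item ratings (expected proportion correct for the minimally
    competent examinee) to a theta cutoff by inverting the test
    characteristic curve of a fitted 2PL model."""
    cut_raw = np.sum(ratings)                      # panel's raw cutscore
    def tcc(t):
        return np.sum(1.0 / (1.0 + np.exp(-a * (t - b))))
    return brentq(lambda t: tcc(t) - cut_raw, -8, 8)

# Hypothetical ratings and item parameters for a 10-item test.
rng = np.random.default_rng(7)
a, b = rng.uniform(0.8, 1.6, 10), rng.normal(0, 1, 10)
ratings = rng.uniform(0.4, 0.9, 10)
print(theta_cutoff_from_angoff(ratings, a, b))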