ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	28

Descriptor

Comparative Analysis	52
Simulation	36
Item Response Theory	30
Test Items	19
Computer Simulation	16
Computer Assisted Testing	11
Adaptive Testing	10
Computation	9
Mathematical Models	9
Models	9
Evaluation Methods	8
Equations (Mathematics)	7
Item Analysis	7
Maximum Likelihood Statistics	7
Monte Carlo Methods	7
Statistical Analysis	7
Bayesian Statistics	6
Estimation (Mathematics)	6
Item Bias	6
Test Construction	6
Measurement	5
Error of Measurement	4
Goodness of Fit	4
Nonparametric Statistics	4
Sample Size	4
More ▼

Source

Applied Psychological…

Publication Type

Journal Articles	52
Reports - Research	26
Reports - Evaluative	25
Speeches/Meeting Papers	3
Guides - Non-Classroom	1
Reports - Descriptive	1

Education Level

Higher Education

Audience

Practitioners

Location

Netherlands

Laws, Policies, & Programs

Assessments and Surveys

Center for Epidemiologic…

What Works Clearinghouse Rating

Showing 1 to 15 of 52 results Save | Export

Confirming Testlet Effects

Peer reviewed

Direct link

DeMars, Christine E. – Applied Psychological Measurement, 2012

A testlet is a cluster of items that share a common passage, scenario, or other context. These items might measure something in common beyond the trait measured by the test as a whole; if so, the model for the item responses should allow for this testlet trait. But modeling testlet effects that are negligible makes the model unnecessarily…

Descriptors: Test Items, Item Response Theory, Comparative Analysis, Models

Coefficient Alpha Bootstrap Confidence Interval under Nonnormality

Peer reviewed

Direct link

Padilla, Miguel A.; Divers, Jasmin; Newton, Matthew – Applied Psychological Measurement, 2012

Three different bootstrap methods for estimating confidence intervals (CIs) for coefficient alpha were investigated. In addition, the bootstrap methods were compared with the most promising coefficient alpha CI estimation methods reported in the literature. The CI methods were assessed through a Monte Carlo simulation utilizing conditions…

Descriptors: Intervals, Monte Carlo Methods, Computation, Sampling

Iterative Linking with the Differential Functioning of Items and Tests (DFIT) Method: Comparison of Testwide and Item Parameter Replication (IPR) Critical Values

Peer reviewed

Direct link

Seybert, Jacob; Stark, Stephen – Applied Psychological Measurement, 2012

A Monte Carlo study was conducted to examine the accuracy of differential item functioning (DIF) detection using the differential functioning of items and tests (DFIT) method. Specifically, the performance of DFIT was compared using "testwide" critical values suggested by Flowers, Oshima, and Raju, based on simulations involving large numbers of…

Descriptors: Test Bias, Monte Carlo Methods, Form Classes (Languages), Simulation

A Comparison between Some Generalized Mantel-Haenszel Statistics for Detecting DIF in Data Simulated under the Graded Response Model

Peer reviewed

Direct link

Fidalgo, Angel M.; Bartram, Dave – Applied Psychological Measurement, 2010

The main objective of this study was to establish the relative efficacy of the generalized Mantel-Haenszel test (GMH) and the Mantel test for detecting large numbers of differential item functioning (DIF) patterns. To this end this study considered a topic not dealt with in the literature to date: the possible differential effect of type of scores…

Descriptors: Test Bias, Statistics, Scoring, Comparative Analysis

Curtailment and Stochastic Curtailment to Shorten the CES-D

Peer reviewed

Direct link

Finkelman, Matthew D.; Smits, Niels; Kim, Wonsuk; Riley, Barth – Applied Psychological Measurement, 2012

The Center for Epidemiologic Studies-Depression (CES-D) scale is a well-known self-report instrument that is used to measure depressive symptomatology. Respondents who take the full-length version of the CES-D are administered a total of 20 items. This article investigates the use of curtailment and stochastic curtailment (SC), two sequential…

Descriptors: Measures (Individuals), Depression (Psychology), Test Length, Computer Assisted Testing

Recovery of Graded Response Model Parameters: A Comparison of Marginal Maximum Likelihood and Markov Chain Monte Carlo Estimation

Peer reviewed

Direct link

Kieftenbeld, Vincent; Natesan, Prathiba – Applied Psychological Measurement, 2012

Markov chain Monte Carlo (MCMC) methods enable a fully Bayesian approach to parameter estimation of item response models. In this simulation study, the authors compared the recovery of graded response model parameters using marginal maximum likelihood (MML) and Gibbs sampling (MCMC) under various latent trait distributions, test lengths, and…

Descriptors: Test Length, Markov Processes, Item Response Theory, Monte Carlo Methods

A Binary Programming Approach to Automated Test Assembly for Cognitive Diagnosis Models

Peer reviewed

Direct link

Finkelman, Matthew D.; Kim, Wonsuk; Roussos, Louis; Verschoor, Angela – Applied Psychological Measurement, 2010

Automated test assembly (ATA) has been an area of prolific psychometric research. Although ATA methodology is well developed for unidimensional models, its application alongside cognitive diagnosis models (CDMs) is a burgeoning topic. Two suggested procedures for combining ATA and CDMs are to maximize the cognitive diagnostic index and to use a…

Descriptors: Automation, Test Construction, Programming, Models

Model Selection Indices for Polytomous Items

Peer reviewed

Direct link

Kang, Taehoon; Cohen, Allan S.; Sung, Hyun-Jung – Applied Psychological Measurement, 2009

This study examines the utility of four indices for use in model selection with nested and nonnested polytomous item response theory (IRT) models: a cross-validation index and three information-based indices. Four commonly used polytomous IRT models are considered: the graded response model, the generalized partial credit model, the partial credit…

Descriptors: Item Response Theory, Models, Selection, Simulation

A Comparison of Item Selection Techniques for Testlets

Peer reviewed

Direct link

Murphy, Daniel L.; Dodd, Barbara G.; Vaughn, Brandon K. – Applied Psychological Measurement, 2010

This study examined the performance of the maximum Fisher's information, the maximum posterior weighted information, and the minimum expected posterior variance methods for selecting items in a computerized adaptive testing system when the items were grouped in testlets. A simulation study compared the efficiency of ability estimation among the…

Descriptors: Simulation, Adaptive Testing, Item Analysis, Item Response Theory

A Method for the Comparison of Item Selection Rules in Computerized Adaptive Testing

Peer reviewed

Direct link

Barrada, Juan Ramon; Olea, Julio; Ponsoda, Vicente; Abad, Francisco Jose – Applied Psychological Measurement, 2010

In a typical study comparing the relative efficiency of two item selection rules in computerized adaptive testing, the common result is that they simultaneously differ in accuracy and security, making it difficult to reach a conclusion on which is the more appropriate rule. This study proposes a strategy to conduct a global comparison of two or…

Descriptors: Test Items, Simulation, Adaptive Testing, Item Analysis

A Modified Frequency Estimation Equating Method for the Common-Item Nonequivalent Groups Design

Peer reviewed

Direct link

Wang, Tianyou; Brennan, Robert L. – Applied Psychological Measurement, 2009

Frequency estimation, also called poststratification, is an equating method used under the common-item nonequivalent groups design. A modified frequency estimation method is proposed here, based on altering one of the traditional assumptions in frequency estimation in order to correct for equating bias. A simulation study was carried out to…

Descriptors: Computation, Bias, Comparative Analysis, Statistical Analysis

Item Response Theory with Estimation of the Latent Density Using Davidian Curves

Peer reviewed

Direct link

Woods, Carol M.; Lin, Nan – Applied Psychological Measurement, 2009

Davidian-curve item response theory (DC-IRT) is introduced, evaluated with simulations, and illustrated using data from the Schedule for Nonadaptive and Adaptive Personality Entitlement scale. DC-IRT is a method for fitting unidimensional IRT models with maximum marginal likelihood estimation, in which the latent density is estimated,…

Descriptors: Item Response Theory, Personality Measures, Computation, Simulation

Within-Subject Comparison of Changes in a Pretest-Posttest Design

Peer reviewed

Direct link

Hennig, Christian; Mullensiefen, Daniel; Bargmann, Jens – Applied Psychological Measurement, 2010

The authors propose a method to compare the influence of a treatment on different properties within subjects. The properties are measured by several Likert-type-scaled items. The results show that many existing approaches, such as repeated measurement analysis of variance on sum and mean scores, a linear partial credit model, and a graded response…

Descriptors: Simulation, Pretests Posttests, Regression (Statistics), Comparative Analysis

A Comparison of Content-Balancing Procedures for Estimating Multiple Clinical Domains in Computerized Adaptive Testing: Relative Precision, Validity, and Detection of Persons with Misfitting Responses

Peer reviewed

Direct link

Riley, Barth B.; Dennis, Michael L.; Conrad, Kendon J. – Applied Psychological Measurement, 2010

This simulation study sought to compare four different computerized adaptive testing (CAT) content-balancing procedures designed for use in a multidimensional assessment with respect to measurement precision, symptom severity classification, validity of clinical diagnostic recommendations, and sensitivity to atypical responding. The four…

Descriptors: Simulation, Computer Assisted Testing, Adaptive Testing, Comparative Analysis

A Parametric Cumulative Sum Statistic for Person Fit

Peer reviewed

Direct link

Armstrong, Ronald D.; Shi, Min – Applied Psychological Measurement, 2009

This article develops a new cumulative sum (CUSUM) statistic to detect aberrant item response behavior. Shifts in behavior are modeled with quadratic functions and a series of likelihood ratio tests are used to detect aberrancy. The new CUSUM statistic is compared against another CUSUM approach as well as traditional person-fit statistics. A…

Descriptors: Simulation, Item Response Theory, Personality Theories, High Stakes Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Cohen, Allan S.	4
Finkelman, Matthew D.	3
Woods, Carol M.	3
Brennan, Robert L.	2
Dodd, Barbara G.	2
Kang, Taehoon	2
Kim, Wonsuk	2
Kolen, Michael J.	2
Liou, Michelle	2
Olea, Julio	2
Ponsoda, Vicente	2
Stark, Stephen	2
Stocking, Martha L.	2
Swaminathan, Hariharan	2
Wang, Tianyou	2
Abad, Francisco J.	1
Abad, Francisco Jose	1
Armstrong, Ronald D.	1
Bargmann, Jens	1
Barrada, Juan Ramon	1
Bartram, Dave	1
Bonett, Douglas G.	1
Camilli, Gregory	1
Chen, Po-Hsi	1
More ▼