ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	46

Descriptor

Evaluation Methods	86
Item Response Theory	54
Models	27
Simulation	24
Test Items	21
Computation	20
Comparative Analysis	16
Equated Scores	16
Psychological Studies	15
Measurement Techniques	14
Monte Carlo Methods	14
Psychometrics	10
Correlation	9
Achievement Tests	8
Computer Assisted Testing	8
Computer Software	8
Error of Measurement	8
Measurement	8
Multiple Choice Tests	8
Scores	8
Evaluation Research	7
Goodness of Fit	7
Item Banks	7
Test Bias	7
Computer Simulation	6
More ▼

Source

Applied Psychological…

Publication Type

Journal Articles	86
Reports - Evaluative	36
Reports - Research	33
Reports - Descriptive	11
Information Analyses	6
Book/Product Reviews	2
Speeches/Meeting Papers	2

Education Level

Higher Education	8
Adult Education	1
Grade 8	1
High Schools	1

Audience

Practitioners	3
Researchers	2

Location

Denmark	1
Israel	1
Maryland	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

Iowa Tests of Educational…	2
SAT (College Admission Test)	2
California Achievement Tests	1
California Learning…	1
Hidden Figures Test	1
Iowa Tests of Basic Skills	1

What Works Clearinghouse Rating

Showing 1 to 15 of 86 results Save | Export

Coefficient Alpha Bootstrap Confidence Interval under Nonnormality

Peer reviewed

Direct link

Padilla, Miguel A.; Divers, Jasmin; Newton, Matthew – Applied Psychological Measurement, 2012

Three different bootstrap methods for estimating confidence intervals (CIs) for coefficient alpha were investigated. In addition, the bootstrap methods were compared with the most promising coefficient alpha CI estimation methods reported in the literature. The CI methods were assessed through a Monte Carlo simulation utilizing conditions…

Descriptors: Intervals, Monte Carlo Methods, Computation, Sampling

A Negative Binomial Regression Model for Accuracy Tests

Peer reviewed

Direct link

Hung, Lai-Fa – Applied Psychological Measurement, 2012

Rasch used a Poisson model to analyze errors and speed in reading tests. An important property of the Poisson distribution is that the mean and variance are equal. However, in social science research, it is very common for the variance to be greater than the mean (i.e., the data are overdispersed). This study embeds the Rasch model within an…

Descriptors: Social Science Research, Markov Processes, Reading Tests, Social Sciences

Iterative Linking with the Differential Functioning of Items and Tests (DFIT) Method: Comparison of Testwide and Item Parameter Replication (IPR) Critical Values

Peer reviewed

Direct link

Seybert, Jacob; Stark, Stephen – Applied Psychological Measurement, 2012

A Monte Carlo study was conducted to examine the accuracy of differential item functioning (DIF) detection using the differential functioning of items and tests (DFIT) method. Specifically, the performance of DFIT was compared using "testwide" critical values suggested by Flowers, Oshima, and Raju, based on simulations involving large numbers of…

Descriptors: Test Bias, Monte Carlo Methods, Form Classes (Languages), Simulation

Recognizing Uncertainty in the Q-Matrix via a Bayesian Extension of the DINA Model

Peer reviewed

Direct link

DeCarlo, Lawrence T. – Applied Psychological Measurement, 2012

In the typical application of a cognitive diagnosis model, the Q-matrix, which reflects the theory with respect to the skills indicated by the items, is assumed to be known. However, the Q-matrix is usually determined by expert judgment, and so there can be uncertainty about some of its elements. Here it is shown that this uncertainty can be…

Descriptors: Bayesian Statistics, Item Response Theory, Simulation, Models

Observed Score and True Score Equating Procedures for Multidimensional Item Response Theory

Peer reviewed

Direct link

Brossman, Bradley G.; Lee, Won-Chan – Applied Psychological Measurement, 2013

The purpose of this research was to develop observed score and true score equating procedures to be used in conjunction with the multidimensional item response theory (MIRT) framework. Three equating procedures--two observed score procedures and one true score procedure--were created and described in detail. One observed score procedure was…

Descriptors: Equated Scores, True Scores, Item Response Theory, Mathematics Tests

Exploratory Mokken Scale Analysis as a Dimensionality Assessment Tool: Why Scalability Does Not Imply Unidimensionality

Peer reviewed

Direct link

Smits, Iris A. M.; Timmerman, Marieke E.; Meijer, Rob R. – Applied Psychological Measurement, 2012

The assessment of the number of dimensions and the dimensionality structure of questionnaire data is important in scale evaluation. In this study, the authors evaluate two dimensionality assessment procedures in the context of Mokken scale analysis (MSA), using a so-called fixed lowerbound. The comparative simulation study, covering various…

Descriptors: Simulation, Measures (Individuals), Program Effectiveness, Item Response Theory

A Latent Class Approach to Estimating Test-Score Reliability

Peer reviewed

Direct link

van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas – Applied Psychological Measurement, 2011

This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…

Descriptors: Simulation, Reliability, Measurement, Psychology

Using a Linear Regression Method to Detect Outliers in IRT Common Item Equating

Peer reviewed

Direct link

He, Yong; Cui, Zhongmin; Fang, Yu; Chen, Hanwei – Applied Psychological Measurement, 2013

Common test items play an important role in equating alternate test forms under the common item nonequivalent groups design. When the item response theory (IRT) method is applied in equating, inconsistent item parameter estimates among common items can lead to large bias in equated scores. It is prudent to evaluate inconsistency in parameter…

Descriptors: Regression (Statistics), Item Response Theory, Test Items, Equated Scores

Detection of Answer Copying Based on the Structure of a High-Stakes Test

Peer reviewed

Direct link

Belov, Dmitry I. – Applied Psychological Measurement, 2011

This article presents the Variable Match Index (VM-Index), a new statistic for detecting answer copying. The power of the VM-Index relies on two-dimensional conditioning as well as the structure of the test. The asymptotic distribution of the VM-Index is analyzed by reduction to Poisson trials. A computational study comparing the VM-Index with the…

Descriptors: Cheating, Journal Articles, Computation, Comparative Analysis

Automated Test Assembly Using lp_Solve Version 5.5 in R

Peer reviewed

Direct link

Diao, Qi; van der Linden, Wim J. – Applied Psychological Measurement, 2011

This article reviews the use of the software program lp_solve version 5.5 for solving mixed-integer automated test assembly (ATA) problems. The program is freely available under Lesser General Public License 2 (LGPL2). It can be called from the statistical language R using the lpSolveAPI interface. Three empirical problems are presented to…

Descriptors: Adaptive Testing, Computer Software, Literature Reviews, Computer Assisted Testing

Item Vector Plots for the Multidimensional Three-Parameter Logistic Model

Peer reviewed

Direct link

Bryant, Damon; Davis, Larry – Applied Psychological Measurement, 2011

This brief technical note describes how to construct item vector plots for dichotomously scored items fitting the multidimensional three-parameter logistic model (M3PLM). As multidimensional item response theory (MIRT) shows promise of being a very useful framework in the test development life cycle, graphical tools that facilitate understanding…

Descriptors: Visual Aids, Item Response Theory, Evaluation Methods, Test Preparation

Two Approaches for Using Multiple Anchors in NEAT Equating: A Description and Demonstration

Peer reviewed

Direct link

Moses, Tim; Deng, Weiling; Zhang, Yu-Li – Applied Psychological Measurement, 2011

Nonequivalent groups with anchor test (NEAT) equating functions that use a single anchor can have accuracy problems when the groups are extremely different and/or when the anchor weakly correlates with the tests being equated. Proposals have been made to address these issues by incorporating more than one anchor into NEAT equating functions. These…

Descriptors: Equated Scores, Tests, Comparative Analysis, Correlation

A Linear Variable-[theta] Model for Measuring Individual Differences in Response Precision

Peer reviewed

Direct link

Ferrando, Pere J. – Applied Psychological Measurement, 2011

Models for measuring individual response precision have been proposed for binary and graded responses. However, more continuous formats are quite common in personality measurement and are usually analyzed with the linear factor analysis model. This study extends the general Gaussian person-fluctuation model to the continuous-response case and…

Descriptors: Factor Analysis, Models, Individual Differences, Responses

Standard Errors and Confidence Intervals from Bootstrapping for Ramsay-Curve Item Response Theory Model Item Parameters

Peer reviewed

Direct link

Gu, Fei; Skorupski, William P.; Hoyle, Larry; Kingston, Neal M. – Applied Psychological Measurement, 2011

Ramsay-curve item response theory (RC-IRT) is a nonparametric procedure that estimates the latent trait using splines, and no distributional assumption about the latent trait is required. For item parameters of the two-parameter logistic (2-PL), three-parameter logistic (3-PL), and polytomous IRT models, RC-IRT can provide more accurate estimates…

Descriptors: Intervals, Item Response Theory, Models, Evaluation Methods

Investigating Change in Intraindividual Factor Structure over Time

Peer reviewed

Direct link

Rausch, Joseph R. – Applied Psychological Measurement, 2009

The investigation of change in factor structure over time can provide new opportunities for the development of theory in psychology. The method proposed to investigate change in intraindividual factor structure over time is an extension of P-technique factor analysis, in which the P-technique factor model is fit within relatively small windows of…

Descriptors: Monte Carlo Methods, Factor Structure, Factor Analysis, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Woods, Carol M.	6
Ferrando, Pere J.	4
Belov, Dmitry I.	3
Brennan, Robert L.	3
Roberts, James S.	3
van der Linden, Wim J.	3
Armstrong, Ronald D.	2
Dorans, Neil J.	2
Hanson, Bradley A.	2
Harris, Deborah J.	2
Kolen, Michael J.	2
Penfield, Randall D.	2
Sijtsma, Klaas	2
Wang, Wen-Chung	2
van der Ark, L. Andries	2
Ackerman, Terry A.	1
Attali, Yigal	1
Bargmann, Jens	1
Bejar, Isaac I.	1
Ben-Simon, Anat	1
Beretvas, S. Natasha	1
Berger, Martijn P. F.	1
Bergeron, Jennifer M.	1
Bolt, Daniel M.	1
More ▼