ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	38

Descriptor

Error of Measurement	85
Item Response Theory	31
Simulation	20
Test Items	20
Computation	19
Scores	18
Reliability	17
Monte Carlo Methods	15
Statistical Analysis	15
Correlation	13
Equations (Mathematics)	12
Comparative Analysis	11
Models	11
Sample Size	11
Equated Scores	10
Estimation (Mathematics)	10
Test Bias	10
Test Reliability	10
Test Theory	10
Mathematical Models	9
Test Length	9
Adaptive Testing	8
Evaluation Methods	8
Factor Analysis	8
Maximum Likelihood Statistics	8
More ▼

Source

Applied Psychological…

Publication Type

Journal Articles	79
Reports - Evaluative	39
Reports - Research	24
Reports - Descriptive	15
Book/Product Reviews	2
Information Analyses	2
Speeches/Meeting Papers	2
Collected Works - General	1
Opinion Papers	1
Reports - General	1

Education Level

Elementary Education	1
Higher Education	1

Audience

Practitioners

Location

Germany	2
Australia	1
Canada (Toronto)	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Armed Forces Qualification…	1
Eysenck Personality Inventory	1
Law School Admission Test	1
National Assessment of…	1
Wechsler Preschool and…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 85 results Save | Export

Comment on 3PL IRT Adjustment for Guessing

Peer reviewed

Direct link

Chiu, Ting-Wei; Camilli, Gregory – Applied Psychological Measurement, 2013

Guessing behavior is an issue discussed widely with regard to multiple choice tests. Its primary effect is on number-correct scores for examinees at lower levels of proficiency. This is a systematic error or bias, which increases observed test scores. Guessing also can inflate random error variance. Correction or adjustment for guessing formulas…

Descriptors: Item Response Theory, Guessing (Tests), Multiple Choice Tests, Error of Measurement

The Reliability and Precision of Total Scores and IRT Estimates as a Function of Polytomous IRT Parameters and Latent Trait Distribution

Peer reviewed

Direct link

Culpepper, Steven Andrew – Applied Psychological Measurement, 2013

A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…

Descriptors: Item Response Theory, Reliability, Scores, Error of Measurement

Using the Graded Response Model to Control Spurious Interactions in Moderated Multiple Regression

Peer reviewed

Direct link

Morse, Brendan J.; Johanson, George A.; Griffeth, Rodger W. – Applied Psychological Measurement, 2012

Recent simulation research has demonstrated that using simple raw score to operationalize a latent construct can result in inflated Type I error rates for the interaction term of a moderated statistical model when the interaction (or lack thereof) is proposed at the latent variable level. Rescaling the scores using an appropriate item response…

Descriptors: Item Response Theory, Multiple Regression Analysis, Error of Measurement, Models

Projective Item Response Model for Test-Independent Measurement

Peer reviewed

Direct link

Ip, Edward Hak-Sing; Chen, Shyh-Huei – Applied Psychological Measurement, 2012

The problem of fitting unidimensional item-response models to potentially multidimensional data has been extensively studied. The focus of this article is on response data that contains a major dimension of interest but that may also contain minor nuisance dimensions. Because fitting a unidimensional model to multidimensional data results in…

Descriptors: Measurement, Item Response Theory, Scores, Computation

Comparing the Performance of Five Multidimensional CAT Selection Procedures with Different Stopping Rules

Peer reviewed

Direct link

Yao, Lihua – Applied Psychological Measurement, 2013

Through simulated data, five multidimensional computerized adaptive testing (MCAT) selection procedures with varying test lengths are examined and compared using different stopping rules. Fixed item exposure rates are used for all the items, and the Priority Index (PI) method is used for the content constraints. Two stopping rules, standard error…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection

Taking the Error Term of the Factor Model into Account: The Factor Score Predictor Interval

Peer reviewed

Direct link

Beauducel, Andre – Applied Psychological Measurement, 2013

The problem of factor score indeterminacy implies that the factor and the error scores cannot be completely disentangled in the factor model. It is therefore proposed to compute Harman's factor score predictor that contains an additive combination of factor and error variance. This additive combination is discussed in the framework of classical…

Descriptors: Factor Analysis, Predictor Variables, Reliability, Error of Measurement

The MIMIC Model as a Tool for Differential Bundle Functioning Detection

Peer reviewed

Direct link

Finch, W. Holmes – Applied Psychological Measurement, 2012

Increasingly, researchers interested in identifying potentially biased test items are encouraged to use a confirmatory, rather than exploratory, approach. One such method for confirmatory testing is rooted in differential bundle functioning (DBF), where hypotheses regarding potential differential item functioning (DIF) for sets of items (bundles)…

Descriptors: Test Bias, Test Items, Statistical Analysis, Models

Evaluating EIV, OLS, and SEM Estimators of Group Slope Differences in the Presence of Measurement Error: The Single-Indicator Case

Peer reviewed

Direct link

Culpepper, Steven Andrew – Applied Psychological Measurement, 2012

Measurement error significantly biases interaction effects and distorts researchers' inferences regarding interactive hypotheses. This article focuses on the single-indicator case and shows how to accurately estimate group slope differences by disattenuating interaction effects with errors-in-variables (EIV) regression. New analytic findings were…

Descriptors: Evidence, Test Length, Interaction, Regression (Statistics)

MIMIC Methods for Assessing Differential Item Functioning in Polytomous Items

Peer reviewed

Direct link

Wang, Wen-Chung; Shih, Ching-Lin – Applied Psychological Measurement, 2010

Three multiple indicators-multiple causes (MIMIC) methods, namely, the standard MIMIC method (M-ST), the MIMIC method with scale purification (M-SP), and the MIMIC method with a pure anchor (M-PA), were developed to assess differential item functioning (DIF) in polytomous items. In a series of simulations, it appeared that all three methods…

Descriptors: Methods, Test Bias, Test Items, Error of Measurement

Conservativeness in Rejection of the Null Hypothesis when Using the Continuity Correction in the MH Chi-Square Test in DIF Applications

Peer reviewed

Direct link

Paek, Insu – Applied Psychological Measurement, 2010

Conservative bias in rejection of a null hypothesis from using the continuity correction in the Mantel-Haenszel (MH) procedure was examined through simulation in a differential item functioning (DIF) investigation context in which statistical testing uses a prespecified level [alpha] for the decision on an item with respect to DIF. The standard MH…

Descriptors: Test Bias, Statistical Analysis, Sample Size, Error of Measurement

Marginal Maximum A Posteriori Item Parameter Estimation for the Generalized Graded Unfolding Model

Peer reviewed

Direct link

Roberts, James S.; Thompson, Vanessa M. – Applied Psychological Measurement, 2011

A marginal maximum a posteriori (MMAP) procedure was implemented to estimate item parameters in the generalized graded unfolding model (GGUM). Estimates from the MMAP method were compared with those derived from marginal maximum likelihood (MML) and Markov chain Monte Carlo (MCMC) procedures in a recovery simulation that varied sample size,…

Descriptors: Statistical Analysis, Markov Processes, Computation, Monte Carlo Methods

Multidimensional Item Response Theory Parameter Estimation with Nonsimple Structure Items

Peer reviewed

Direct link

Finch, Holmes – Applied Psychological Measurement, 2011

Estimation of multidimensional item response theory (MIRT) model parameters can be carried out using the normal ogive with unweighted least squares estimation with the normal-ogive harmonic analysis robust method (NOHARM) software. Previous simulation research has demonstrated that this approach does yield accurate and efficient estimates of item…

Descriptors: Item Response Theory, Computation, Test Items, Simulation

Alternative Matching Scores to Control Type I Error of the Mantel-Haenszel Procedure for DIF in Dichotomously Scored Items Conforming to 3PL IRT and Nonparametric 4PBCB Models

Peer reviewed

Direct link

Monahan, Patrick O.; Ankenmann, Robert D. – Applied Psychological Measurement, 2010

When the matching score is either less than perfectly reliable or not a sufficient statistic for determining latent proficiency in data conforming to item response theory (IRT) models, Type I error (TIE) inflation may occur for the Mantel-Haenszel (MH) procedure or any differential item functioning (DIF) procedure that matches on summed-item…

Descriptors: Error of Measurement, Item Response Theory, Test Bias, Scores

The Comparative Performance of Conditional Independence Indices

Peer reviewed

Direct link

Kim, Doyoung; De Ayala, R. J.; Ferdous, Abdullah A.; Nering, Michael L. – Applied Psychological Measurement, 2011

To realize the benefits of item response theory (IRT), one must have model-data fit. One facet of a model-data fit investigation involves assessing the tenability of the conditional item independence (CII) assumption. In this Monte Carlo study, the comparative performance of 10 indices for identifying conditional item dependence is assessed. The…

Descriptors: Item Response Theory, Monte Carlo Methods, Error of Measurement, Statistical Analysis

Asymptotic and Sampling-Based Standard Errors for Two Population Invariance Measures in the Linear Equating Case

Peer reviewed

Direct link

Rijmen, Frank; Manalo, Jonathan R.; von Davier, Alina A. – Applied Psychological Measurement, 2009

This article describes two methods for obtaining the standard errors of two commonly used population invariance measures of equating functions: the root mean square difference of the subpopulation equating functions from the overall equating function and the root expected mean square difference. The delta method relies on an analytical…

Descriptors: Error of Measurement, Sampling, Equated Scores, Statistical Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

van der Linden, Wim J.	4
Finch, Holmes	3
Nering, Michael L.	3
Ogasawara, Haruhiko	3
Andrich, David	2
Brennan, Robert L.	2
Camilli, Gregory	2
Culpepper, Steven Andrew	2
Forsyth, Robert A.	2
Hanson, Bradley A.	2
Humphreys, Lloyd G.	2
Levin, Joel R.	2
Oshima, T. C.	2
Raju, Nambury S.	2
Samejima, Fumiko	2
Shigemasu, Kazuo	2
Stark, Stephen	2
Subkoviak, Michael J.	2
Wang, Wen-Chung	2
Whitely, Susan E.	2
Woods, Carol M.	2
Zeng, Lingjia	2
Zimmerman, Donald W.	2
Aguinis, Herman	1
More ▼