Chiu, Ting-Wei; Camilli, Gregory – Applied Psychological Measurement, 2013
Guessing is a widely discussed issue in multiple-choice testing. Its primary effect is on the number-correct scores of examinees at lower proficiency levels: it introduces a systematic error, or bias, that inflates observed test scores. Guessing can also inflate random error variance. Correction or adjustment for guessing formulas…
Descriptors: Item Response Theory, Guessing (Tests), Multiple Choice Tests, Error of Measurement
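The classical correction for guessing alluded to above is usually written as FS = R − W/(k − 1), where R is the number right, W the number wrong, and k the number of options per item. A minimal sketch of that formula score (the function name is illustrative, not from the article):

```python
def corrected_score(n_right: int, n_wrong: int, n_options: int) -> float:
    """Classical formula score: subtract the wrong answers expected
    from blind guessing among n_options choices. Omitted items are
    not penalized."""
    return n_right - n_wrong / (n_options - 1)

# A lower-proficiency examinee: 30 right, 10 wrong on 5-option items.
print(corrected_score(30, 10, 5))  # 27.5
```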
Doebler, Anna – Applied Psychological Measurement, 2012
It is shown that deviations of estimated from true values of item difficulty parameters, caused for example by item calibration errors, the neglect of randomness of item difficulty parameters, testlet effects, or rule-based item generation, can lead to systematic bias in point estimation of person parameters in the context of adaptive testing.…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computation, Item Response Theory
Hung, Lai-Fa – Applied Psychological Measurement, 2012
Rasch used a Poisson model to analyze errors and speed in reading tests. An important property of the Poisson distribution is that the mean and variance are equal. However, in social science research, it is very common for the variance to be greater than the mean (i.e., the data are overdispersed). This study embeds the Rasch model within an…
Descriptors: Social Science Research, Markov Processes, Reading Tests, Social Sciences
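The equal-mean-and-variance property of the Poisson model can be checked directly with a variance-to-mean (dispersion) ratio; values well above 1 signal the overdispersion the study addresses. A minimal sketch (the function name and data are illustrative):

```python
def dispersion_index(counts):
    """Sample variance divided by sample mean: approximately 1 under
    a Poisson model, substantially greater than 1 for overdispersed
    count data."""
    n = len(counts)
    mean = sum(counts) / n
    var = sum((c - mean) ** 2 for c in counts) / (n - 1)
    return var / mean

# Hypothetical reading-error counts: the variance far exceeds the mean.
print(dispersion_index([0, 1, 1, 2, 3, 14]))  # well above 1
```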
Attali, Yigal – Applied Psychological Measurement, 2011
Recently, Attali and Powers investigated the usefulness of providing immediate feedback on the correctness of answers to constructed response questions and the opportunity to revise incorrect answers. This article introduces an item response theory (IRT) model for scoring revised responses to questions when several attempts are allowed. The model…
Descriptors: Feedback (Response), Item Response Theory, Models, Error Correction
Finkelman, Matthew D.; Weiss, David J.; Kim-Kang, Gyenam – Applied Psychological Measurement, 2010
Assessing individual change is an important topic in both psychological and educational measurement. An adaptive measurement of change (AMC) method had previously been shown to exhibit greater efficiency in detecting change than conventional nonadaptive methods. However, little work had been done to compare different procedures within the AMC…
Descriptors: Computer Assisted Testing, Hypothesis Testing, Measurement, Item Analysis
Roberts, James S. – Applied Psychological Measurement, 2008
Orlando and Thissen (2000) developed an item fit statistic for binary item response theory (IRT) models known as S-X². This article generalizes their statistic to polytomous unfolding models. Four alternative formulations of S-X² are developed for the generalized graded unfolding model (GGUM). The GGUM is a…
Descriptors: Item Response Theory, Goodness of Fit, Test Items, Models
Cohen, Jon; Chan, Tsze; Jiang, Tao; Seburn, Mary – Applied Psychological Measurement, 2008
U.S. state educational testing programs administer tests to track student progress and hold schools accountable for educational outcomes. Methods from item response theory, especially Rasch models, are usually used to equate different forms of a test. The most popular method for estimating Rasch models yields inconsistent estimates and relies on…
Descriptors: Testing Programs, Educational Testing, Item Response Theory, Computation

Meijer, Rob R. – Applied Psychological Measurement, 1994
Through simulation, the power of the U3 statistic was compared with the power of one of the simplest person-fit statistics, the sum of the number of Guttman errors. In most cases, a weighted version of the latter statistic performed as well as the U3 statistic. (SLD)
Descriptors: Error Patterns, Item Response Theory, Nonparametric Statistics, Power (Statistics)
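With items ordered from easiest to hardest, a Guttman error is a pair in which the easier item is missed while the harder one is answered correctly; the person-fit statistic referenced above is simply the count of such pairs (the weighted version weights each pair before summing). A minimal sketch of the unweighted count (names are illustrative):

```python
def guttman_errors(responses):
    """Count Guttman errors in a 0/1 response vector whose items are
    ordered from easiest to hardest: pairs (i, j) with i easier than j
    where item i is missed but item j is answered correctly."""
    errors = 0
    for i in range(len(responses)):
        for j in range(i + 1, len(responses)):
            if responses[i] == 0 and responses[j] == 1:
                errors += 1
    return errors

print(guttman_errors([1, 1, 0, 0]))  # 0 (a perfect Guttman pattern)
print(guttman_errors([0, 0, 1, 1]))  # 4 (maximally aberrant)
```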

Whitely, Susan E. – Applied Psychological Measurement, 1979
A model which gives maximum likelihood estimates of measurement error within the context of a simplex model for practice effects is presented. The appropriateness of the model is tested for five traits, and error estimates are compared to the classical formula estimates. (Author/JKS)
Descriptors: Error of Measurement, Error Patterns, Higher Education, Mathematical Models

Baker, Frank B. – Applied Psychological Measurement, 1993
Using simulation, the effect that misspecification of elements in the weight matrix has on estimates of basic parameters of the linear logistic test model was studied. Results indicate that, because specifying elements of the weight matrix is a subjective process, it must be done with great care. (SLD)
Descriptors: Error Patterns, Estimation (Mathematics), Item Response Theory, Matrices
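In the linear logistic test model, each item difficulty is a weighted sum of basic parameters, b_i = Σ_k q_ik·η_k, so a single wrong entry in the weight matrix Q propagates directly into the difficulty it implies. A minimal sketch of that dependence (the numbers are hypothetical, not the paper's simulation):

```python
def lltm_difficulty(q_row, eta):
    """Item difficulty implied by one row of the LLTM weight matrix Q
    and the basic (operation) parameters eta."""
    return sum(q * e for q, e in zip(q_row, eta))

eta = [0.5, 1.0, -0.25]   # hypothetical basic parameters
good_row = [1, 0, 2]      # correctly specified weights for an item
bad_row = [1, 1, 2]       # one misspecified element
print(lltm_difficulty(good_row, eta))  # 0.0
print(lltm_difficulty(bad_row, eta))   # 1.0 -- the error shifts b by a full unit
```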

Kleinke, David J. – Applied Psychological Measurement, 1979
Lord's, Millman's and Saupe's methods of approximating the standard error of measurement are reviewed. Through an empirical demonstration involving 200 university classroom tests, all three approximations are shown to be biased. (Author/JKS)
Descriptors: Error of Measurement, Error Patterns, Higher Education, Mathematical Formulas
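For context, the classical SEM is SD·√(1 − reliability), while Lord's shortcut (as commonly cited, about 0.432·√k for a k-item number-correct test) needs only the test length, which is what makes such approximations attractive and, as the study shows, potentially biased. A sketch, with the 0.432 constant stated as an assumption rather than taken from the article:

```python
import math

def sem_classical(sd, reliability):
    """Classical standard error of measurement from the observed-score
    standard deviation and the reliability coefficient."""
    return sd * math.sqrt(1 - reliability)

def sem_lord(n_items):
    """Lord's rule-of-thumb SEM for number-correct scores (assumes the
    commonly cited 0.432 * sqrt(k) form)."""
    return 0.432 * math.sqrt(n_items)

print(sem_classical(10, 0.91))  # 3.0
print(sem_lord(50))             # about 3.05, using test length alone
```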

Hoogstraten, Joh. – Applied Psychological Measurement, 1979
The biasing effects of a pretest on subsequent post-test results were investigated in two experimental studies. In general, the results argue for using designs without pretests. (Author/JKS)
Descriptors: Control Groups, Error Patterns, Evaluation Methods, Higher Education
Wang, Wen-Chung; Su, Ya-Hui – Applied Psychological Measurement, 2004
Eight independent variables (differential item functioning [DIF] detection method, purification procedure, item response model, mean latent trait difference between groups, test length, DIF pattern, magnitude of DIF, and percentage of DIF items) were manipulated, and two dependent variables (Type I error and power) were assessed through…
Descriptors: Test Length, Test Bias, Simulation, Item Response Theory
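Type I error in such a simulation is the rate at which a DIF flag fires when no DIF is present. A minimal, self-contained sketch using Rasch-generated responses and a naive two-proportion z-test as the flag (the flagging method and all settings here are illustrative, not the article's design):

```python
import math
import random

random.seed(1)

def rasch_responses(thetas, b):
    """Simulate 0/1 responses to one Rasch item with difficulty b."""
    return [1 if random.random() < 1 / (1 + math.exp(-(t - b))) else 0
            for t in thetas]

def z_stat_dif(x_ref, x_foc):
    """Two-proportion z statistic on the studied item (a naive DIF
    flag, used only to illustrate Type I error estimation)."""
    n1, n2 = len(x_ref), len(x_foc)
    p1, p2 = sum(x_ref) / n1, sum(x_foc) / n2
    p = (sum(x_ref) + sum(x_foc)) / (n1 + n2)
    se = math.sqrt(p * (1 - p) * (1 / n1 + 1 / n2))
    return abs(p1 - p2) / se if se > 0 else 0.0

# No DIF and no mean latent-trait difference between groups, so every
# flag is a false positive; the rejection rate estimates Type I error.
reps, crit, flags = 200, 1.96, 0
for _ in range(reps):
    ref = [random.gauss(0, 1) for _ in range(300)]
    foc = [random.gauss(0, 1) for _ in range(300)]
    if z_stat_dif(rasch_responses(ref, 0.0), rasch_responses(foc, 0.0)) > crit:
        flags += 1
type1 = flags / reps
print(type1)  # should sit near the nominal 0.05 level
```

Power would be estimated the same way after injecting DIF, e.g. by giving the focal group a shifted item difficulty.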