ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	29

Descriptor

Error of Measurement	121
True Scores	121
Test Reliability	37
Statistical Analysis	36
Reliability	30
Mathematical Models	27
Correlation	25
Scores	20
Item Response Theory	18
Test Interpretation	18
Comparative Analysis	17
Measurement Techniques	16
Raw Scores	14
Test Theory	14
Equated Scores	13
Test Items	13
Analysis of Variance	12
Estimation (Mathematics)	11
Item Analysis	10
Models	10
Simulation	10
Testing Problems	10
Criterion Referenced Tests	9
Cutting Scores	9
Measurement	9
More ▼

Publication Type

Journal Articles	58
Reports - Research	51
Reports - Evaluative	26
Speeches/Meeting Papers	11
Reports - Descriptive	6
Information Analyses	2
Opinion Papers	2
Dissertations/Theses -…	1
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Reports - General	1
More ▼

Education Level

Early Childhood Education	1
Elementary Education	1
Grade 2	1
Higher Education	1
Junior High Schools	1
Preschool Education	1

Audience

Researchers	5
Practitioners	2
Administrators	1
Teachers	1

Location

Australia	1
Canada	1
Oregon	1
Taiwan	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	2
ACT Assessment	1
Dynamic Indicators of Basic…	1
Iowa Tests of Basic Skills	1
National Longitudinal Study…	1
Test of Standard Written…	1
Vineland Adaptive Behavior…	1
Wechsler Intelligence Scale…	1
Work Keys (ACT)	1

What Works Clearinghouse Rating

Error of Measurement X

Showing 1 to 15 of 121 results Save | Export

Estimating Standard Errors of IRT True Score Equating Coefficients Using Imputed Item Parameters

Peer reviewed

Direct link

Zhang, Zhonghua – Journal of Experimental Education, 2022

Reporting standard errors of equating has been advocated as a standard practice when conducting test equating. The two most widely applied procedures for standard errors of equating including the bootstrap method and the delta method are either computationally intensive or confined to the derivations of complicated formulas. In the current study,…

Descriptors: Error of Measurement, Item Response Theory, True Scores, Equated Scores

Asymptotic Standard Errors of Equating Coefficients Using the Characteristic Curve Methods for the Graded Response Model

Peer reviewed

Direct link

Zhang, Zhonghua – Applied Measurement in Education, 2020

The characteristic curve methods have been applied to estimate the equating coefficients in test equating under the graded response model (GRM). However, the approaches for obtaining the standard errors for the estimates of these coefficients have not been developed and examined. In this study, the delta method was applied to derive the…

Descriptors: Error of Measurement, Computation, Equated Scores, True Scores

Modeling of Item Response Functions under the D-Scoring Method

Peer reviewed

Direct link

Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2020

This study presents new models for item response functions (IRFs) in the framework of the D-scoring method (DSM) that is gaining attention in the field of educational and psychological measurement and largescale assessments. In a previous work on DSM, the IRFs of binary items were estimated using a logistic regression model (LRM). However, the LRM…

Descriptors: Item Response Theory, Scoring, True Scores, Scaling

Measurement Error and Equating Error in Power Analysis

Peer reviewed
PDF on ERIC

Download full text

Phillips, Gary W.; Jiang, Tao – Practical Assessment, Research & Evaluation, 2016

Power analysis is a fundamental prerequisite for conducting scientific research. Without power analysis the researcher has no way of knowing whether the sample size is large enough to detect the effect he or she is looking for. This paper demonstrates how psychometric factors such as measurement error and equating error affect the power of…

Descriptors: Error of Measurement, Statistical Analysis, Equated Scores, Sample Size

An Extension of IRT-Based Equating to the Dichotomous Testlet Response Theory Model

Peer reviewed

Direct link

Tao, Wei; Cao, Yi – Applied Measurement in Education, 2016

Current procedures for equating number-correct scores using traditional item response theory (IRT) methods assume local independence. However, when tests are constructed using testlets, one concern is the violation of the local item independence assumption. The testlet response theory (TRT) model is one way to accommodate local item dependence.…

Descriptors: Item Response Theory, Equated Scores, Test Format, Models

Asymptotic Standard Errors for Item Response Theory True Score Equating of Polytomous Items

Peer reviewed

Direct link

Cher Wong, Cheow – Journal of Educational Measurement, 2015

Building on previous works by Lord and Ogasawara for dichotomous items, this article proposes an approach to derive the asymptotic standard errors of item response theory true score equating involving polytomous items, for equivalent and nonequivalent groups of examinees. This analytical approach could be used in place of empirical methods like…

Descriptors: Item Response Theory, Error of Measurement, True Scores, Equated Scores

Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

Peer reviewed

Direct link

Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…

Descriptors: Test Bias, Test Reliability, Performance, Scores

On the Relationship between Classical Test Theory and Item Response Theory: From One to the Other and Back

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2016

The frequently neglected and often misunderstood relationship between classical test theory and item response theory is discussed for the unidimensional case with binary measures and no guessing. It is pointed out that popular item response models can be directly obtained from classical test theory-based models by accounting for the discrete…

Descriptors: Test Theory, Item Response Theory, Models, Correlation

Quantifying Error and Uncertainty Reductions in Scaling Functions: An ITEMS Module

Peer reviewed

Direct link

Moses, Tim – Educational Measurement: Issues and Practice, 2014

This module describes and extends X-to-Y regression measures that have been proposed for use in the assessment of X-to-Y scaling and equating results. Measures are developed that are similar to those based on prediction error in regression analyses but that are directly suited to interests in scaling and equating evaluations. The regression and…

Descriptors: Scaling, Regression (Statistics), Equated Scores, Comparative Analysis

Relationships of Measurement Error and Prediction Error in Observed-Score Regression

Peer reviewed

Direct link

Moses, Tim – Journal of Educational Measurement, 2012

The focus of this paper is assessing the impact of measurement errors on the prediction error of an observed-score regression. Measures are presented and described for decomposing the linear regression's prediction error variance into parts attributable to the true score variance and the error variances of the dependent variable and the predictor…

Descriptors: Error of Measurement, Prediction, Regression (Statistics), True Scores

Assessing First- and Second-Order Equity for the Common-Item Nonequivalent Groups Design Using Multidimensional IRT

Direct link

Andrews, Benjamin James – ProQuest LLC, 2011

The equity properties can be used to assess the quality of an equating. The degree to which expected scores conditional on ability are similar between test forms is referred to as first-order equity. Second-order equity is the degree to which conditional standard errors of measurement are similar between test forms after equating. The purpose of…

Descriptors: Test Format, Advanced Placement, Simulation, True Scores

Reliability Generalization: An Examination of the Positive Affect and Negative Affect Schedule

Peer reviewed

Direct link

Leue, Anja; Lange, Sebastian – Assessment, 2011

The assessment of positive affect (PA) and negative affect (NA) by means of the Positive Affect and Negative Affect Schedule has received a remarkable popularity in the social sciences. Using a meta-analytic tool--namely, reliability generalization (RG)--population reliability scores of both scales have been investigated on the basis of a random…

Descriptors: Social Sciences, True Scores, Generalization, Affective Behavior

Measurement Properties of DIBELS Oral Reading Fluency in Grade 2: Implications for Equating Studies

Peer reviewed

Direct link

Stoolmiller, Michael; Biancarosa, Gina; Fien, Hank – Assessment for Effective Intervention, 2013

Lack of psychometric equivalence of oral reading fluency (ORF) passages used within a grade for screening and progress monitoring has recently become an issue with calls for the use of equating methods to ensure equivalence. To investigate the nature of the nonequivalence and to guide the choice of equating method to correct for nonequivalence,…

Descriptors: School Personnel, Reading Fluency, Emergent Literacy, Psychometrics

The Importance of Relying on the Manual: Scoring Error Variance in the WISC-IV Vocabulary Subtest

Peer reviewed

Direct link

Erdodi, Laszlo A.; Richard, David C. S.; Hopwood, Christopher – Journal of Psychoeducational Assessment, 2009

Classical test theory assumes that ability level has no effect on measurement error. Newer test theories, however, argue that the precision of a measurement instrument changes as a function of the examinee's true score. Research has shown that administration errors are common in the Wechsler scales and that subtests requiring subjective scoring…

Descriptors: Scoring, Error of Measurement, True Scores, Intelligence Tests

Coping with Memory Effect and Serial Correlation when Estimating Reliability in a Longitudinal Framework

Peer reviewed

Direct link

Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert; Vangeneugden, Tony; Mallinckrodt, Craig H. – Applied Psychological Measurement, 2010

Longitudinal studies are permeating clinical trials in psychiatry. Therefore, it is of utmost importance to study the psychometric properties of rating scales, frequently used in these trials, within a longitudinal framework. However, intrasubject serial correlation and memory effects are problematic issues often encountered in longitudinal data.…

Descriptors: Psychiatry, Rating Scales, Memory, Psychometrics

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Educational and Psychological…	18
Journal of Educational…	17
Psychometrika	6
Applied Measurement in…	3
Applied Psychological…	3
Journal of Educational and…	3
Journal of Experimental…	3
Educational Measurement:…	2
Journal of School Psychology	2
Advances in Health Sciences…	1
Alberta Journal of…	1
Assessment	1
Assessment for Effective…	1
Canadian Journal of Program…	1
Child Abuse and Neglect: The…	1
Developmental Psychology	1
ETS Research Report Series	1
Educational Research	1
Engineering Education	1
International Journal of…	1
Journal of Consulting and…	1
Journal of Educational…	1
Journal of Psychoeducational…	1
Journal of Vocational Behavior	1
Language, Speech, and Hearing…	1
More ▼

Livingston, Samuel A.	6
Brennan, Robert L.	4
Lord, Frederic M.	4
Harris, Chester W.	3
Kolen, Michael J.	3
Linn, Robert L.	3
Moses, Tim	3
Werts, Charles E.	3
Williams, Richard H.	3
Cureton, Edward E.	2
Dimitrov, Dimiter M.	2
Edwards, Keith J.	2
Feldt, Leonard S.	2
Kristof, Walter	2
Lee, Won-Chan	2
Longford, Nicholas T.	2
Werts, C. E.	2
Woodruff, David	2
Zhang, Zhonghua	2
Allison, Paul A.	1
Alonso, Ariel	1
Alwin, Duane F.	1
Andrews, Benjamin James	1
Atkinson, Leslie	1
More ▼