ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	11
Since 2017 (last 10 years)	29
Since 2007 (last 20 years)	73

Descriptor

Correlation	127
Error of Measurement	127
Reliability	67
Test Reliability	49
Scores	35
Statistical Analysis	30
Interrater Reliability	25
Psychometrics	21
Foreign Countries	20
Measurement Techniques	20
Test Validity	19
Comparative Analysis	16
True Scores	16
Mathematical Models	13
Predictor Variables	12
Sample Size	12
Computation	11
Factor Analysis	11
Item Analysis	11
Sampling	11
Test Items	11
Validity	11
Academic Achievement	10
Analysis of Variance	10
Monte Carlo Methods	10

Publication Type

Journal Articles	88
Reports - Research	71
Reports - Evaluative	26
Speeches/Meeting Papers	12
Reports - Descriptive	9
Dissertations/Theses -…	3
Tests/Questionnaires	3
Book/Product Reviews	2
Numerical/Quantitative Data	2
Guides - General	1

Education Level

Elementary Secondary Education	8
Higher Education	7
Secondary Education	7
Elementary Education	5
High Schools	4
Postsecondary Education	4
Adult Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Middle Schools	1

Audience

Researchers	5
Administrators	1

Location

Australia	3
Canada	3
Florida	2
Portugal	2
Africa	1
California	1
Canada (Toronto)	1
China	1
Georgia	1
Germany	1
Illinois	1
Japan	1
Malaysia	1
Netherlands	1
Netherlands (Amsterdam)	1
Nevada	1
New York	1
New Zealand	1
North Carolina	1
Ohio	1
Rhode Island	1
Spain	1
Spain (Madrid)	1
Turkey	1
United Kingdom (England)	1

Assessments and Surveys

Stanford Achievement Tests	2
ACT Assessment	1
Cognitive Abilities Test	1
Flesch Kincaid Grade Level…	1
General Educational…	1
Iowa Tests of Basic Skills	1
Praxis Series	1
Program for International…	1
Rosenberg Self Esteem Scale	1
Test of English as a Foreign…	1
Wechsler Intelligence Scale…	1

Showing 1 to 15 of 127 results

Brief Research Report: Effects of Sampling Error and Categorization on Estimation of Measure of Sampling Adequacy

Peer reviewed

Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024

The second version of Kaiser's Measure of Sampling Adequacy (MSA₂) has been widely applied to assess the factorability of data in psychological research. The MSA₂ is developed in the population and little is known about its behavior in finite samples. If estimated MSA₂s are biased due to sampling errors,…

Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias

Modeling the Intraindividual Relation of Ability and Speed within a Test

Peer reviewed

Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024

Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to assure a fair assessment. Different approaches exist for estimating this relationship, that either rely on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…

Descriptors: Testing, Academic Ability, Time on Task, Correlation

A Meta-Analysis of Self-Assessment and Language Performance in Language Testing and Assessment

Peer reviewed

Li, Minzi; Zhang, Xian – Language Testing, 2021

This meta-analysis explores the correlation between self-assessment (SA) and language performance. Sixty-seven studies with 97 independent samples involving more than 68,500 participants were included in our analysis. It was found that the overall correlation between SA and language performance was 0.466 (p < 0.01). Moderator analysis was…

Descriptors: Meta Analysis, Self Evaluation (Individuals), Likert Scales, Research Reports

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

Comparing Measurement Reliability Estimation Techniques: Correlation Coefficient vs. Bland-Altman Plot

Peer reviewed

Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024

The aim of this study is to compare the results of correlation coefficient estimation of reliability with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A linear and high-level relationship was found between the scale scores obtained from the halved forms.…

Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing

Online Administration of the Test of Narrative Language--Second Edition: Psychometrics and Considerations for Remote Assessment

Peer reviewed

Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Grantee Submission, 2022

Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…

Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

Linear and Nonlinear Indices of Score Accuracy and Item Effectiveness for Measures That Contain Locally Dependent Items

Peer reviewed

Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025

The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…

Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis

Online Administration of the Test of Narrative Language--Second Edition: Psychometrics and Considerations for Remote Assessment

Peer reviewed

Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Language, Speech, and Hearing Services in Schools, 2022

Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments

Separation of Traits and Extreme Response Style in IRTree Models: The Role of Mimicry Effects for the Meaningful Interpretation of Estimates

Peer reviewed

Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024

Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…

Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity

Student Achievement, School Quality, and an Error-Prone Family Background Measure: Exploring the Sensitivity of the Heyneman-Loxley Effect in Southern and Eastern Africa

Peer reviewed

Rew, W. Joshua; Andon, Anabelle; Luschei, Thomas F. – Large-scale Assessments in Education, 2022

Background: We examine the sensitivity of the Heyneman-Loxley Effect to the influence of an error-prone family background measure in 15 education systems from Southern and Eastern Africa. Our aim is to revisit a claim by Abby Riddell from the November 1989 issue of the "Comparative Education Review" concerning the reliability of family…

Descriptors: Foreign Countries, Family Characteristics, Background, Academic Achievement

Estimating Hazard Ratios from Published Kaplan-Meier Survival Curves: A Methods Validation Study

Peer reviewed

Saluja, Ronak; Cheng, Sierra; delos Santos, Keemo Althea; Chan, Kelvin K. W. – Research Synthesis Methods, 2019

Objective: Various statistical methods have been developed to estimate hazard ratios (HRs) from published Kaplan-Meier (KM) curves for the purpose of performing meta-analyses. The objective of this study was to determine the reliability, accuracy, and precision of four commonly used methods by Guyot, Williamson, Parmar, and Hoyle and Henley.…

Descriptors: Meta Analysis, Reliability, Accuracy, Randomized Controlled Trials

On the Unlikely Case of an Error-Free Principal Component from a Set of Fallible Measures

Peer reviewed

Raykov, Tenko; Marcoulides, George A.; Li, Tenglong – Educational and Psychological Measurement, 2018

This note extends the results in the 2016 article by Raykov, Marcoulides, and Li to the case of correlated errors in a set of observed measures subjected to principal component analysis. It is shown that when at least two measures are fallible, the probability is zero for any principal component--and in particular for the first principal…

Descriptors: Factor Analysis, Error of Measurement, Correlation, Reliability

Updated Technical Manual for the IDEA Feedback System for Administrators. IDEA Technical Report No. 20

Benton, Stephen L.; Li, Dan – IDEA Center, Inc., 2018

This technical report describes the results of analyses performed on data collected from 2013 to 2017, using the IDEA Feedback System for Administrators (FSA). The FSA is used to gather impressions from core constituents about an administrator's performance of relevant administrative roles, as well as her/his leadership style, interpersonal…

Descriptors: Feedback (Response), Administrators, Administrator Attitudes, Administrator Role

Inter-Rater and Test-Retest (Between-Sessions) Reliability of the 4-Skills Scan for Dutch Elementary School Children

Peer reviewed

van Kernebeek, Willem G.; de Schipper, Antoine W.; Savelsbergh, Geert J. P.; Toussaint, Huub M. – Measurement in Physical Education and Exercise Science, 2018

In The Netherlands, the 4-Skills Scan is an instrument for physical education teachers to assess gross motor skills of elementary school children. Little is known about its reliability. Therefore, in this study the test-retest and inter-rater reliability was determined. Respectively, 624 and 557 Dutch 6- to 12-year-old children were analyzed for…

Descriptors: Foreign Countries, Interrater Reliability, Pretests Posttests, Psychomotor Skills

Source

Educational and Psychological…	20
Applied Psychological…	7
Measurement in Physical…	4
Developmental Medicine &…	3
Journal of Educational…	3
Journal of Experimental…	3
ProQuest LLC	3
Psychological Methods	3
Structural Equation Modeling:…	3
ETS Research Report Series	2
Educational Assessment	2
International Journal of…	2
Online Submission	2
Practical Assessment,…	2
Psychometrika	2
Research in Developmental…	2
Social Indicators Research	2
Advances in Health Sciences…	1
American Educational Research…	1
Assessment & Evaluation in…	1
Athletic Training Education…	1
CALICO Journal	1
Contemporary Educational…	1
Developmental Psychology	1
Early Education and…	1

Author

Zimmerman, Donald W.	4
Williams, Richard H.	3
Anna-Maria Fall	2
Beula M. Magimairaj	2
Cornwell, John M.	2
Driller, Matthew	2
Edwards, Keith J.	2
Greg Roberts	2
Linn, Robert L.	2
Moses, Tim	2
Philip Capin	2
Rae, Gordon	2
Ronald B. Gillam	2
Sandra L. Gillam	2
Sharon Vaughn	2
Werts, Charles E.	2
Aiken, Leona S.	1
Allison, Paul A.	1
Alonso, Ariel	1
Anderson, Michele A.	1
Andon, Anabelle	1
Anwyll, Steve	1
Applegate, E. Brooks	1
Aryadoust, Vahid	1