ERIC - Search Results

Showing 1 to 15 of 49 results

Brief Research Report: Effects of Sampling Error and Categorization on Estimation of Measure of Sampling Adequacy

Peer reviewed

Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024

The second version of Kaiser's Measure of Sampling Adequacy (MSA₂) has been widely applied to assess the factorability of data in psychological research. The MSA₂ is developed in the population and little is known about its behavior in finite samples. If estimated MSA₂s are biased due to sampling errors,…

Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias

Modeling the Intraindividual Relation of Ability and Speed within a Test

Peer reviewed

Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024

Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to assure a fair assessment. Different approaches exist for estimating this relationship, that either rely on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…

Descriptors: Testing, Academic Ability, Time on Task, Correlation

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

Comparing Measurement Reliability Estimation Techniques: Correlation Coefficient vs. Bland-Altman Plot

Peer reviewed

Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024

The aim of this study is to compare the results of correlation coefficient estimation of reliability with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A linear and high-level relationship was found between the scale scores obtained from the halved forms.…

Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing

Separation of Traits and Extreme Response Style in IRTree Models: The Role of Mimicry Effects for the Meaningful Interpretation of Estimates

Peer reviewed

Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024

Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…

Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity

A Meta-Analysis of Self-Assessment and Language Performance in Language Testing and Assessment

Peer reviewed

Li, Minzi; Zhang, Xian – Language Testing, 2021

This meta-analysis explores the correlation between self-assessment (SA) and language performance. Sixty-seven studies with 97 independent samples involving more than 68,500 participants were included in our analysis. It was found that the overall correlation between SA and language performance was 0.466 (p < 0.01). Moderator analysis was…

Descriptors: Meta Analysis, Self Evaluation (Individuals), Likert Scales, Research Reports

Online Administration of the Test of Narrative Language--Second Edition: Psychometrics and Considerations for Remote Assessment

Peer reviewed

Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Grantee Submission, 2022

Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…

Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments

When near Means Related: Evidence from Three Web Survey Experiments on Inter-Item Correlations in Grid Questions

Peer reviewed

Silber, Henning; Roßmann, Joss; Gummer, Tobias – International Journal of Social Research Methodology, 2018

In this article, we present the results of three question design experiments on inter-item correlations, which tested a grid design against a single-item design. The first and second experiments examined the inter-item correlations of a set with five and seven items, respectively, and the third experiment examined the impact of the question design…

Descriptors: Foreign Countries, Online Surveys, Experiments, Correlation

A Guide for Setting the Cut-Scores to Minimize Weighted Classification Errors in Test Batteries

Peer reviewed

Grabovsky, Irina; Wainer, Howard – Journal of Educational and Behavioral Statistics, 2017

In this article, we extend the methodology of the Cut-Score Operating Function that we introduced previously and apply it to a testing scenario with multiple independent components and different testing policies. We derive analytically the overall classification error rate for a test battery under the policy when several retakes are allowed for…

Descriptors: Cutting Scores, Weighted Scores, Classification, Testing

Online Administration of the Test of Narrative Language--Second Edition: Psychometrics and Considerations for Remote Assessment

Peer reviewed

Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Language, Speech, and Hearing Services in Schools, 2022

Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments

The Reliability of a 5km Run Test on a Motorized Treadmill

Peer reviewed

Driller, Matthew; Brophy-Williams, Ned; Walker, Anthony – Measurement in Physical Education and Exercise Science, 2017

The purpose of the present study was to determine the reliability of a 5km run test on a motorized treadmill. Over three consecutive weeks, 12 well-trained runners completed three 5km time trials on a treadmill following a standardized warm-up. Runners were partially-blinded to their running speed and distance covered. Total time to complete the…

Descriptors: Athletics, Physical Activities, Athletes, Test Reliability

The Miscalculation of Interrater Reliability: A Case Study Involving the AAC&U VALUE Rubrics

Peer reviewed

Szafran, Robert F. – Practical Assessment, Research & Evaluation, 2017

Institutional assessment of student learning objectives has become a fact-of-life in American higher education and the Association of American Colleges and Universities' (AAC&U) VALUE Rubrics have become a widely adopted evaluation and scoring tool for student work. As faculty from a variety of disciplines, some less familiar with the…

Descriptors: Interrater Reliability, Case Studies, Scoring Rubrics, Behavioral Objectives

Updated Technical Manual for the IDEA Feedback System for Administrators. IDEA Technical Report No. 20

Benton, Stephen L.; Li, Dan – IDEA Center, Inc., 2018

This technical report describes the results of analyses performed on data collected from 2013 to 2017, using the IDEA Feedback System for Administrators (FSA). The FSA is used to gather impressions from core constituents about an administrator's performance of relevant administrative roles, as well as her/his leadership style, interpersonal…

Descriptors: Feedback (Response), Administrators, Administrator Attitudes, Administrator Role

Inter-Rater and Test-Retest (Between-Sessions) Reliability of the 4-Skills Scan for Dutch Elementary School Children

Peer reviewed

van Kernebeek, Willem G.; de Schipper, Antoine W.; Savelsbergh, Geert J. P.; Toussaint, Huub M. – Measurement in Physical Education and Exercise Science, 2018

In The Netherlands, the 4-Skills Scan is an instrument for physical education teachers to assess gross motor skills of elementary school children. Little is known about its reliability. Therefore, in this study the test-retest and inter-rater reliability was determined. Respectively, 624 and 557 Dutch 6- to 12-year-old children were analyzed for…

Descriptors: Foreign Countries, Interrater Reliability, Pretests Posttests, Psychomotor Skills

The Validity and Reliability of the Gymaware Linear Position Transducer for Measuring Counter-Movement Jump Performance in Female Athletes

Peer reviewed

O'Donnell, Shannon; Tavares, Francisco; McMaster, Daniel; Chambers, Samuel; Driller, Matthew – Measurement in Physical Education and Exercise Science, 2018

The current study aimed to assess the validity and test-retest reliability of a linear position transducer when compared to a force plate through a counter-movement jump in female participants. Twenty-seven female recreational athletes (19 ± 2 years) performed three counter-movement jumps simultaneously using the linear position transducer and…

Descriptors: Test Validity, Test Reliability, Females, Athletes
