ERIC - Search Results

Publication Date

In 2025	3
Since 2024	4
Since 2021 (last 5 years)	10
Since 2016 (last 10 years)	20
Since 2006 (last 20 years)	41

Descriptor

Error of Measurement	78
Scores	78
Test Reliability	78
Test Validity	23
Test Interpretation	14
Test Items	13
Item Response Theory	12
Correlation	11
Foreign Countries	10
Measurement Techniques	10
Statistical Analysis	10
Testing Problems	10
Academic Achievement	9
College Entrance Examinations	9
Mathematical Models	9
Standardized Tests	9
Test Construction	9
True Scores	9
Psychometrics	8
Scoring	8
Test Bias	8
Comparative Analysis	7
Computation	7
Factor Analysis	7
Generalizability Theory	7
More ▼

Publication Type

Journal Articles	44
Reports - Research	41
Reports - Evaluative	19
Speeches/Meeting Papers	9
Reports - Descriptive	8
Guides - Non-Classroom	3
Numerical/Quantitative Data	3
Dissertations/Theses -…	2
Guides - General	2
Opinion Papers	2
Collected Works - Serials	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	10
Secondary Education	10
Postsecondary Education	8
High Schools	7
Elementary Secondary Education	4
Junior High Schools	4
Middle Schools	4
Grade 10	3
Grade 9	3
Elementary Education	2
Grade 11	2
Grade 12	2
Grade 5	2
Kindergarten	2
Grade 4	1
Grade 8	1
Intermediate Grades	1
More ▼

Audience

Researchers

Location

Netherlands	3
Canada	2
Indonesia	2
Spain	2
California	1
Denmark	1
Georgia	1
Germany	1
North Carolina	1
Oklahoma	1
South Africa	1
South Korea	1
United Kingdom (England)	1
United Kingdom (Great Britain)	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	1
Race to the Top	1

Assessments and Surveys

ACT Assessment	4
General Educational…	3
Early Childhood Longitudinal…	2
Wechsler Adult Intelligence…	2
Advanced Placement…	1
Armed Forces Qualification…	1
Cognitive Abilities Test	1
Iowa Tests of Basic Skills	1
MacArthur Communicative…	1
Metropolitan Achievement Tests	1
National Merit Scholarship…	1
New Jersey College Basic…	1
Preliminary Scholastic…	1
Program for International…	1
Student Teacher Relationship…	1
Test of English as a Foreign…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 78 results Save | Export

The Sensitivity of Value-Added Estimates to Test Scoring Decisions. EdWorkingPaper No. 25-1226

Download full text

Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025

Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…

Descriptors: Value Added Models, Tests, Testing, Scoring

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

How Did Spain Perform in PISA 2018? New Estimates of Children's PISA Reading Scores

Peer reviewed

Direct link

John Jerrim; Luis Alejandro Lopez-Agudo; Oscar David Marcenaro-Gutierrez – British Journal of Educational Studies, 2024

International large-scale assessments have gained much attention since the beginning of the twenty-first century, influencing education legislation in many countries. This includes Spain, where they have been used by successive governments to justify education policy change. Unfortunately, there was a problem with the PISA 2018 reading scores for…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

Lagged Dependent Variable Predictors, Classical Measurement Error, and Path Dependency: The Conditions under Which Various Estimators Are Appropriate

Peer reviewed

Direct link

Anders Holm; Anders Hjorth-Trolle; Robert Andersen – Sociological Methods & Research, 2025

Lagged dependent variables (LDVs) are often used as predictors in ordinary least squares (OLS) models in the social sciences. Although several estimators are commonly employed, little is known about their relative merits in the presence of classical measurement error and different longitudinal processes. We assess the performance of four commonly…

Descriptors: Elementary Education, Scores, Error of Measurement, Predictor Variables

Initial Evidence Supporting Interpretations of Scores from the Enhanced ACT Test. ACT Research. Research Report. R2425

Download full text

Jeff Allen; Ty Cruce – ACT Education Corp., 2025

This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…

Descriptors: College Entrance Examinations, Testing, Change, Scores

The Video Engagement Scale (VES): Measurement Properties of the Full and Shortened VES across Studies

Peer reviewed

Direct link

Lehmann, Vicky; Hillen, Marij A.; Verdam, Mathilde G. E.; Pieterse, Arwen H.; Labrie, Nanon H. M.; Fruijtier, Agnetha D.; Oreel, Tom H.; Smets, Ellen M. A.; Visser, Leonie N. C. – International Journal of Social Research Methodology, 2023

The Video Engagement Scale (VES) is a quality indicator to assess engagement in experimental video-vignette studies, but its measurement properties warrant improvement. Data from previous studies were combined (N = 2676) and split into three subsamples for a stepped analytical approach. We tested construct validity, criterion validity,…

Descriptors: Likert Scales, Video Technology, Vignettes, Construct Validity

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Conditional Precision of Measurement for Test Scores: Are Conditional Standard Errors Sufficient?

Peer reviewed

Direct link

Nicewander, W. Alan – Educational and Psychological Measurement, 2019

This inquiry is focused on three indicators of the precision of measurement--conditional on fixed values of ?, the latent variable of item response theory (IRT). The indicators that are compared are (1) The traditional, conditional standard errors, s(eX|?) = CSEM; (2) the IRT-based conditional standard errors, s[subscript irt](eX|?)=C[subscript…

Descriptors: Measurement, Accuracy, Scores, Error of Measurement

Test of Measurement Invariance, and Evidence for Reliability and Validity of AMAS Scores in Dutch Secondary School and University Students

Peer reviewed

Direct link

Schmitz, Eva A.; Salemink, Elske; Wiers, Reinout W.; Jansen, Brenda R. J. – Journal of Psychoeducational Assessment, 2022

The Abbreviated Math Anxiety Scale (AMAS) is commonly used to compare groups on math anxiety. Group comparisons should however be preceded by a demonstration of metric and scalar measurement invariance, which is currently only available for undergraduate students in the USA. This study tested for metric and scalar measurement invariance of AMAS…

Descriptors: Foreign Countries, Secondary School Students, College Students, Mathematics Anxiety

Bayesian Approaches to Test Score Measurement Errors in Student Growth Prediction Models

Direct link

Pei-Hsuan Chiu – ProQuest LLC, 2018

Evidence of student growth is a primary outcome of interest for educational accountability systems. When three or more years of student test data are available, questions around how students grow and what their predicted growth is can be answered. Given that test scores contain measurement error, this error should be considered in growth and…

Descriptors: Bayesian Statistics, Scores, Error of Measurement, Growth Models

Measuring the Development of General Language Skills in English as a Foreign Language--Longitudinal Invariance of the C-Test

Peer reviewed

Direct link

Schnoor, Birger; Hartig, Johannes; Klinger, Thorsten; Naumann, Alexander; Usanova, Irina – Language Testing, 2023

Research on assessing English as a foreign language (EFL) development has been growing recently. However, empirical evidence from longitudinal analyses based on substantial samples is still needed. In such settings, tests for measuring language development must meet high standards of test quality such as validity, reliability, and objectivity, as…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Longitudinal Studies

Student Perceptions of Teaching Quality in Five Countries: A Partial Credit Model Approach to Assess Measurement Invariance

Peer reviewed

Direct link

van der Lans, Rikkert M.; Maulana, Ridwan; Helms-Lorenz, Michelle; Fernández-García, Carmen-María; Chun, Seyeoung; de Jager, Thelma; Irnidayanti, Yulia; Inda-Caro, Mercedes; Lee, Okhwa; Coetzee, Thys; Fadhilah, Nurul; Jeon, Meae; Moorer, Peter – SAGE Open, 2021

This study examines measurement invariance of student perceptions of teaching quality collected in five countries: Indonesia (n students = 6,331), the Netherlands (n students = 6,738), South Africa (n students = 3,422), South Korea (n students = 6,997) and Spain (n students = 4,676). The administered questionnaire was the My Teacher Questionnaire…

Descriptors: Foreign Countries, Student Attitudes, Student Evaluation of Teacher Performance, Teacher Effectiveness

Reliability of English Learners' Test Scores. Technical Brief

Download full text

Moore, Joann L.; Li, Tianli; Lu, Yang – ACT, Inc., 2020

The Every Student Succeeds Act requires that English Learners (ELs) are included in annual state testing (grades 3-8 and once in high school) and included in each state's accountability system disaggregated by subgroup to ensure that they receive the support they need to learn English, participate fully in their education experience, and graduate…

Descriptors: College Entrance Examinations, Scores, English Language Learners, Accountability

The Exchangeability of Brief Intelligence Tests for Children with Intellectual Giftedness: Illuminating Error Variance Components' Influence on IQs

Peer reviewed

Direct link

Irby, Sarah M.; Floyd, Randy G. – Psychology in the Schools, 2017

This study examined the exchangeability of total scores (i.e., intelligent quotients [IQs]) from three brief intelligence tests. Tests were administered to 36 children with intellectual giftedness, scored live by one set of primary examiners and later scored by a secondary examiner. For each student, six IQs were calculated, and all 216 values…

Descriptors: Intelligence Tests, Gifted, Error of Measurement, Scores

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Direct link

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Educational and Psychological…	6
ACT, Inc.	2
Applied Measurement in…	2
Applied Psychological…	2
ETS Research Report Series	2
GED Testing Service	2
Grantee Submission	2
International Journal of…	2
Journal of Educational…	2
National Center for Education…	2
ProQuest LLC	2
ACT Education Corp.	1
Annenberg Institute for…	1
Assessment & Evaluation in…	1
British Journal of…	1
Brookings Papers on Education…	1
Clinical Linguistics &…	1
Communique	1
Dyslexia	1
EURASIA Journal of…	1
Education and Information…	1
Educational Assessment	1
Educational Measurement:…	1
Gifted Child Quarterly	1
IEEE Transactions on Education	1
More ▼

Blaker, Lisa	2
Dedrick, Robert F.	2
Ho, Andrew D.	2
Lê, Thanh	2
Najarian, Michelle	2
Nicewander, W. Alan	2
Nord, Christine	2
Reardon, Sean F.	2
Setzer, J. Carl	2
Shaunessy-Dedrick, Elizabeth	2
Suldo, Shannon M.	2
Tourangeau, Karen	2
Vaden-Kiernan, Nancy	2
Wallner-Allen, Kathleen	2
Zimmerman, Donald W.	2
Anders Hjorth-Trolle	1
Anders Holm	1
Atkinson, Leslie	1
Bardhoshi, Gerta	1
Barker, Pierce	1
Benjamin W. Domingue	1
Bleses, Dorthe	1
Bowes, Neal	1
Bradshaw, Jenny	1
More ▼