Publication Date
In 2025: 2
Since 2024: 5
Since 2021 (last 5 years): 11
Since 2016 (last 10 years): 25
Since 2006 (last 20 years): 62
Laws, Policies, & Programs
No Child Left Behind Act 2001: 1
Showing 1 to 15 of 62 results
Peer reviewed
Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024
Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to ensure a fair assessment. Different approaches exist for estimating this relationship, each relying either on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…
Descriptors: Testing, Academic Ability, Time on Task, Correlation
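The entry above concerns estimating the within-person relation between speed and ability. As a rough illustration of the general idea only (not the authors' approach), one simple strategy is to remove item effects from response times and correctness and then correlate the residuals within each examinee; all data and variable names below are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: n_persons x n_items matrices of log response times and 0/1 accuracy.
# In practice these would come from the test's response logs.
n_persons, n_items = 200, 30
log_rt = rng.normal(0.0, 1.0, size=(n_persons, n_items))
accuracy = rng.binomial(1, 0.7, size=(n_persons, n_items)).astype(float)

# Remove item effects (difficulty, time intensity) by centering each column,
# so the remaining variation is mostly within-person.
rt_centered = log_rt - log_rt.mean(axis=0)
acc_centered = accuracy - accuracy.mean(axis=0)

# For each person, correlate residual speed and residual accuracy across items.
per_person_r = []
for i in range(n_persons):
    x, y = rt_centered[i], acc_centered[i]
    if x.std() > 0 and y.std() > 0:
        per_person_r.append(np.corrcoef(x, y)[0, 1])

print(f"mean within-person speed-accuracy correlation: {np.mean(per_person_r):.3f}")
```

More principled approaches model speed and ability jointly (e.g., hierarchical response-time models), which is the kind of toolbox the paper adds to.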
Peer reviewed
Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024
This research paper investigated the importance of conducting measurement invariance analysis when developing measurement tools for assessing differences between and among study variables. Most studies that set out to develop an inventory to assess an attitude, behavior, belief, IQ, or intuition in a person's…
Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures
Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025
Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…
Descriptors: Value Added Models, Tests, Testing, Scoring
Peer reviewed
Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024
The overall aim was to examine the effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE), using both real and simulated data. Chained kernel equating, poststratification kernel equating, and circle-arc equating were studied. A college admissions test with four different…
Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests
Peer reviewed
Lockwood, Adam B.; Klatka, Kelsey; Parker, Brandon; Benson, Nicholas – Journal of Psychoeducational Assessment, 2023
Eighty Woodcock-Johnson IV Tests of Achievement protocols from 40 test administrators were examined to determine the types and frequencies of administration and scoring errors made. Non-critical errors (e.g., failure to record verbatim) were found on every protocol (M = 37.2). Critical errors (e.g., standard score, start point) were found on 98.8%…
Descriptors: Achievement Tests, Testing, Scoring, Error of Measurement
Jeff Allen; Ty Cruce – ACT Education Corp., 2025
This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…
Descriptors: College Entrance Examinations, Testing, Change, Scores
Lotfi Simon Kerzabi – ProQuest LLC, 2021
Monte Carlo methods are an accepted methodology for generating critical values for a maximum test. The same methods are also applicable to evaluating the robustness of the newly created test. A table of critical values was created, and the robustness of the new maximum test was evaluated for five different distributions. Robustness…
Descriptors: Data, Monte Carlo Methods, Testing, Evaluation Research
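A minimal sketch of the general procedure described above, under assumed specifics (a "maximum test" built here from two two-sample location statistics; not the dissertation's actual test): simulate the null distribution of the maximum statistic and take its upper quantile as the critical value. Robustness could then be assessed by repeating the simulation under non-normal distributions and recording rejection rates.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)

def max_statistic(x, y):
    """Maximum of two location tests on the same samples (illustrative 'maximum test')."""
    t = abs(stats.ttest_ind(x, y, equal_var=False).statistic)      # Welch t
    u_raw = stats.mannwhitneyu(x, y).statistic
    u = abs(u_raw - len(x) * len(y) / 2) / np.sqrt(
        len(x) * len(y) * (len(x) + len(y) + 1) / 12)               # standardized Mann-Whitney U
    return max(t, u)

def mc_critical_value(n1, n2, alpha=0.05, reps=10_000):
    """Monte Carlo critical value of the maximum statistic under H0 (equal normal populations)."""
    null_stats = [max_statistic(rng.normal(size=n1), rng.normal(size=n2)) for _ in range(reps)]
    return np.quantile(null_stats, 1 - alpha)

print(f"alpha=.05 critical value for n1=n2=30: {mc_critical_value(30, 30):.3f}")
```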
Peer reviewed
Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023
In this study, kernel test equating methods were compared under the NEAT (non-equivalent groups with anchor test) and NEC (non-equivalent groups with covariates) designs. In the NEAT design, kernel post-stratification and chained equating methods with optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and kernel test equating methods were…
Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis
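For readers unfamiliar with kernel equating, the following simplified sketch shows the core idea: continuize each form's discrete score distribution with Gaussian kernels and equate by matching the smoothed CDFs. It omits the log-linear presmoothing and variance-preserving adjustment of the full kernel method, and all score distributions are hypothetical.

```python
import numpy as np
from scipy.stats import norm

def kernel_cdf(x, scores, probs, h):
    """Gaussian-kernel continuization of a discrete score distribution (simplified)."""
    return np.sum(probs * norm.cdf((x - scores) / h))

def equate(x, scores_x, probs_x, scores_y, probs_y, h=0.6):
    """Map a form-X score to the form-Y scale by matching kernel-smoothed CDFs."""
    p = kernel_cdf(x, scores_x, probs_x, h)
    grid = np.linspace(scores_y.min() - 3 * h, scores_y.max() + 3 * h, 2001)
    cdf_y = np.array([kernel_cdf(g, scores_y, probs_y, h) for g in grid])
    return np.interp(p, cdf_y, grid)   # inverse CDF of form Y evaluated at p

# Toy example with hypothetical 0-10 score distributions on forms X and Y.
scores = np.arange(11)
probs_x = np.ones(11) / 11
probs_y = np.linspace(1, 2, 11)
probs_y /= probs_y.sum()
print(f"form-X score 6 equates to form-Y score {equate(6, scores, probs_x, scores, probs_y):.2f}")
```

The bandwidth h plays the role of the "optimal vs. large bandwidth" choice compared in the study.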
Peer reviewed
Yousuf, Mustafa S.; Miles, Katherine; Harvey, Heather; Al-Tamimi, Mohammad; Badran, Darwish – Journal of University Teaching and Learning Practice, 2022
Exams should be valid, reliable, and discriminative. Multiple informative methods are used for exam analysis; numerical displays of the results, however, may not be easily comprehended, and graphical analysis tools can convey them more clearly. Two such methods were employed: standardized x-bar control charts with…
Descriptors: Multiple Choice Tests, Testing, Test Reliability, Test Validity
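As one possible reading of the x-bar control chart idea mentioned above (not necessarily the authors' exact computation), item difficulty means can be standardized against the exam-wide mean and checked against ±3σ control limits; items outside the limits warrant review. The data below are simulated.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy data: 0/1 responses of 250 examinees to 40 MCQ items (rows = examinees).
responses = rng.binomial(1, rng.uniform(0.3, 0.9, size=40), size=(250, 40)).astype(float)

n, k = responses.shape
item_means = responses.mean(axis=0)                 # item difficulty (proportion correct)
grand_mean = item_means.mean()
se = responses.std(axis=0, ddof=1) / np.sqrt(n)     # standard error of each item mean

# Standardized x-bar chart: z-values against +/-3 sigma control limits;
# items outside the limits are unusually easy or hard relative to the exam.
z = (item_means - grand_mean) / se
flagged = np.where(np.abs(z) > 3)[0]
print("items outside the 3-sigma control limits:", flagged)
```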
Peer reviewed
Li, Xinru; Dusseldorp, Elise; Meulman, Jacqueline J. – Research Synthesis Methods, 2019
In meta-analytic studies, there are often multiple moderators available (e.g., study characteristics). In such cases, traditional meta-analysis methods often lack sufficient power to investigate interaction effects between moderators, especially high-order interactions. To overcome this problem, meta-CART was proposed: an approach that applies…
Descriptors: Correlation, Meta Analysis, Identification, Testing
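The meta-CART idea can be illustrated with an ordinary regression tree fitted to study effect sizes, using moderators as predictors and inverse-variance weights. The sketch below uses simulated studies and scikit-learn; it is not the authors' implementation, which combines CART with formal meta-analytic models.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor, export_text

rng = np.random.default_rng(7)

# Toy meta-analytic data set: one effect size per study plus two moderators.
n_studies = 120
mod_a = rng.integers(0, 2, n_studies)             # e.g., randomized vs. not (binary moderator)
mod_b = rng.uniform(5, 18, n_studies)             # e.g., mean age of the sample
var_i = rng.uniform(0.01, 0.05, n_studies)        # sampling variance of each effect size
true_effect = 0.2 + 0.3 * (mod_a * (mod_b > 12))  # built-in moderator interaction
effect = true_effect + rng.normal(0, np.sqrt(var_i))

# Let a regression tree search for moderator interactions,
# weighting each study by the inverse of its sampling variance.
X = np.column_stack([mod_a, mod_b])
tree = DecisionTreeRegressor(max_depth=2, min_samples_leaf=20)
tree.fit(X, effect, sample_weight=1.0 / var_i)
print(export_text(tree, feature_names=["mod_a", "mod_b"]))
```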
Peer reviewed
Li, Minzi; Zhang, Xian – Language Testing, 2021
This meta-analysis explores the correlation between self-assessment (SA) and language performance. Sixty-seven studies with 97 independent samples involving more than 68,500 participants were included in our analysis. It was found that the overall correlation between SA and language performance was 0.466 (p < 0.01). Moderator analysis was…
Descriptors: Meta Analysis, Self Evaluation (Individuals), Likert Scales, Research Reports
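A standard way to pool correlations such as these, shown here only as a generic illustration with made-up numbers (not the study's data or exact model), is DerSimonian-Laird random-effects meta-analysis on the Fisher-z scale:

```python
import numpy as np

def pool_correlations(r, n):
    """DerSimonian-Laird random-effects pooling of correlations on the Fisher-z scale."""
    r, n = np.asarray(r, float), np.asarray(n, float)
    z = np.arctanh(r)                 # Fisher z transform
    v = 1.0 / (n - 3)                 # within-study variance of z
    w = 1.0 / v
    z_fixed = np.sum(w * z) / np.sum(w)
    q = np.sum(w * (z - z_fixed) ** 2)                       # heterogeneity statistic Q
    c = np.sum(w) - np.sum(w ** 2) / np.sum(w)
    tau2 = max(0.0, (q - (len(r) - 1)) / c)                  # between-study variance
    w_star = 1.0 / (v + tau2)
    z_pooled = np.sum(w_star * z) / np.sum(w_star)
    return np.tanh(z_pooled), tau2

# Hypothetical correlations between self-assessment and language performance from five samples.
r_pooled, tau2 = pool_correlations([0.35, 0.52, 0.44, 0.61, 0.40], [120, 85, 300, 60, 150])
print(f"pooled r = {r_pooled:.3f}, tau^2 = {tau2:.4f}")
```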
Peer reviewed
Schweizer, Karl; Reiß, Siegbert; Troche, Stefan – Educational and Psychological Measurement, 2019
The article reports three simulation studies conducted to determine whether the effect of a time limit on testing impairs model fit in investigations of structural validity, whether representing the assumed source of the effect prevents that impairment, and whether it is possible to identify and discriminate this method effect from…
Descriptors: Timed Tests, Testing, Barriers, Testing Problems
Peer reviewed
Lu, Ru; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2021
Two families of analysis methods can be used for differential item functioning (DIF) analysis. One family is DIF analysis based on observed scores, such as the Mantel-Haenszel (MH) and standardized proportion-correct DIF procedures; the other is analysis based on latent ability, in which the statistic is a measure of departure from…
Descriptors: Robustness (Statistics), Weighted Scores, Test Items, Item Analysis
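A compact sketch of the observed-score family mentioned above: the Mantel-Haenszel common odds ratio computed across total-score strata and converted to the ETS delta scale. The data and the stratification on raw total score below are hypothetical.

```python
import numpy as np

def mantel_haenszel_dif(correct, group, total_score):
    """Mantel-Haenszel common odds ratio and ETS delta-DIF for one item.

    correct: 0/1 responses to the studied item
    group:   0 = reference group, 1 = focal group
    total_score: matching variable (e.g., observed total test score)
    """
    num, den = 0.0, 0.0
    for s in np.unique(total_score):
        m = total_score == s
        a = np.sum((group[m] == 0) & (correct[m] == 1))  # reference, right
        b = np.sum((group[m] == 0) & (correct[m] == 0))  # reference, wrong
        c = np.sum((group[m] == 1) & (correct[m] == 1))  # focal, right
        d = np.sum((group[m] == 1) & (correct[m] == 0))  # focal, wrong
        t = a + b + c + d
        if t > 0:
            num += a * d / t
            den += b * c / t
    alpha_mh = num / den
    return alpha_mh, -2.35 * np.log(alpha_mh)            # ETS delta scale

# Toy usage with simulated data (no DIF built in).
rng = np.random.default_rng(3)
n = 2000
group = rng.integers(0, 2, n)
score = rng.integers(0, 21, n)
correct = rng.binomial(1, 0.4 + 0.025 * score)
print(mantel_haenszel_dif(correct, group, score))
```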
Peer reviewed
Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Li, Tatyana; Menold, Natalja – Educational and Psychological Measurement, 2018
A latent variable modeling method for studying measurement invariance when evaluating latent constructs with multiple binary or binary-scored items with no guessing is outlined. The approach extends the continuous indicator procedure described by Raykov and colleagues, similarly utilizes the false discovery rate approach to multiple testing, and…
Descriptors: Models, Statistical Analysis, Error of Measurement, Test Bias
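The false discovery rate approach referred to above is, in its most common form, the Benjamini-Hochberg step-up procedure; a generic sketch (not the authors' full invariance-testing pipeline) follows, with hypothetical p-values.

```python
import numpy as np

def benjamini_hochberg(p_values, q=0.05):
    """Benjamini-Hochberg FDR procedure: returns a boolean rejection mask."""
    p = np.asarray(p_values, float)
    order = np.argsort(p)
    ranks = np.arange(1, len(p) + 1)
    below = p[order] <= q * ranks / len(p)          # BH step-up criterion
    reject = np.zeros(len(p), dtype=bool)
    if below.any():
        cutoff = np.max(np.where(below)[0])         # largest rank satisfying the criterion
        reject[order[: cutoff + 1]] = True
    return reject

# Example: p-values from per-item invariance tests (hypothetical numbers).
print(benjamini_hochberg([0.001, 0.004, 0.019, 0.03, 0.2, 0.7]))
```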
Peer reviewed
Grabovsky, Irina; Wainer, Howard – Journal of Educational and Behavioral Statistics, 2017
In this article, we extend the methodology of the Cut-Score Operating Function that we introduced previously and apply it to a testing scenario with multiple independent components and different testing policies. We derive analytically the overall classification error rate for a test battery under the policy when several retakes are allowed for…
Descriptors: Cutting Scores, Weighted Scores, Classification, Testing
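As a toy illustration of why retake policies matter for classification error (not the Cut-Score Operating Function itself), assume observed scores are normal around a true score with a known standard error; allowing retakes lowers false negatives for true masters but raises false positives for true non-masters. The cut score, standard error, and true scores below are made up.

```python
from scipy.stats import norm

def pass_probability(true_score, cut, sem, attempts):
    """P(at least one observed score >= cut) across independent attempts,
    assuming observed scores ~ Normal(true_score, sem)."""
    p_single = 1 - norm.cdf(cut, loc=true_score, scale=sem)
    return 1 - (1 - p_single) ** attempts

cut, sem = 500, 20
for attempts in (1, 2, 3):
    false_negative = 1 - pass_probability(520, cut, sem, attempts)  # true master fails every attempt
    false_positive = pass_probability(480, cut, sem, attempts)      # true non-master passes at least once
    print(f"attempts={attempts}: false negative {false_negative:.3f}, false positive {false_positive:.3f}")
```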