Showing all 12 results
Peer reviewed
Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024
Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…
Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement
Peer reviewed
Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020
A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, limited research has investigated panelists' ability to perform the Bookmark method well, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…
Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items
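For context on the method being compared, here is a minimal sketch of the Bookmark item-mapping step under an assumed 2PL IRT model; the RP67 criterion is the conventional choice, and the item parameters are hypothetical, not from the study.

```python
import math

RP = 0.67  # response probability criterion conventionally used in Bookmark studies

def rp_location(a: float, b: float, rp: float = RP) -> float:
    """Ability (theta) at which P(correct) = rp under a 2PL model:
    p = 1 / (1 + exp(-a * (theta - b)))  =>  theta = b + ln(rp / (1 - rp)) / a
    """
    return b + math.log(rp / (1.0 - rp)) / a

# Hypothetical item parameters: (name, discrimination a, difficulty b).
items = [("item1", 1.2, -0.5), ("item2", 0.8, 0.1), ("item3", 1.5, 0.7)]

# The ordered item booklet lists items by increasing RP location.
ordered = sorted(items, key=lambda it: rp_location(it[1], it[2]))
for name, a, b in ordered:
    print(f"{name}: RP67 location = {rp_location(a, b):.2f}")

# A panelist's bookmark placed after item k is read as a cut score at
# that item's RP location (one common operationalization).
k = 2
cut = rp_location(*ordered[k - 1][1:])
print(f"cut score (theta) for a bookmark after item {k}: {cut:.2f}")
```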
Peer reviewed
Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
Mediation analysis plays an important role in understanding causal processes in the social and behavioral sciences. While path analysis with composite scores has been criticized for yielding biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing
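The bias at issue is easy to reproduce by simulation: measurement error in a composite mediator attenuates the b path, and hence the a*b indirect effect. A minimal sketch, with illustrative numbers that are not from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)
n, a, b = 100_000, 0.5, 0.5          # true paths: X -> M (a), M -> Y (b)

x = rng.normal(size=n)
m_true = a * x + rng.normal(size=n)  # latent mediator
y = b * m_true + rng.normal(size=n)

# The observed composite score for M carries measurement error.
m_obs = m_true + rng.normal(size=n)

# a path: the error sits in the outcome variable, so the slope is unbiased.
a_hat = np.cov(x, m_obs, ddof=0)[0, 1] / np.var(x)

# b path: regressing y on the error-laden m_obs (controlling for x)
# attenuates the slope toward zero.
X = np.column_stack([np.ones(n), m_obs, x])
b_hat = np.linalg.lstsq(X, y, rcond=None)[0][1]

print(f"true a*b = {a * b:.3f}, path-analysis estimate = {a_hat * b_hat:.3f}")
```

With unit error variance this roughly halves the estimated indirect effect, which is the composite-score bias that latent-variable models are designed to avoid.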
Peer reviewed
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for, they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
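For reference, the textbook (Kish) form of the design effect for a cluster sample with average cluster size m and intraclass correlation ρ, which may not be the article's exact specification, is:

```latex
\mathrm{DEFF} = 1 + (m - 1)\,\rho, \qquad
n_{\mathrm{eff}} = \frac{n}{\mathrm{DEFF}}, \qquad
\mathrm{SE}_{\mathrm{cluster}} = \mathrm{SE}_{\mathrm{SRS}}\,\sqrt{\mathrm{DEFF}}
```

For example, with intact classrooms of m = 25 students and ρ = 0.2, DEFF = 1 + 24(0.2) = 5.8, so standard errors computed under a simple-random-sampling assumption are understated by a factor of √5.8 ≈ 2.4.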
Sturgis, Chris – International Association for K-12 Online Learning, 2014
This paper is part of a series investigating the implementation of competency education. The purpose of the paper is to explore how districts and schools can redesign grading systems to best help students excel academically and gain the skills needed to succeed in college, the community, and the workplace. In order to make the…
Descriptors: Grading, Competency Based Education, Evaluation Methods, Evaluation Research
Peer reviewed
Papay, John P. – American Educational Research Journal, 2011
Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…
Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests
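The outcome-sensitivity point can be illustrated with a toy simulation in which two tests measure overlapping but not identical constructs; the crude class-mean estimator and all numbers below are illustrative assumptions, not Papay's value-added specification or data:

```python
import numpy as np

rng = np.random.default_rng(1)
n_teachers, class_size = 50, 40
teacher = np.repeat(np.arange(n_teachers), class_size)
effect = rng.normal(scale=0.2, size=n_teachers)   # true teacher effects

# Two outcome tests share a common achievement component but add
# test-specific noise (e.g., different content emphases).
common = effect[teacher] + rng.normal(size=teacher.size)
test_a = common + rng.normal(scale=0.8, size=teacher.size)
test_b = common + rng.normal(scale=0.8, size=teacher.size)

def value_added(scores):
    """Crude value-added estimate: mean class score per teacher."""
    return np.array([scores[teacher == t].mean() for t in range(n_teachers)])

ra, rb = value_added(test_a), value_added(test_b)
print(f"correlation of teacher estimates across outcome tests: "
      f"{np.corrcoef(ra, rb)[0, 1]:.2f}")
```

Even with identical students and teachers, the two outcome measures yield noticeably different teacher rankings, which is the kind of sensitivity the study examines with real district data.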
Peer reviewed
Robitzsch, Alexander; Rupp, Andre A. – Educational and Psychological Measurement, 2009
This article describes the results of a simulation study to investigate the impact of missing data on the detection of differential item functioning (DIF). Specifically, it investigates how four methods for dealing with missing data (listwise deletion, zero imputation, two-way imputation, response function imputation) interact with two methods of…
Descriptors: Test Bias, Simulation, Interaction, Effect Size
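To make the four missing-data treatments concrete, here is a minimal sketch over a 0/1 item-response matrix; the two-way rule uses the usual person mean + item mean - grand mean form, and response function imputation is omitted because it requires a fitted IRT model:

```python
import numpy as np

def listwise_deletion(X):
    """Drop examinees (rows) with any missing response."""
    return X[~np.isnan(X).any(axis=1)]

def zero_imputation(X):
    """Score every missing response as incorrect (0)."""
    return np.where(np.isnan(X), 0.0, X)

def two_way_imputation(X):
    """Fill with person mean + item mean - grand mean, rounded to 0/1."""
    pm = np.nanmean(X, axis=1, keepdims=True)  # person means
    im = np.nanmean(X, axis=0, keepdims=True)  # item means
    gm = np.nanmean(X)                         # grand mean
    fill = np.clip(np.round(pm + im - gm), 0, 1)
    return np.where(np.isnan(X), fill, X)

# Tiny illustrative matrix: rows = examinees, columns = items, nan = missing.
X = np.array([[1, 0, np.nan],
              [1, 1, 1],
              [np.nan, 0, 0]], dtype=float)

for method in (listwise_deletion, zero_imputation, two_way_imputation):
    print(method.__name__, "->")
    print(method(X))
```

DIF detection (e.g., Mantel-Haenszel or logistic regression) then runs on the completed matrix, so any systematic distortion a fill rule introduces propagates into the DIF statistics.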
Cooper, Terence H. – Journal of Agronomic Education (JAE), 1988
Describes a study examining differences in exam reliability, difficulty, and student evaluations. Indicates that when a fourth option was added to the three-option items, the exams became more difficult. Includes methods, a discussion of results, and tables on student characteristics, whole-test analyses, and selected items. (RT)
Descriptors: Agronomy, College Science, Error of Measurement, Evaluation Methods
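The reliability statistic typically reported for such multiple-choice exams is KR-20; a minimal sketch of its computation on synthetic 0/1 responses (the data here are simulated, not Cooper's):

```python
import numpy as np

def kr20(X):
    """Kuder-Richardson 20 reliability for a 0/1 item-score matrix."""
    k = X.shape[1]
    p = X.mean(axis=0)                      # item difficulties (prop. correct)
    total_var = X.sum(axis=1).var(ddof=1)   # variance of total scores
    return (k / (k - 1)) * (1 - (p * (1 - p)).sum() / total_var)

# Synthetic responses driven by a common ability: rows = students, cols = items.
rng = np.random.default_rng(2)
ability = rng.normal(size=(200, 1))
X = (ability + rng.normal(size=(200, 30)) > 0).astype(float)
print(f"KR-20 = {kr20(X):.2f}")
```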
Peer reviewed
Cohen, Patricia – Evaluation and Program Planning: An International Journal, 1982
The various costs of Type I and Type II errors of inference from data are discussed. Six methods for minimizing each type of error are presented; those for Type I errors may be employed even after data collection, while Type II errors are minimized through a combination of study design and analytical means. (Author/CM)
Descriptors: Analysis of Variance, Data Analysis, Data Collection, Error of Measurement
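The trade-off between the two error types can be quantified with standard power calculations; the effect size and sample size below are illustrative assumptions, not values from the article:

```python
from statsmodels.stats.power import TTestIndPower

analysis = TTestIndPower()
d, n = 0.4, 64  # assumed standardized effect size and per-group n

for alpha in (0.01, 0.05, 0.10):
    power = analysis.power(effect_size=d, nobs1=n, alpha=alpha)
    print(f"alpha = {alpha:.2f}  ->  Type II error rate = {1 - power:.3f}")

# Tightening alpha (fewer Type I errors) raises the Type II error rate
# unless sample size or effect size grows; that is the design-side lever.
```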
Peer reviewed
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
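One way such a multi-language norm could be assembled is by pooling per-translation summary statistics; a minimal sketch with hypothetical sample sizes, means, and SDs, not the authors' procedure or data:

```python
def pooled_norm(groups):
    """Combine per-translation (n, mean, sd) triples into a single norm."""
    N = sum(n for n, _, _ in groups)
    grand_mean = sum(n * m for n, m, _ in groups) / N
    # Total sum of squares = within-group SS + between-group SS.
    ss = sum((n - 1) * s ** 2 + n * (m - grand_mean) ** 2 for n, m, s in groups)
    return grand_mean, (ss / (N - 1)) ** 0.5

# Hypothetical translation samples: (n, mean, sd) on a common score scale.
samples = [(1200, 50.1, 9.8), (800, 52.4, 10.5), (300, 47.9, 9.1)]
mean, sd = pooled_norm(samples)
print(f"pooled norm: mean = {mean:.1f}, sd = {sd:.1f}")
```

Each component sample (translation quality, representativeness, scale drift) contributes its own error to the pooled parameters, which is the kind of multi-source error the article examines.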
Terenzini, Patrick T. – 1986
Unobtrusive measures are recommended as a means of assessing educational outcomes of colleges. Such measures can counteract the response bias which is common in questionnaires and interviews. Outcomes researchers are, in fact, asked to supplement standard measures with unobtrusive measures. Interesting data may result from observation of students'…
Descriptors: Colleges, Cost Effectiveness, Educational Assessment, Error of Measurement
Jaeger, Richard M.; Busch, John Christian – 1986
This study explores the use of the modified caution index (MCI) for identifying judges whose patterns of recommendations suggest that their judgments might be based on incomplete information, flawed reasoning, or inattention to their standard-setting tasks. It also examines the effect on test standards and passing rates when the test standards of…
Descriptors: Criterion Referenced Tests, Error of Measurement, Evaluation Methods, High Schools