ERIC - Search Results

Showing 1 to 15 of 25 results

Technical Adequacy-Reliability

Peer reviewed

Susan K. Johnsen – Gifted Child Today, 2025

The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…

Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement

Controlling for Measurement Error in Evaluations When Treatment Group Assignment Is Based on Noisy Measures

Peer reviewed

Robert Meyer; Sara Hu; Michael Christian – Society for Research on Educational Effectiveness, 2023

Background: This paper develops a new method to estimate quasi-experimental evaluation models when it is necessary to control for measurement error in predictors and individual assignment to the treatment group is based on these same fallible variables. A major methodological finding of the study is that standard methods of estimating models that…

Descriptors: Error of Measurement, Measurement Techniques, Elementary Secondary Education, Report Cards

Exploring Rating Quality in the Context of High-Stakes Rater-Mediated Educational Assessments

Wenjing Guo – ProQuest LLC, 2021

Constructed response (CR) items are widely used in large-scale testing programs, including the National Assessment of Educational Progress (NAEP) and many district and state-level assessments in the United States. One unique feature of CR items is that they depend on human raters to assess the quality of examinees' work. The judgment of human…

Descriptors: National Competency Tests, Responses, Interrater Reliability, Error of Measurement

Charting the Future of Assessments. Full Report

Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024

Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…

Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias

Toward Incorporating Efficiency Data in Brief Experimental Analysis Decision Making

Peer reviewed

Gadke, Daniel L.; Drevon, Daniel D. – School Psychology Review, 2020

Brief experimental analysis (BEA) is frequently used to drive intervention selection decisions for students in need of intensive reading fluency intervention. Researchers have demonstrated that most BEA results for students with reading fluency difficulties are undifferentiated when considering the standard error of measurement (SEM) of…

Descriptors: Data Use, Decision Making, Efficiency, Intervention

The Perceptive Imperative: Connoisseurship and the Temptation of Rubrics

Peer reviewed

Gottlieb, Derek; Moroye, Christy M. – Journal of Curriculum and Pedagogy, 2016

We examine the reliance on rubrics for educational evaluation and explore whether such tools fulfill their promise. Following Wittgensteinian critical strategies, we explore what "the application of the [rubric] picture looks like" and then evaluate (a) whether those benefits are attributable to rubric use at all, and (b) whether any of…

Descriptors: Scoring Rubrics, Educational Assessment, Student Evaluation, Educational Benefits

Review of Sample Size for Structural Equation Models in Second Language Testing and Learning Research: A Monte Carlo Approach

Peer reviewed

In'nami, Yo; Koizumi, Rie – International Journal of Testing, 2013

The importance of sample size, although widely discussed in the literature on structural equation modeling (SEM), has not been widely recognized among applied SEM researchers. To narrow this gap, we focus on second language testing and learning studies and examine the following: (a) Is the sample size sufficient in terms of precision and power of…

Descriptors: Structural Equation Models, Sample Size, Second Language Instruction, Monte Carlo Methods

Improving Explanatory Inferences from Assessments

Diakow, Ronli Phyllis – ProQuest LLC, 2013

This dissertation comprises three papers that propose, discuss, and illustrate models to make improved inferences about research questions regarding student achievement in education. Addressing the types of questions common in educational research today requires three different "extensions" to traditional educational assessment: (1)…

Descriptors: Inferences, Educational Assessment, Academic Achievement, Educational Research

Are Assessment Environments Gendered? An Analysis of the Learning Responses of Male and Female Students to Different Assessment Environments

Peer reviewed

Turner, Gill; Gibbs, Graham – Assessment & Evaluation in Higher Education, 2010

There is considerable variation between male and female Bachelor degree performance at Oxford and Cambridge (Oxbridge) where male students attain more First and Third Class degrees and female students attain more Second Class degrees. Various hypotheses have been put forward to explain this phenomenon including the possibility that the distinctive…

Descriptors: Gender Differences, Questionnaires, Evaluation Methods, Evaluation Research

Design of Value-Added Models for IMPACT and TEAM in DC Public Schools, 2010-2011 School Year. Final Report

Isenberg, Eric; Hock, Heinrich – Mathematica Policy Research, Inc., 2011

This report presents the value-added models that will be used to measure school and teacher effectiveness in the District of Columbia Public Schools (DCPS) in the 2010-2011 school year. It updates the earlier technical report, "Measuring Value Added for IMPACT and TEAM in DC Public Schools." The earlier report described the methods used…

Descriptors: Public Schools, Teacher Effectiveness, School Effectiveness, Models

Measuring Time: The Stability of Special Education Teacher Time Use

Peer reviewed

Vannest, Kimberly J.; Parker, Richard I. – Journal of Special Education, 2010

Instructional time use is an intervention without equal. The measure of such has clear and important implications for special education practice and research. Although exhortations to maximize instruction and thereby student engagement exist throughout the literature, few studies discuss how special education teachers use their time, and none…

Descriptors: School Schedules, Error of Measurement, Sampling, Special Education Teachers

Evaluating the Assessment: Sources of Evidence for Quality Assurance

Peer reviewed

Birenbaum, Menucha – Studies in Educational Evaluation, 2007

High quality assessment practice is expected to yield valid and useful score-based interpretations about what the examinees know and are able to do with respect to a defined target domain. Given this assertion, the article presents a framework based on the "unified view of validity," advanced by Cronbach and Messick over two decades ago, to assist…

Descriptors: Quality Control, Student Evaluation, Validity, Evaluation Methods

E-Assessment within the Bologna Paradigm: Evidence from Portugal

Peer reviewed

Ferrao, Maria – Assessment & Evaluation in Higher Education, 2010

The Bologna Declaration brought reforms into higher education that imply changes in teaching methods, didactic materials and textbooks, infrastructures and laboratories, etc. Statistics and mathematics are disciplines that traditionally have the worst success rates, particularly in non-mathematics core curricula courses. This research project,…

Descriptors: Foreign Countries, Computer Assisted Testing, Educational Technology, Educational Assessment

Techniques for Processing Student Grades

Peer reviewed

Sanders, Steven G. – Journal of College Science Teaching, 1975

Several techniques to use in evaluation and grading are presented. Some grading problems are discussed briefly. (PEB)

Descriptors: Error of Measurement, Evaluation, Evaluation Methods, Grading

Test-Retest Reliability and Standard Error of Measurement for the Test of Variables of Attention (T.O.V.A.) With Healthy School-Age Children

Peer reviewed

Leark, Robert A.; Wallace, Denise R.; Fitzgerald, Robert – Assessment, 2004

Test-retest reliability of the Test of Variables of Attention (T.O.V.A.) was investigated in two studies using two different time intervals: 90 min and 1 week (plus or minus 2 days). To investigate the 90-min reliability, 31 school-age children (M = 10 years, SD = 2.66) were administered the T.O.V.A. then readministered the test 90 min afterward.…

Descriptors: Intervals, Reaction Time, Error of Measurement, Test Reliability
