ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	40

Descriptor

Error of Measurement	47
Evaluation Problems	47
Evaluation Methods	22
Academic Achievement	12
Measurement Techniques	12
Evaluation Criteria	10
Evaluation Research	10
Teacher Effectiveness	10
Comparative Analysis	9
Educational Policy	9
Achievement Gains	8
Research Methodology	8
Statistical Bias	8
Item Analysis	7
Measurement	7
Teacher Evaluation	7
Test Reliability	7
Achievement Rating	6
Educational Assessment	6
Program Evaluation	6
Research Problems	6
Robustness (Statistics)	6
Scores	6
Change Strategies	5
Correlation	5
More ▼

Publication Type

Journal Articles	30
Reports - Evaluative	22
Reports - Research	15
Reports - Descriptive	6
Opinion Papers	4
Speeches/Meeting Papers	2
Dissertations/Theses -…	1
ERIC Digests in Full Text	1
ERIC Publications	1
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
More ▼

Education Level

Elementary Secondary Education	13
Higher Education	10
Postsecondary Education	5
Adult Education	3
Grade 3	2
Elementary Education	1
Grade 4	1
Grade 5	1
High Schools	1

Audience

Practitioners	2
Policymakers	1
Researchers	1

Location

New York	3
Florida	2
Texas	2
California	1
California (Stanford)	1
Canada	1
Illinois	1
Iran	1
New Jersey	1
North Carolina	1
Ohio	1
Tennessee	1
United Kingdom	1
United States	1
More ▼

Laws, Policies, & Programs

Race to the Top	2
No Child Left Behind Act 2001	1

Assessments and Surveys

Florida Comprehensive…	2
British Household Panel Survey	1
Lexile Scale of Reading	1
National Assessment of…	1
Stanford Achievement Tests	1

What Works Clearinghouse Rating

Showing 1 to 15 of 47 results Save | Export

Monitoring Rater Quality in Observational Systems: Issues Due to Unreliable Estimates of Rater Quality

Peer reviewed

Direct link

Mark White; Matt Ronfeldt – Educational Assessment, 2024

Standardized observation systems seek to reliably measure a specific conceptualization of teaching quality, managing rater error through mechanisms such as certification, calibration, validation, and double-scoring. These mechanisms both support high quality scoring and generate the empirical evidence used to support the scoring inference (i.e.,…

Descriptors: Interrater Reliability, Quality Control, Teacher Effectiveness, Error Patterns

Assessing the Contribution of Measures of Attention and Executive Function to Diagnosis of ADHD or Autism

Peer reviewed

Direct link

Kelsey Harkness; Signe Bray; Chelsea M. Durber; Deborah Dewey; Kara Murias – Journal of Autism and Developmental Disorders, 2025

Attention and executive function (EF) dysregulation are common in a number of disorders including autism and attention-deficit/hyperactivity disorder (ADHD). Better understanding of the relationship between indirect and direct measures of attention and EF and common neurodevelopmental diagnoses may contribute to more efficient and effective…

Descriptors: Adolescents, Autism Spectrum Disorders, Attention Deficit Hyperactivity Disorder, Executive Function

A Modified "a"-Stratified Method for Computerized Adaptive Testing. Research Report. ETS RR-19-10

Peer reviewed
PDF on ERIC

Download full text

Gu, Lixiong; Ling, Guangming; Qu, Yanxuan – ETS Research Report Series, 2019

Research has found that the "a"-stratified item selection strategy (STR) for computerized adaptive tests (CATs) may lead to insufficient use of high a items at later stages of the tests and thus to reduced measurement precision. A refined approach, unequal item selection across strata (USTR), effectively improves test precision over the…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Use, Test Items

The Miscalculation of Interrater Reliability: A Case Study Involving the AAC&U VALUE Rubrics

Peer reviewed
PDF on ERIC

Download full text

Szafran, Robert F. – Practical Assessment, Research & Evaluation, 2017

Institutional assessment of student learning objectives has become a fact-of-life in American higher education and the Association of American Colleges and Universities' (AAC&U) VALUE Rubrics have become a widely adopted evaluation and scoring tool for student work. As faculty from a variety of disciplines, some less familiar with the…

Descriptors: Interrater Reliability, Case Studies, Scoring Rubrics, Behavioral Objectives

The Impact of Ignoring the Level of Nesting Structure in Nonparametric Multilevel Latent Class Models

Peer reviewed

Direct link

Park, Jungkyu; Yu, Hsiu-Ting – Educational and Psychological Measurement, 2016

The multilevel latent class model (MLCM) is a multilevel extension of a latent class model (LCM) that is used to analyze nested structure data structure. The nonparametric version of an MLCM assumes a discrete latent variable at a higher-level nesting structure to account for the dependency among observations nested within a higher-level unit. In…

Descriptors: Hierarchical Linear Modeling, Nonparametric Statistics, Data Analysis, Simulation

Using Student Test Scores to Measure Teacher Performance: Some Problems in the Design and Implementation of Evaluation Systems

Peer reviewed

Direct link

Ballou, Dale; Springer, Matthew G. – Educational Researcher, 2015

Our aim in this article is to draw attention to some underappreciated problems in the design and implementation of evaluation systems that incorporate value-added measures. We focus on four: (1) taking into account measurement error in teacher assessments, (2) revising teachers' scores as more information becomes available about their students,…

Descriptors: Teacher Evaluation, Teacher Effectiveness, Scores, Error of Measurement

Evaluation Strategies in Financial Education: Evaluation with Imperfect Instruments

Peer reviewed

Direct link

Robinson, Lauren; Dudensing, Rebekka; Granovsky, Nancy L. – Journal of Extension, 2016

Program evaluation often suffers due to time constraints, imperfect instruments, incomplete data, and the need to report standardized metrics. This article about the evaluation process for the Wi$eUp financial education program showcases the difficulties inherent in evaluation and suggests best practices for assessing program effectiveness. We…

Descriptors: Evaluation Methods, Evaluation Research, Error of Measurement, Money Management

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Psychometric Challenges in Assessing English Language Learners and Students with Disabilities

Peer reviewed

Direct link

Lane, Suzanne; Leventhal, Brian – Review of Research in Education, 2015

This chapter addresses the psychometric challenges in assessing English language learners (ELLs) and students with disabilities (SWDs). The first section addresses some general considerations in the assessment of ELLs and SWDs, including the prevalence of ELLs and SWDs in the student population, federal and state legislation that requires the…

Descriptors: Psychometrics, Evaluation Problems, English Language Learners, Disabilities

What Response Rates Are Needed to Make Reliable Inferences from Student Evaluations of Teaching?

Peer reviewed

Direct link

Zumrawi, Abdel Azim; Bates, Simon P.; Schroeder, Marianne – Educational Research and Evaluation, 2014

This paper addresses the determination of statistically desirable response rates in students' surveys, with emphasis on assessing the effect of underlying variability in the student evaluation of teaching (SET). We discuss factors affecting the determination of adequate response rates and highlight challenges caused by non-response and lack of…

Descriptors: Inferences, Test Reliability, Response Rates (Questionnaires), Student Evaluation of Teacher Performance

What Do We Know about the Tradeoffs Associated with Teacher Misclassifications in High Stakes Personnel Decisions? What We Know Series: Value-Added Methods and Applications. Knowledge Brief 6

Download full text

Goldhaber, Dan; Loeb, Susanna – Carnegie Foundation for the Advancement of Teaching, 2013

Better teacher evaluation should lead to better instruction and improved outcomes for students, but more accurate classification of teachers requires better information than is now available. Because existing measures of performance are incomplete and imperfect, measured performance does not always reflect true performance. Teachers who are truly…

Descriptors: Personnel Management, Personnel Policy, Teacher Evaluation, Teacher Effectiveness

Assumptions of Multiple Regression: Correcting Two Misconceptions

Peer reviewed
PDF on ERIC

Download full text

Williams, Matt N.; Gomez Grajales, Carlos Alberto; Kurkiewicz, Dason – Practical Assessment, Research & Evaluation, 2013

In 2002, an article entitled "Four assumptions of multiple regression that researchers should always test" by Osborne and Waters was published in "PARE." This article has gone on to be viewed more than 275,000 times (as of August 2013), and it is one of the first results displayed in a Google search for "regression…

Descriptors: Multiple Regression Analysis, Misconceptions, Reader Response, Predictor Variables

A Second Look at "School-Life Expectancy"

Peer reviewed

Direct link

Barakat, Bilal Fouad – International Journal of Educational Development, 2012

The number of years a child of school-entry age can expect to remain in school is of great interest both as a measure of individual human capital and of the performance of an education system. An approximate indicator of this concept is the sum of age-specific enrolment rates. The relatively low data demands of this indicator that are feasible to…

Descriptors: Human Capital, Measurement Techniques, Simulation, Evaluation Methods

Assessing Tradeoffs between Observational and Experimental Designs for Charter School Research. Program on Education Policy and Governance Working Papers Series. PEPG 15-04

Download full text

Ackerman, Matthew; Egalite, Anna J. – Program on Education Policy and Governance, 2015

When lotteries are infeasible, researchers must rely on observational methods to estimate charter effectiveness at raising student test scores. Considerable attention has been paid to observational studies by the Stanford Center for Research on Education Outcomes (CREDO), which have analyzed charter performance in 27 states. However, the…

Descriptors: Charter Schools, Observation, Special Education, Lunch Programs

Step Arounds for Common Pitfalls When Valuing Resources Used versus Resources Produced

Peer reviewed

Direct link

Yates, Brian T. – New Directions for Evaluation, 2012

The value of a program can be understood as referring not only to outcomes, but also to how those outcomes compare to the types and amounts of resources expended to produce the outcomes. Major potential mistakes and biases in assessing the worth of resources consumed, as well as the value of outcomes produced, are explored. Most of these occur…

Descriptors: Program Evaluation, Cost Effectiveness, Evaluation Criteria, Evaluation Problems

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

American Psychologist	2
Carnegie Foundation for the…	2
Education and the Public…	2
Educational Measurement:…	2
Evaluation Review	2
Journal of MultiDisciplinary…	2
National Center for Analysis…	2
Practical Assessment,…	2
American Educational Research…	1
Applied Measurement in…	1
Centre for the Economics of…	1
ETS Research Report Series	1
Educational Assessment	1
Educational Research and…	1
Educational Researcher	1
Educational Testing Service	1
Educational and Psychological…	1
International Journal of…	1
Internet and Higher Education	1
Journal of Autism and…	1
Journal of Extension	1
Journal of Leadership…	1
Journal of School Choice	1
Learning Disability Quarterly	1
National Education Policy…	1
More ▼

Loeb, Susanna	2
Ackerman, Matthew	1
Altonji, Joseph G.	1
Anderson, Dan	1
Andru, Peter	1
Ballou, Dale	1
Barakat, Bilal Fouad	1
Bates, Simon P.	1
Borneman, Matthew J.	1
Botchkarev, Alexei	1
Boyd, Donald	1
Bradford, George	1
Burdick, Donald S.	1
Chelsea M. Durber	1
Connelly, Brian S.	1
Deborah Dewey	1
Devaney, Barbara	1
Dorn, Sherman	1
Dudensing, Rebekka	1
Egalite, Anna J.	1
Froman, Terry	1
Gardner, Eric	1
Goldhaber, Dan	1
Gomez Grajales, Carlos Alberto	1
More ▼