Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 40 |
Descriptor
Error of Measurement | 47 |
Evaluation Problems | 47 |
Evaluation Methods | 22 |
Academic Achievement | 12 |
Measurement Techniques | 12 |
Evaluation Criteria | 10 |
Evaluation Research | 10 |
Teacher Effectiveness | 10 |
Comparative Analysis | 9 |
Educational Policy | 9 |
Achievement Gains | 8 |
More ▼ |
Source
Author
Loeb, Susanna | 2 |
Ackerman, Matthew | 1 |
Altonji, Joseph G. | 1 |
Anderson, Dan | 1 |
Andru, Peter | 1 |
Ballou, Dale | 1 |
Barakat, Bilal Fouad | 1 |
Bates, Simon P. | 1 |
Borneman, Matthew J. | 1 |
Botchkarev, Alexei | 1 |
Boyd, Donald | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 13 |
Higher Education | 10 |
Postsecondary Education | 5 |
Adult Education | 3 |
Grade 3 | 2 |
Elementary Education | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
High Schools | 1 |
Audience
Practitioners | 2 |
Policymakers | 1 |
Researchers | 1 |
Location
New York | 3 |
Florida | 2 |
Texas | 2 |
California | 1 |
California (Stanford) | 1 |
Canada | 1 |
Illinois | 1 |
Iran | 1 |
New Jersey | 1 |
North Carolina | 1 |
Ohio | 1 |
More ▼ |
Laws, Policies, & Programs
Race to the Top | 2 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Florida Comprehensive… | 2 |
British Household Panel Survey | 1 |
Lexile Scale of Reading | 1 |
National Assessment of… | 1 |
Stanford Achievement Tests | 1 |
What Works Clearinghouse Rating
Mark White; Matt Ronfeldt – Educational Assessment, 2024
Standardized observation systems seek to reliably measure a specific conceptualization of teaching quality, managing rater error through mechanisms such as certification, calibration, validation, and double-scoring. These mechanisms both support high quality scoring and generate the empirical evidence used to support the scoring inference (i.e.,…
Descriptors: Interrater Reliability, Quality Control, Teacher Effectiveness, Error Patterns
Kelsey Harkness; Signe Bray; Chelsea M. Durber; Deborah Dewey; Kara Murias – Journal of Autism and Developmental Disorders, 2025
Attention and executive function (EF) dysregulation are common in a number of disorders including autism and attention-deficit/hyperactivity disorder (ADHD). Better understanding of the relationship between indirect and direct measures of attention and EF and common neurodevelopmental diagnoses may contribute to more efficient and effective…
Descriptors: Adolescents, Autism Spectrum Disorders, Attention Deficit Hyperactivity Disorder, Executive Function
Gu, Lixiong; Ling, Guangming; Qu, Yanxuan – ETS Research Report Series, 2019
Research has found that the "a"-stratified item selection strategy (STR) for computerized adaptive tests (CATs) may lead to insufficient use of high a items at later stages of the tests and thus to reduced measurement precision. A refined approach, unequal item selection across strata (USTR), effectively improves test precision over the…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Use, Test Items
Szafran, Robert F. – Practical Assessment, Research & Evaluation, 2017
Institutional assessment of student learning objectives has become a fact-of-life in American higher education and the Association of American Colleges and Universities' (AAC&U) VALUE Rubrics have become a widely adopted evaluation and scoring tool for student work. As faculty from a variety of disciplines, some less familiar with the…
Descriptors: Interrater Reliability, Case Studies, Scoring Rubrics, Behavioral Objectives
Park, Jungkyu; Yu, Hsiu-Ting – Educational and Psychological Measurement, 2016
The multilevel latent class model (MLCM) is a multilevel extension of a latent class model (LCM) that is used to analyze nested structure data structure. The nonparametric version of an MLCM assumes a discrete latent variable at a higher-level nesting structure to account for the dependency among observations nested within a higher-level unit. In…
Descriptors: Hierarchical Linear Modeling, Nonparametric Statistics, Data Analysis, Simulation
Ballou, Dale; Springer, Matthew G. – Educational Researcher, 2015
Our aim in this article is to draw attention to some underappreciated problems in the design and implementation of evaluation systems that incorporate value-added measures. We focus on four: (1) taking into account measurement error in teacher assessments, (2) revising teachers' scores as more information becomes available about their students,…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Scores, Error of Measurement
Robinson, Lauren; Dudensing, Rebekka; Granovsky, Nancy L. – Journal of Extension, 2016
Program evaluation often suffers due to time constraints, imperfect instruments, incomplete data, and the need to report standardized metrics. This article about the evaluation process for the Wi$eUp financial education program showcases the difficulties inherent in evaluation and suggests best practices for assessing program effectiveness. We…
Descriptors: Evaluation Methods, Evaluation Research, Error of Measurement, Money Management
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Lane, Suzanne; Leventhal, Brian – Review of Research in Education, 2015
This chapter addresses the psychometric challenges in assessing English language learners (ELLs) and students with disabilities (SWDs). The first section addresses some general considerations in the assessment of ELLs and SWDs, including the prevalence of ELLs and SWDs in the student population, federal and state legislation that requires the…
Descriptors: Psychometrics, Evaluation Problems, English Language Learners, Disabilities
Zumrawi, Abdel Azim; Bates, Simon P.; Schroeder, Marianne – Educational Research and Evaluation, 2014
This paper addresses the determination of statistically desirable response rates in students' surveys, with emphasis on assessing the effect of underlying variability in the student evaluation of teaching (SET). We discuss factors affecting the determination of adequate response rates and highlight challenges caused by non-response and lack of…
Descriptors: Inferences, Test Reliability, Response Rates (Questionnaires), Student Evaluation of Teacher Performance
Goldhaber, Dan; Loeb, Susanna – Carnegie Foundation for the Advancement of Teaching, 2013
Better teacher evaluation should lead to better instruction and improved outcomes for students, but more accurate classification of teachers requires better information than is now available. Because existing measures of performance are incomplete and imperfect, measured performance does not always reflect true performance. Teachers who are truly…
Descriptors: Personnel Management, Personnel Policy, Teacher Evaluation, Teacher Effectiveness
Williams, Matt N.; Gomez Grajales, Carlos Alberto; Kurkiewicz, Dason – Practical Assessment, Research & Evaluation, 2013
In 2002, an article entitled "Four assumptions of multiple regression that researchers should always test" by Osborne and Waters was published in "PARE." This article has gone on to be viewed more than 275,000 times (as of August 2013), and it is one of the first results displayed in a Google search for "regression…
Descriptors: Multiple Regression Analysis, Misconceptions, Reader Response, Predictor Variables
Barakat, Bilal Fouad – International Journal of Educational Development, 2012
The number of years a child of school-entry age can expect to remain in school is of great interest both as a measure of individual human capital and of the performance of an education system. An approximate indicator of this concept is the sum of age-specific enrolment rates. The relatively low data demands of this indicator that are feasible to…
Descriptors: Human Capital, Measurement Techniques, Simulation, Evaluation Methods
Ackerman, Matthew; Egalite, Anna J. – Program on Education Policy and Governance, 2015
When lotteries are infeasible, researchers must rely on observational methods to estimate charter effectiveness at raising student test scores. Considerable attention has been paid to observational studies by the Stanford Center for Research on Education Outcomes (CREDO), which have analyzed charter performance in 27 states. However, the…
Descriptors: Charter Schools, Observation, Special Education, Lunch Programs
Yates, Brian T. – New Directions for Evaluation, 2012
The value of a program can be understood as referring not only to outcomes, but also to how those outcomes compare to the types and amounts of resources expended to produce the outcomes. Major potential mistakes and biases in assessing the worth of resources consumed, as well as the value of outcomes produced, are explored. Most of these occur…
Descriptors: Program Evaluation, Cost Effectiveness, Evaluation Criteria, Evaluation Problems