Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 14 |
| Since 2007 (last 20 years) | 42 |
Descriptor
| Evaluation Methods | 74 |
| Standardized Tests | 74 |
| Models | 59 |
| Academic Achievement | 23 |
| Student Evaluation | 20 |
| Elementary Secondary Education | 15 |
| Foreign Countries | 15 |
| Program Evaluation | 15 |
| Measurement Techniques | 14 |
| Scores | 13 |
| Educational Assessment | 12 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Elementary Secondary Education | 13 |
| Secondary Education | 11 |
| Postsecondary Education | 10 |
| Elementary Education | 9 |
| Higher Education | 9 |
| Grade 4 | 5 |
| High Schools | 5 |
| Intermediate Grades | 5 |
| Middle Schools | 5 |
| Grade 5 | 4 |
| Grade 6 | 4 |
| More ▼ | |
Location
| Australia | 5 |
| Connecticut | 3 |
| United Kingdom (England) | 3 |
| California | 2 |
| Florida | 2 |
| New Hampshire | 2 |
| New York | 2 |
| North Carolina | 2 |
| Pennsylvania | 2 |
| Rhode Island | 2 |
| Texas (Houston) | 2 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Michelle Croft; Bonnie O’Keefe; Marisa Mission; Juliet Squire – Bellwether, 2024
State summative assessments play an important role in measuring student learning and guiding educational improvement efforts, despite their limitations. But there is growing momentum in individual states and nationally to rethink these assessments with an eye toward reducing time spent on testing and increasing the tests' instructional relevance.…
Descriptors: Student Evaluation, Summative Evaluation, State Standards, Educational Improvement
Edward J. Kim – Annenberg Institute for School Reform at Brown University, 2022
This study introduces the signal weighted teacher value-added model (SW VAM), a value-added model that weights student-level observations based on each student's capacity to signal their assigned teacher's quality. Specifically, the model leverages the repeated appearance of a given student to estimate student reliability and sensitivity…
Descriptors: Value Added Models, Student Evaluation, Reliability, Simulation
Amrein-Beardsley, Audrey; Geiger, Tray – Phi Delta Kappan, 2017
Houston's experience with the Educational Value-Added Assessment System (R) (EVAAS) raises questions that other districts should consider before buying the software and using it for high-stakes decisions. Researchers found that teachers in Houston, all of whom were under the EVAAS gun, but who taught relatively more racial minority students,…
Descriptors: Value Added Models, School Districts, Computer Software, Educational Technology
Backes, Ben; Cowan, James; Goldhaber, Dan; Koedel, Cory; Miller, Luke C.; Xu, Zeyu – Grantee Submission, 2017
Policies that require the use of information about student achievement to evaluate teacher performance are becoming increasingly common across the United States, but there is some question as to how or whether to use student test-based teacher evaluations when student assessments change. We bring empirical evidence to bear on this issue.…
Descriptors: Common Core State Standards, Educational Change, Academic Achievement, Standardized Tests
Soland, James – Applied Measurement in Education, 2017
Research shows that assuming a test scale is equal-interval can be problematic, especially when the assessment is being used to achieve a policy aim like evaluating growth over time. However, little research considers whether teacher value added is sensitive to the underlying test scale, and in particular whether treating an ordinal scale as…
Descriptors: Intervals, Value Added Models, Teacher Evaluation, Teacher Effectiveness
Jimenez, Laura – Center for American Progress, 2020
Schools face enormous challenges regarding how to operate efficiently and safely for the 2020-21 school year. As part of that response, some state leaders are asking the U.S. Department of Education to waive the annual federal testing and accountability requirements for 2021, which are key to understanding and addressing gaps in education among…
Descriptors: COVID-19, Pandemics, Disease Control, Well Being
Santelices, Maria Veronica; Valencia, Edgar; Gonzalez, Jorge; Taut, Sandy – Educational Assessment, Evaluation and Accountability, 2017
This research examines empirically the relationship between two measures of teacher quality: one based on professional standards and a second one using teacher value-added estimates. It also studies the extent to which teacher observable characteristics, such as teacher training variables, are associated to better performance on either of these…
Descriptors: Teacher Effectiveness, Context Effect, Foreign Countries, Value Added Models
Berliner, David C. – Education Policy Analysis Archives, 2018
The Scylla and Charybdis in this discussion of teacher evaluation are standardized achievement test data on the one hand, and classroom observational systems on the other. These are the two most common methods used to judge teachers' competency. Both have serious flaws: the former primarily with validity, the latter primarily with reliability. At…
Descriptors: Teacher Evaluation, Evaluation Problems, Standardized Tests, Achievement Tests
Shen, Zuchao; Simon, Carlee Escue; Kelcey, Ben – eJEP: eJournal of Education Policy, 2016
Value-added models try to separate the contribution of individual teachers or schools to students' learning growth measured by standardized test scores. There is a policy trend to use value-added modeling to evaluate teachers because of its face validity and superficial objectiveness. This article investigates the potential long term consequences…
Descriptors: Value Added Models, Teacher Evaluation, Program Implementation, Teacher Effectiveness
Falk, Carl F.; Cai, Li – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2015
We present a logistic function of a monotonic polynomial with a lower asymptote, allowing additional flexibility beyond the three-parameter logistic model. We develop a maximum marginal likelihood based approach to estimate the item parameters. The new item response model is demonstrated on math assessment data from a state, and a computationally…
Descriptors: Guessing (Tests), Item Response Theory, Mathematics Instruction, Mathematics Tests
Cullen, Julie Berry; Koedel, Cory; Parsons, Eric – National Center for Analysis of Longitudinal Data in Education Research (CALDER), 2016
Improving public sector workforce quality is challenging in sectors such as education where worker productivity is difficult to assess and manager incentives are muted by political and bureaucratic constraints. In this paper, we study how providing information to principals about teacher effectiveness and encouraging them to use the information in…
Descriptors: Teacher Competencies, Teacher Effectiveness, Labor Turnover, Teacher Persistence
Cook, H. Gary; Sahakyan, Narek; Linquanti, Robert – Wisconsin Center for Education Research, 2017
The authors develop model analyses based on a U.S. Department of Education guide to illustrate procedures a State could use to compare and contrast school-level overall and English Learner accountability determinations for proficiency in reading/language arts under the options allowed by the Every Student Succeeds Act. As a technical reference,…
Descriptors: State Standards, Accountability, Immigrants, English Language Learners
Cao, Thuy Hong; Jung, Jae Yup; Lee, Jihyun – Journal of Advanced Academics, 2017
Assessment is a crucial component of gifted education. Not only does it facilitate the recognition of the potential and specific needs of gifted students, it also monitors the progress and growth of gifted students, and allows for the evaluation of gifted education programs. In the present review, we synthesize the literature on assessment in…
Descriptors: Program Evaluation, Foreign Countries, Evaluation Methods, Talent
Martínez Abad, Fernando; Chaparro Caso López, Alicia A. – School Effectiveness and School Improvement, 2017
In light of the emergence of statistical analysis techniques based on data mining in education sciences, and the potential they offer to detect non-trivial information in large databases, this paper presents a procedure used to detect factors linked to academic achievement in large-scale assessments. The study is based on a non-experimental,…
Descriptors: Foreign Countries, Data Collection, Statistical Analysis, Evaluation Methods
Anderson, Daniel; Farley, Dan; Tindal, Gerald – Journal of Special Education, 2015
Students with significant cognitive disabilities present an assessment dilemma that centers on access and validity in large-scale testing programs. Typically, access is improved by eliminating construct-irrelevant barriers, while validity is improved, in part, through test standardization. In this article, one state's alternate assessment data…
Descriptors: Mental Retardation, Evaluation Methods, Student Evaluation, Standardized Tests

Direct link
Peer reviewed
