Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 18 |
Descriptor
| Educational Testing | 47 |
| Evaluation Methods | 47 |
| Test Reliability | 34 |
| Test Validity | 21 |
| Student Evaluation | 18 |
| Test Construction | 14 |
| Reliability | 11 |
| Testing Problems | 11 |
| Validity | 10 |
| Educational Assessment | 9 |
| Academic Achievement | 8 |
Author
| Bagnato, Stephen J. | 2 |
| Booker, Kevin | 2 |
| Bruch, Julie | 2 |
| Gill, Brian | 2 |
| Macy, Marisa | 2 |
| Popham, W. James | 2 |
| Alexiou, Jon J. | 1 |
| Allen, R. R. | 1 |
| Austin, Dean A. | 1 |
| Dahl, Theodore | 1 |
| Dwyer, Carol A. | 1 |
Education Level
| Elementary Secondary Education | 9 |
| Elementary Education | 5 |
| Higher Education | 3 |
| Middle Schools | 3 |
| Early Childhood Education | 2 |
| High Schools | 2 |
| Preschool Education | 2 |
| Adult Education | 1 |
| Grade 3 | 1 |
| Grade 4 | 1 |
| Grade 5 | 1 |
Audience
| Teachers | 2 |
| Counselors | 1 |
| Practitioners | 1 |
| Researchers | 1 |
| Students | 1 |
Location
| United Kingdom | 4 |
| Nebraska | 1 |
| New York | 1 |
| Tennessee | 1 |
| United States | 1 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
| Elementary and Secondary… | 1 |
| Individuals with Disabilities… | 1 |
| Race to the Top | 1 |
Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024
We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…
Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners
W. James Popham – Pearson, 2024
"Classroom Assessment" shows pre- and in-service teachers how to use classroom testing accurately and formatively to dramatically increase their teaching effectiveness and promote student learning. In addition to clear and concise guidelines on how to develop and use quality classroom assessments, the author also focuses on the teaching…
Descriptors: Student Evaluation, Testing, Teacher Effectiveness, Test Construction
Nebraska Department of Education, 2024
The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…
Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students
Popham, W. James – Phi Delta Kappan, 2014
The tests we use to evaluate student achievement may well be sound measures of what students know, but they are faulty indicators at best of how well they have been taught. A remedy to this situation of judging teachers by the performance of their students on high-stakes tests may be in hand already. We should look to the methods successfully…
Descriptors: High Stakes Tests, Academic Achievement, Teacher Evaluation, Evaluation Methods
Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013
States are increasingly interested in including measures of student achievement growth, or "value-added," in evaluating teachers. Annual state assessments, however, which are the typical measure of student growth, usually cover only reading and math teachers and only in grades 4-8. These state assessments thus cannot…
Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing
Gill, Brian; Bruch, Julie; Booker, Kevin – Regional Educational Laboratory Mid-Atlantic, 2013
States and school districts are exploring alternatives to state tests for measuring teachers' contributions to student learning. One approach applies statistical value-added methods to alternative student assessments such as commercially available tests and end-of-course tests. The evidence suggests that these methods can reliably distinguish…
Descriptors: Teacher Evaluation, Teacher Competencies, Evaluation Methods, Educational Testing
Wright, Robert E. – College Student Journal, 2010
The use of standardized tests for outcome assessment has grown dramatically in recent years. Two driving factors have been the No Child Left Behind legislation, and the increase in outcome assessment measures by accrediting agencies such as AACSB, the international accrediting body for business schools. Despite the growth in usage, little effort…
Descriptors: College Outcomes Assessment, Educational Testing, Standardized Tests, Accreditation (Institutions)
Papay, John P. – American Educational Research Journal, 2011
Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…
Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests
Tennessee Department of Education, 2012
In the summer of 2011, the Tennessee Department of Education contracted with the National Institute for Excellence in Teaching (NIET) to provide a four-day training for all evaluators across the state. NIET trained more than 5,000 evaluators intensively in the state model (districts using alternative instruments delivered their own training).…
Descriptors: Video Technology, Feedback (Response), Evaluators, Interrater Reliability
Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors' experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.
Descriptors: Scoring, Reliability, Validity, Classification
Stobart, Gordon – Educational Research, 2009
Background: Validity is a central concern in any assessment, though this has often not been made explicit in the UK assessment context. This article applies current validity theorising, largely derived from American formulations, to national curriculum assessments in England. Purpose: The aim is to consider validity arguments in relation to the…
Descriptors: National Curriculum, Foreign Countries, Elementary Secondary Education, Educational Policy
Newton, Paul E. – Educational Research, 2009
Background: National curriculum tests have been administered in England for well over a decade. Although reliability evidence has been published, critics have argued that there is not enough evidence (of the right kind) and that test results may be insufficiently reliable. Purpose: This article collates and discusses evidence on the reliability of…
Descriptors: National Curriculum, Test Results, Foreign Countries, Elementary Secondary Education
Gray, B. Thomas – 1997
Validity is a critically important issue with far-reaching implications for testing. The history of conceptualizations of validity over the past 50 years is reviewed, and 3 important areas of controversy are examined. First, the question of whether the three traditionally recognized types of validity should be integrated as a unitary entity of…
Descriptors: Educational Testing, Evaluation Methods, Reliability, Scores
Ongley, Pat – Education and Training, 1970 (peer reviewed)
Discusses the pros and cons of objective testing and its difficulties and dangers. (SB)
Descriptors: Educational Testing, Evaluation Methods, Objective Tests, Test Construction
A Culture of Evidence: An Evidence-Centered Approach to Accountability for Student Learning Outcomes
Millett, Catherine M.; Payne, David G.; Dwyer, Carol A.; Stickler, Leslie M.; Alexiou, Jon J. – Educational Testing Service, 2008
This paper presents a framework that institutions of higher education can use to improve, revise and introduce comprehensive systems for the collection and dissemination of information on student learning outcomes. For faculty and institutional leaders grappling with the many issues and nuances inherent in assessing student learning, the framework…
Descriptors: Higher Education, Educational Testing, Accountability, Outcomes of Education

