Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 17 |
Descriptor
Educational Testing | 50 |
Evaluation Methods | 50 |
Test Validity | 50 |
Test Reliability | 19 |
Educational Assessment | 16 |
Student Evaluation | 16 |
Test Construction | 14 |
Testing Problems | 14 |
Test Interpretation | 11 |
Elementary Secondary Education | 9 |
Psychometrics | 9 |
More ▼ |
Source
Author
Bagnato, Stephen J. | 2 |
Bielinski, John | 2 |
Hughes, Katherine L. | 2 |
Leitzel, Thomas C. | 2 |
Macy, Marisa | 2 |
Minnema, Jane | 2 |
Scott-Clayton, Judith | 2 |
Thurlow, Martha | 2 |
Vogler, Daniel E. | 2 |
Alexiou, Jon J. | 1 |
Allen, R. R. | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 9 |
Higher Education | 3 |
Early Childhood Education | 2 |
Preschool Education | 2 |
Two Year Colleges | 2 |
Elementary Education | 1 |
High Schools | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Location
United Kingdom (England) | 2 |
Spain | 1 |
United Kingdom | 1 |
United Kingdom (Wales) | 1 |
United States | 1 |
Virginia | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Elementary and Secondary… | 1 |
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
W. James Popham – Pearson, 2024
"Classroom Assessment" shows pre- and in-service teachers how to use classroom testing accurately and formatively to dramatically increase their teaching effectiveness and promote student learning. In addition to clear and concise guidelines on how to develop and use quality classroom assessments, the author also focuses on the teaching…
Descriptors: Student Evaluation, Testing, Teacher Effectiveness, Test Construction
Popham, W. James – Phi Delta Kappan, 2014
The tests we use to evaluate student achievement may well be sound measures of what students know, but they are faulty indicators at best of how well they have been taught. A remedy to this this situation of judging teachers by the performance of their students on high-stakes tests may be in hand already. We should look to the methods successfully…
Descriptors: High Stakes Tests, Academic Achievement, Teacher Evaluation, Evaluation Methods
Zwick, Rebecca – ETS Research Report Series, 2012
Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…
Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods
Hughes, Katherine L.; Scott-Clayton, Judith – Community College Research Center, Columbia University, 2011
Placement exams are high-stakes assessments that determine many students' college trajectories. More than half of entering students at community colleges are placed into developmental education in at least one subject, based primarily on scores from these assessments, yet recent research fails to find evidence that placement into remediation…
Descriptors: Community Colleges, Remedial Instruction, Educational Testing, Student Placement
Wright, Robert E. – College Student Journal, 2010
The use of standardized tests for outcome assessment has grown dramatically in recent years. Two driving factors have been the No Child Left Behind legislation, and the increase in outcome assessment measures by accrediting agencies such as AACSB, the international accrediting body for business schools. Despite the growth in usage, little effort…
Descriptors: College Outcomes Assessment, Educational Testing, Standardized Tests, Accreditation (Institutions)
Bramley, Tom; Gill, Tim – Research Papers in Education, 2010
The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…
Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Hughes, Katherine L.; Scott-Clayton, Judith – Community College Research Center, Columbia University, 2010
Placement exams are high-stakes assessments that determine many students' college trajectories. More than half of entering students at community colleges are placed into developmental education in at least one subject, based primarily on scores from these assessments, yet recent research fails to find evidence that placement into remediation…
Descriptors: Community Colleges, Remedial Instruction, Literature Reviews, High Stakes Tests
Stone, Elizabeth; Cook, Linda – Educational Testing Service, 2009
Research studies have shown that a smaller percentage of students with learning disabilities participate in state assessments than do their peers without learning disabilities. Furthermore, there is almost always a performance gap between these groups of students on these assessments. It is important to evaluate whether a performance gap on a…
Descriptors: Learning Disabilities, State Standards, Educational Testing, Science Tests
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
A Culture of Evidence: An Evidence-Centered Approach to Accountability for Student Learning Outcomes
Millett, Catherine M.; Payne, David G.; Dwyer, Carol A.; Stickler, Leslie M.; Alexiou, Jon J. – Educational Testing Service, 2008
This paper presents a framework that institutions of higher education can use to improve, revise and introduce comprehensive systems for the collection and dissemination of information on student learning outcomes. For faculty and institutional leaders grappling with the many issues and nuances inherent in assessing student learning, the framework…
Descriptors: Higher Education, Educational Testing, Accountability, Outcomes of Education
Bagnato, Stephen J.; Macy, Marisa – NHSA Dialog, 2010
Authentic assessment is a growing alternative to conventional testing. This research-to-practice article describes a framework for implementing authentic assessment. The R-E-A-L framework shows how roles, equipment, assessment tools, and location can be incorporated into early childhood practices.
Descriptors: Early Childhood Education, Performance Based Assessment, Program Implementation, Guidelines

Goodwin, Laura D.; Leech, Nancy L. – Measurement and Evaluation in Counseling and Development, 2003
The treatment of validity in the newest edition of "Standards for Educational and Psychological Testing" is quite different from coverage in earlier editions of the Standards and in most measurement textbooks. The view of validity in the 1999 Standards is discussed, and suggestions for instructors of measurement courses are offered. (Contains 56…
Descriptors: Educational Testing, Evaluation Methods, Psychological Testing, Standards
Macy, Marisa; Bagnato, Stephen J. – NHSA Dialog, 2010
The inclusion of young children with disabilities has remained a function of the Head Start program since its inception in the 1960s when the United States Congress mandated that children with disabilities comprise 10% of the Head Start enrollment (Zigler & Styfco, 2000). Standardized, norm-referenced tests used to identify children with…
Descriptors: Performance Based Assessment, Disadvantaged Youth, Norm Referenced Tests, Disabilities
Hosp, Michelle K.; Hosp, John L.; Howell, Kenneth W. – Guilford Publications, 2007
This pragmatic, accessible book presents an empirically supported conceptual framework and hands-on instructions for conducting curriculum-based measurement (CBM) in grades K-8. The authors provide everything needed to evaluate student learning in reading, spelling, writing, and math; graph the resulting data; and use this information to make…
Descriptors: Spelling, Curriculum Based Assessment, Elementary Education, Evaluation Methods