Publication Date
| In 2026 | 0 |
| Since 2025 | 10 |
| Since 2022 (last 5 years) | 40 |
| Since 2017 (last 10 years) | 104 |
| Since 2007 (last 20 years) | 912 |
Descriptor
| Educational Testing | 4168 |
| Elementary Secondary Education | 899 |
| Student Evaluation | 882 |
| Academic Achievement | 756 |
| Educational Assessment | 664 |
| Evaluation Methods | 610 |
| Achievement Tests | 581 |
| Test Construction | 540 |
| Higher Education | 533 |
| Standardized Tests | 499 |
| Testing Problems | 468 |
| More ▼ | |
Source
Author
| Thurlow, Martha | 22 |
| Popham, W. James | 17 |
| Baker, Eva L. | 14 |
| Shipman, Virginia C. | 13 |
| Sinharay, Sandip | 13 |
| Ebel, Robert L. | 12 |
| Haney, Walt | 11 |
| Herman, Joan L. | 10 |
| Mislevy, Robert J. | 10 |
| Hartley, Nancy K. | 8 |
| Koretz, Daniel | 8 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 291 |
| Teachers | 138 |
| Researchers | 79 |
| Administrators | 78 |
| Policymakers | 67 |
| Students | 20 |
| Parents | 19 |
| Counselors | 9 |
| Community | 6 |
| Media Staff | 1 |
| Support Staff | 1 |
| More ▼ | |
Location
| California | 102 |
| Canada | 82 |
| Florida | 54 |
| Australia | 52 |
| United Kingdom | 51 |
| United Kingdom (England) | 50 |
| United States | 49 |
| New York | 47 |
| Texas | 42 |
| United Kingdom (Great Britain) | 28 |
| New Jersey | 27 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Zwick, Rebecca – ETS Research Report Series, 2012
Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…
Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods
Thompson, Nathan A. – Journal of Applied Testing Technology, 2008
The widespread application of personal computers to educational and psychological testing has substantially increased the number of test administration methodologies available to testing programs. Many of these mediums are referred to by their acronyms, such as CAT, CBT, CCT, and LOFT. The similarities between the acronyms and the methods…
Descriptors: Testing Programs, Psychological Testing, Classification, Educational Testing
Australian Council for Educational Research, 2015
The Australian Council for Educational Research (ACER) is one of the world's leading educational research centres. ACER's mission is to create and promote research-based knowledge, products and services that can be used to improve learning across the life span. This annual report describes ACER's milestones and accomplishments for the 2013-2014…
Descriptors: Foreign Countries, Educational Research, Annual Reports, Research Projects
Garrett, Kristi – Education Digest: Essential Readings Condensed for Quick Review, 2011
Measuring a teacher's effectiveness in quantifiable ways is a logical step in a society driven by the SMART goals (specific, measurable, attainable, relevant, and timely objectives) that pervade modern management. The idea of using student performance on standardized tests to judge a teacher's effectiveness picked up steam after the Obama…
Descriptors: Teacher Evaluation, Standardized Tests, Evaluation Methods, Teacher Effectiveness
Herman, Joan L. – Assessment and Accountability Comprehensive Center, 2010
The way forward to better assessment begins with the conception of assessment not as a single test but as a coherent "system" of measures. Coherent systems must be composed of valid measures of learning and be horizontally, developmentally, and vertically aligned to serve classroom, school, and district improvement. Coherent assessment…
Descriptors: Educational Assessment, Student Evaluation, Educational Testing, Systems Approach
Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Multivariate Behavioral Research, 2010
Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…
Descriptors: Educational Testing, Scores, Reports, Psychometrics
Packman, Sheryl; Camara, Wayne J.; Huff, Kristen – Educational Measurement: Issues and Practice, 2010
This paper provides a snapshot of educational measurement professionals--their educational, professional and demographic backgrounds, as well as their workplace settings, job tasks, professional involvement, and compensation practices. Two previous studies have surveyed employers, but this is the first attempt to conduct a comprehensive survey of…
Descriptors: Measurement, Educational Testing, Psychometrics, Compensation (Remuneration)
Camara, Wayne – College Board, 2011
This presentation was presented at the 2011 National Conference on Student Assessment (CCSSO). The focus of this presentation is how to validate the common core state standards (CCSS) in math and ELA and the subsequent assessments that will be developed by state consortia. The CCSS specify the skills students need to be ready for post-secondary…
Descriptors: College Readiness, Career Readiness, Benchmarking, Student Evaluation
Papay, John P. – American Educational Research Journal, 2011
Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…
Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests
Liekar, Christine Y. – ProQuest LLC, 2012
Since the time of Sputnik, American educators and policymakers have recognized the need to raise expectations by increasing rigor in high schools across the United States. Copious studies attest to the fact that students who take Advanced Placement coursework experience success in college (Adelman, 1999; Camara, 2003; College Board, 2005;…
Descriptors: High School Students, Advanced Placement Programs, Educational Policy, Educational Practices
Loeb, Susanna; Candelaria, Christopher A. – Carnegie Foundation for the Advancement of Teaching, 2012
Value-added models measure teacher performance by the test score gains of their students, adjusted for a variety of factors such as the performance of students when they enter the class. The measures are based on desired student outcomes such as math and reading scores, but they have a number of potential drawbacks. One of them is the…
Descriptors: Academic Achievement, Teacher Effectiveness, Scores, Peer Influence
Verhelst, Norman D. – Scandinavian Journal of Educational Research, 2012
When using IRT models in Educational Achievement Testing, the model is as a rule too simple to catch all the relevant dimensions in the test. It is argued that a simple model may nevertheless be useful but that it can be complemented with additional analyses. Such an analysis, called profile analysis, is proposed and applied to the reading data of…
Descriptors: Multidimensional Scaling, Profiles, Item Response Theory, Achievement Tests
Zhang, Ting – ProQuest LLC, 2013
This study was designed with the overall goal of understanding how difficulties in reading comprehension are associated with early adolescents' performance in large-scale assessments in subject domains including science and civic-related social studies. The current study extended previous research by taking a cognition-centered approach based on…
Descriptors: Early Adolescents, Science Tests, Social Studies, Evidence
Little, Jeri Lynn – ProQuest LLC, 2011
Although generally used for assessment, tests can also serve as tools for learning--but different test formats may not be equally beneficial. Specifically, research has shown multiple-choice tests to be less effective than cued-recall tests in improving the later retention of the tested information (e.g., see meta-analysis by Hamaker, 1986),…
Descriptors: Recall (Psychology), Multiple Choice Tests, Learning Processes, Educational Testing
Schutz, Dick – Education Policy Analysis Archives, 2013
The commentary (1) uses the U. S. National Assessment of Educational Progress (NAEP) as a prototype for examining standardized reading achievement tests at the item level, and (2) sketches an alternative based on an initiative underway in the United Kingdom.
Descriptors: Educational Testing, Educational Change, Achievement Tests, Reading Achievement

Peer reviewed
Direct link
