O'Neil, Timothy P. – ProQuest LLC, 2010
With scant research to draw upon with respect to the maintenance of vertical scales over time, decisions around the creation and performance of vertical scales over time necessarily suffer due to the lack of information. Undetected item parameter drift (IPD) presents one of the greatest threats to scale maintenance within an item response theory…
Descriptors: Scaling, Measures (Individuals), Item Response Theory, Educational Assessment
Alonzo, Alicia C. – Measurement: Interdisciplinary Research and Perspectives, 2010
In their article "Innovations in Setting Performance Standards for K-12 Test-Based Accountability," Kristen Huff and Barbara S. Plake (2010) lay out three preconditions for continued investment in standard-setting methodology and practice, all focused on the sound development and use of achievement level descriptors (ALDs). Among these…
Descriptors: Standard Setting (Scoring), Achievement, Elementary Secondary Education, Accountability
Roberts, Mary Roduta; Gierl, Mark J. – Educational Measurement: Issues and Practice, 2010
This paper presents a framework to provide a structured approach for developing score reports for cognitive diagnostic assessments ("CDAs"). Guidelines for reporting and presenting diagnostic scores are based on a review of current educational test score reporting practices and literature from the area of information design. A sample diagnostic…
Descriptors: Diagnostic Tests, Scores, Technical Writing, Cognitive Tests
Quinlan, Thomas; Higgins, Derrick; Wolff, Susanne – Educational Testing Service, 2009
This report evaluates the construct coverage of the e-rater[R] scoring engine. The matter of construct coverage depends on whether one defines writing skill in terms of process or product. Originally, the e-rater engine consisted of a large set of components with a proven ability to predict human holistic scores. By organizing these capabilities…
Descriptors: Guides, Writing Skills, Factor Analysis, Writing Tests
Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009
Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…
Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior
Moon, Tonya R. – Gifted Child Quarterly, 2009
The myth equating high-stakes testing with rigor and difficulty is one that can be debunked given the empirical work that has been conducted in this area. To completely debunk this myth in gifted education, the field must centralize efforts. Educators need to consider alternatives to the current system of assessment and the delivery of…
Descriptors: Academically Gifted, Misconceptions, Testing, High Stakes Tests
Sawchuk, Stephen – Education Week, 2009
As the recession crimps education budgets, states are beginning to pare the number of standardized tests they give, particularly those that no longer factor into state or federal accountability decisions. At the district level, though, it's a different story. Despite pressure not to cut staffing and programs, many districts are preserving local…
Descriptors: Federal Legislation, Educational Testing, Educational Finance, Standardized Tests
Machin, Stephen; McNally, Sandra; Wyness, Gill – Educational Research, 2013
Background: Political devolution occurred in the UK in 1998-99, following many years in which some degree of policy administration had been devolved to the four nations. Since devolution, all four countries of the UK have pursued increasingly divergent education policies. This is true in England in particular, where diversity, choice and…
Descriptors: Foreign Countries, Educational Attainment, Educational Policy, Outcomes of Education
Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009
In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…
Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)
Lee, Young-Sun; Cohen, Allan; Toro, Maritsa – Asia Pacific Education Review, 2009
In this study, the effectiveness of detection of differential item functioning (DIF) and testlet DIF using SIBTEST and Poly-SIBTEST was examined in tests composed of testlets. An example using data from a reading comprehension test showed that results from SIBTEST and Poly-SIBTEST were not completely consistent in the detection of DIF and testlet…
Descriptors: Test Bias, Reading Comprehension, Simulation, Reading Tests
Popham, W. James – Educational Leadership, 2009
If a person were to ask an educator to identify the two most important attributes of an education test, the response most certainly would be "validity and reliability." These two tightly wedded concepts have become icons in the field of education assessment. As far as validity is concerned, the term doesn't refer to the accuracy of a test. Rather,…
Descriptors: Educational Testing, Educational Assessment, Student Evaluation, Test Reliability
Preuschoff, Anna Corinna – ProQuest LLC, 2011
As an extension of the effort devoted to updating the questionnaires for TIMSS and PIRLS 2011, this dissertation explored a new reporting strategy for contextual questionnaire data. The study investigated the feasibility of constructing "global indicators" from a large number of diverse background variables, which could provide policy…
Descriptors: Scaling, Learning Motivation, Questionnaires, Educational Environment
Walser, Nancy, Ed. – Harvard Education Press, 2011
"Harvard Education Letter" is published bimonthly at the Harvard Graduate School of Education. This issue of "Harvard Education Letter" contains the following articles: (1) Hybrid Schools for the iGeneration: New Schools Combine "Bricks" and "Clicks" (Brigid Schulte); (2) Dual Language Programs on the Rise: "Enrichment" Model Puts Content Learning…
Descriptors: Misconceptions, English (Second Language), Second Language Learning, Formative Evaluation
Carrell, Julia Louise – ProQuest LLC, 2011
Achievement goal theory is considered to be a well-researched field. However, this research has been primarily through surveys, and not enough attention has been paid to the cognitive aspects of how children perceive goals. Additionally, the mastery-avoidance construct is relatively new to the achievement goal literature, with little research to…
Descriptors: Objectives, Program Effectiveness, Academic Achievement, Mathematics Education
Watson, Robert Stephen – ProQuest LLC, 2010
This dissertation illuminates relationships between micro-level practices of schools and macro-level structures of society through the socio-historical lens of New York State Regents mathematics examinations, which were administered to public school students throughout the State of New York between 1866 and 2009, inclusive. Fundamental research…
Descriptors: Credentials, Educational History, Mathematics Tests, High Stakes Tests

