Publication Date
| In 2026 | 3 |
| Since 2025 | 472 |
| Since 2022 (last 5 years) | 2430 |
| Since 2017 (last 10 years) | 6610 |
| Since 2007 (last 20 years) | 18014 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 2140 |
| Teachers | 1218 |
| Researchers | 1054 |
| Administrators | 485 |
| Policymakers | 455 |
| Students | 176 |
| Parents | 147 |
| Counselors | 100 |
| Community | 61 |
| Media Staff | 17 |
| Support Staff | 15 |
| More ▼ | |
Location
| Canada | 784 |
| Australia | 690 |
| United States | 582 |
| California | 569 |
| United Kingdom | 479 |
| Texas | 413 |
| Florida | 403 |
| Germany | 392 |
| New York | 378 |
| United Kingdom (England) | 369 |
| China | 361 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 17 |
| Meets WWC Standards with or without Reservations | 22 |
| Does not meet standards | 21 |
Jiang, Yuhong V.; Swallow, Khena M. – Cognition, 2013
Visual attention prioritizes information presented at particular spatial locations. These locations can be defined in reference frames centered on the environment or on the viewer. This study investigates whether incidentally learned attention uses a viewer-centered or environment-centered reference frame. Participants conducted visual search on a…
Descriptors: Attention Deficit Disorders, Attention, Probability, Incidental Learning
Wolf, Raffaela – ProQuest LLC, 2013
Preservation of equity properties was examined using four equating methods--IRT True Score, IRT Observed Score, Frequency Estimation, and Chained Equipercentile--in a mixed-format test under a common-item nonequivalent groups (CINEG) design. Equating of mixed-format tests under a CINEG design can be influenced by factors such as attributes of the…
Descriptors: Testing, Item Response Theory, Equated Scores, Test Items
Davis-Becker, Susan L.; Buckendahl, Chad W. – International Journal of Testing, 2013
A critical component of the standard setting process is collecting evidence to evaluate the recommended cut scores and their use for making decisions and classifying students based on test performance. Kane (1994, 2001) proposed a framework by which practitioners can identify and evaluate evidence of the results of the standard setting from (1)…
Descriptors: Standard Setting (Scoring), Evidence, Validity, Cutting Scores
Reed, Deborah K.; Cummings, Kelli D.; Schaper, Andrew; Biancarosa, Gina – Review of Educational Research, 2014
Recent studies indicate that examiners make a number of intentional and unintentional errors when administering reading assessments to students. Because these errors introduce construct-irrelevant variance in scores, the fidelity of test administrations could influence the results of evaluation studies. To determine how assessment fidelity is…
Descriptors: Fidelity, Reading Tests, Student Evaluation, Reading Research
Sturgis, Chris – International Association for K-12 Online Learning, 2014
This paper is part of a series investigating the implementation of competency education. The purpose of the paper is to explore how districts and schools can redesign grading systems to best help students to excel in academics and to gain the skills that are needed to be successful in college, the community, and the workplace. In order to make the…
Descriptors: Grading, Competency Based Education, Evaluation Methods, Evaluation Research
Blazer, Christie – Research Services, Miami-Dade County Public Schools, 2012
This report provides a summary of states currently administering end-of-course (EOC) assessments. The number of EOC assessments administered by states, the subjects most likely to have an associated EOC exam, and the purpose of EOC exams (for example, percentage of the final course grade and/or graduation requirement) are reviewed. State policies…
Descriptors: Standardized Tests, Student Evaluation, High School Students, Educational Testing
Tempel, Melissa Bollow – Rethinking Schools, 2012
Computerized testing, including the widely used MAP test, has infiltrated the public schools in Milwaukee and across the nation, bringing with it a frightening future for public education. High-stakes standardized tests can be scored almost immediately via the internet, and testing companies can now easily link districts to their online data…
Descriptors: Testing, Standardized Tests, Public Schools, Public Education
Li, Ying; Jiao, Hong; Lissitz, Robert W. – Journal of Applied Testing Technology, 2012
This study investigated the application of multidimensional item response theory (IRT) models to validate test structure and dimensionality. Multiple content areas or domains within a single subject often exist in large-scale achievement tests. Such areas or domains may cause multidimensionality or local item dependence, which both violate the…
Descriptors: Achievement Tests, Science Tests, Item Response Theory, Measures (Individuals)
Sarich, Edward – Language Testing in Asia, 2012
Standardized testing is ubiquitous in Japan. Inexpensive and easily mass distributed, their use has been encouraged at every level of the education system. Over the past thirty years, external testing agencies have been increasingly relied upon to make standardized tests for use as benchmarks in the education system and in the private sector.…
Descriptors: Accountability, Testing, Standardized Tests, Foreign Countries
Mckee, Steve – Language Testing in Asia, 2012
This review of research concerning reading comprehension provides incites into what has been learned from 1995 to the present. Reading comprehension is defined as a complex activity that involves several variables. Reading strategies are discussed and how they relate to reading comprehension. Testing is another concern regarding how reading…
Descriptors: Reading Comprehension, Reading Strategies, Testing, Reading Tests
Lohndal, Terje – ProQuest LLC, 2012
This dissertation attempts to unify two reductionist hypotheses: that there is no relational difference between specifiers and complements, and that verbs do not have thematic arguments. I argue that these two hypotheses actually bear on each other and that we get a better theory if we pursue both of them. The thesis is centered around the…
Descriptors: Hypothesis Testing, Semantics, Syntax, Verbs
Charalambous, Charalambos Y.; Kyriakides, Leonidas; Philippou, George N. – Studies in Educational Evaluation, 2012
This paper illustrates the application of existing guidelines to develop a test grounded in theoretical perspectives and empirical findings in the area of problem solving. By documenting this process, the paper outlines the challenges test developers face when seeking to construct a theory/research-driven test, discusses the decisions made at…
Descriptors: Guidelines, Test Construction, Problem Solving, Testing
Pollitt, Alastair – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton's article is valuable in many ways, especially for clarifying confusions and inconsistencies in the assessment business. Most importantly, he points out confusions that persist and where open discussion will help us understand what we say and what we mean to say. But I will focus here on the only faults I find in the article: three…
Descriptors: Validity, Evaluation, Definitions, Test Construction
Borsboom, Denny – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton provides an insightful and scholarly overview of central issues in validity theory. As he notes, many of the conceptual problems in validity theory derive from the fact that the word "validity" has two meanings. First, it indicates "whether a test measures what it purports to measure." This is a factual claim about the psychometric…
Descriptors: Validity, Psychometrics, Test Interpretation, Scores
Braun, Henry – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton is to be commended for addressing as challenging a topic as the clarification of the concept of validity. The impetus for this foray is Newton's judgment that, despite decades of development, the definition and elaboration of the term test validity in the 1999 "Standards" retains sufficient ambiguity to permit, if not invite, both…
Descriptors: Educational Improvement, Test Validity, Validity, Tests

Peer reviewed
Direct link
