Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
Lee, Yi-Hsuan; Ip, Edward H.; Fuh, Cheng-Der – Educational and Psychological Measurement, 2008
Although computerized adaptive tests have enjoyed tremendous growth, solutions for important problems remain unavailable. One problem is the control of item exposure rate. Because adaptive algorithms are designed to select optimal items, they choose items with high discriminating power. Thus, these items are selected more often than others,…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Test Validity
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
Eklof, Hanna – Assessment in Education: Principles, Policy & Practice, 2010
An achievement test score can be viewed as a joint function of skill and will, of knowledge and motivation. However, when interpreting and using test scores, the "will" part is not always acknowledged and scores are mostly interpreted and used as pure measures of student knowledge. This paper argues that students' motivation to do their…
Descriptors: Foreign Countries, Achievement Tests, Scores, Test Wiseness
Polzer, Katherine – Journal of Offender Rehabilitation, 2010
Drug courts are reinventing the drug testing framework by experimenting with new methods, including use of the sweat patch. The sweat patch is a Band-Aid-like strip used to monitor drug court participants. The validity and reliability of the sweat patch as an effective testing method were examined, as well as the effectiveness, meaning how likely…
Descriptors: Courts, Drug Use, Program Effectiveness, Drug Use Testing
Merrigan, Teresa E. – ProQuest LLC, 2012
The purpose of the current study was to evaluate the psychometric properties of alternative approaches to administering and scoring curriculum-based measurement for written expression. Specifically, three response durations (3, 5, and 7 minutes) and six score types (total words written, words spelled correctly, percent of words spelled correctly,…
Descriptors: Curriculum Based Assessment, Testing, Scoring, Writing Tests
Hasson, Natalie; Dodd, Barbara; Botting, Nicola – International Journal of Language & Communication Disorders, 2012
Background: Sentence construction and syntactic organization are known to be poor in children with specific language impairments (SLI), but little is known about the way in which children with SLI approach language tasks, and static standardized tests contribute little to the differentiation of skills within the population of children with…
Descriptors: Alternative Assessment, Sentence Structure, Syntax, Language Processing
Goldwater, Paul M.; Fogarty, Timothy J. – Behaviour & Information Technology, 2012
As accounting education transitions to more distance-learning formats, the integrity of student evaluation continues to serve as an obstacle to adoption. Greater technological possibilities will be opposed if faculty members believe that testing is compromised. This article investigates whether students taking exams remotely (and under no…
Descriptors: Student Evaluation, Accounting, Testing, Distance Education
Stone, Elizabeth; Cook, Linda – Educational Testing Service, 2009
Research studies have shown that a smaller percentage of students with learning disabilities participate in state assessments than do their peers without learning disabilities. Furthermore, there is almost always a performance gap between these groups of students on these assessments. It is important to evaluate whether a performance gap on a…
Descriptors: Learning Disabilities, State Standards, Educational Testing, Science Tests
Goldstein, Jessica; Behuniak, Peter – Assessment for Effective Intervention, 2011
State-level testing programs continue to grow, and the challenge of validation does not wane. Although more than a decade has passed since the 1999 Joint Standards for Educational and Psychological Testing set out a call for the organization of validity evidence into validity arguments, practical examples of such arguments are not readily…
Descriptors: Testing Programs, State Programs, Alternative Assessment, Test Validity
Keiser, Ashley; Reddy, Linda – Journal of Applied School Psychology, 2013
The Pediatric Attention Disorders Diagnostic Screener is a multidimensional, computerized screening tool designed to assess attention and global aspects of executive functioning in children at risk for attention disorders. The screener consists of a semi-structured diagnostic interview, brief parent and teacher rating scales, 3 computer-based…
Descriptors: Screening Tests, Computer Assisted Testing, Children, At Risk Persons
Liu, Kristin K.; Goldstone, Linda; Thurlow, Martha L.; Ward, Jenna; Hatten, James; Christensen, Laurene L. – National Center on Educational Outcomes, 2013
English language learners (ELLs) with disabilities are an increasing presence in schools in the United States. Title I and Title III of the Elementary and Secondary Education Act require that these students meet the same academic grade-level standards and participate in content assessments as their fluent-English speaking peers without…
Descriptors: English Language Learners, Disabilities, State Standards, Standardized Tests
Zhao, Zhongbao – RELC Journal: A Journal of Language Teaching and Research, 2013
This study investigates the validity of the Diagnostic College English Speaking Test (DCEST) in the context of EFL teaching and learning in China. The experiment was conducted in three stages over the course of eight weeks at a national key university in China. By means of test administration and questionnaire survey, the researcher gathered…
Descriptors: Oral Language, Construct Validity, Language Tests, Diagnostic Tests
Sparks, Sarah D. – Education Week, 2011
As Congress debates how to structure the next iteration of federal school accountability, a new national study has raised serious concerns about the effectiveness of test-based incentives to improve education. A blue-ribbon committee of the National Academies' National Research Council undertook a nearly decade-long study of test-based incentive…
Descriptors: Federal Legislation, Incentives, Educational Improvement, Federal Programs
Cheng, Liying; DeLuca, Christopher – Educational Assessment, 2011
Test-takers' interpretations of validity as related to test constructs and test use have been widely debated in large-scale language assessment. This study contributes further evidence to this debate by examining 59 test-takers' perspectives in writing large-scale English language tests. Participants wrote about their test-taking experiences in…
Descriptors: Language Tests, Test Validity, Test Use, English
Wiliam, Dylan – Educational Psychologist, 2010
This article explores the use of standardized tests to hold schools accountable. The history of testing for accountability is reviewed, and it is shown that currently between-school differences account for less than 10% of the variance in student scores, in part because the progress of individuals is small compared to the spread of achievement…
Descriptors: Testing, Standardized Tests, Accountability, Inferences

