Publication Date
| In 2026 | 0 |
| Since 2025 | 27 |
| Since 2022 (last 5 years) | 113 |
| Since 2017 (last 10 years) | 280 |
| Since 2007 (last 20 years) | 517 |
Descriptor
| Testing Problems | 4850 |
| Elementary Secondary Education | 1262 |
| Test Validity | 1008 |
| Test Construction | 801 |
| Standardized Tests | 790 |
| Higher Education | 658 |
| Test Reliability | 607 |
| Student Evaluation | 583 |
| Testing | 564 |
| Test Bias | 562 |
| Achievement Tests | 555 |
Audience
| Practitioners | 248 |
| Researchers | 220 |
| Teachers | 81 |
| Administrators | 35 |
| Policymakers | 34 |
| Parents | 15 |
| Counselors | 13 |
| Students | 5 |
| Community | 3 |
| Support Staff | 2 |
Location
| Canada | 52 |
| Australia | 45 |
| California | 44 |
| United Kingdom | 37 |
| United States | 36 |
| United Kingdom (England) | 31 |
| China | 29 |
| Netherlands | 26 |
| Florida | 25 |
| New York | 25 |
| United Kingdom (Great Britain) | 24 |
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
Heather May Villalobos Pavia – ProQuest LLC, 2021
The English learner (EL) student population has grown steadily for the past 20 years. During this time, the use of standardized assessments has increased as well. Teacher understanding of assessment accommodations that best support ELs is low, despite the research that shows the unreliability of standardized achievement tests that measure the…
Descriptors: Standardized Tests, Testing Accommodations, Testing Problems, English Language Learners
Brittany N. Zakszeski; Heather E. Ormiston; Malena A. Nygaard; Kane Carlock – School Psychology Review, 2025
Despite the widespread use of school-based universal screening systems for social, emotional, and behavioral risk, limited research has examined discrepancies in ratings provided by teachers and their secondary students. Using the Social, Academic, and Emotional Behavior Risk Screener (SAEBRS; teacher report) and mySAEBRS (student report) scores…
Descriptors: Middle School Students, Middle School Teachers, Screening Tests, Affective Behavior
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
Lucía Torres-Sales; Begoña Vigo-Arrazola – Policy Futures in Education, 2025
This article critically analyzes the representation of standardised assessments in Spanish educational policies, particularly in the current Spanish education law (LOMLOE), in reports on the state of the Spanish educational system (2020-2023), and through the voices of teachers working in schools with special difficulties and their implications…
Descriptors: Foreign Countries, Educational Legislation, Standardized Tests, Educational Policy
Kalemdaroglu-Wheeler, Elif – ProQuest LLC, 2023
The purpose of this qualitative exploratory case study was to explore teachers' and administrators' perceptions of test score pollution deriving from COVID-19-related issues that may affect students' test scores on state-mandated standardized tests for grades six through 12 in a state along the Atlantic Coast of the United States. Four research…
Descriptors: Testing Problems, Scores, COVID-19, Pandemics
Laird, Robert D. – Developmental Psychology, 2020
Researchers are often inclined to test agreement or discrepancy hypotheses using difference scores. This commentary explains 2 mathematical-statistical principles underlying associations with difference scores and 2 conceptual-interpretation problems that make difference scores inappropriate for testing such hypotheses. The commentary provides…
Descriptors: Educational Research, Hypothesis Testing, Differences, Scores
Sánchez Sánchez, Ernesto; García Rios, Víctor N.; Silvestre Castro, Eleazar; Licea, Guadalupe Carrasco – North American Chapter of the International Group for the Psychology of Mathematics Education, 2020
In this paper, we address the following questions: What misconceptions do high school students exhibit in their first encounter with significance test problems through a repeated sampling approach? Which theory or framework could explain the presence and features of such patterns? With brief prior instruction on the use of Fathom software to…
Descriptors: High School Students, Misconceptions, Statistical Significance, Testing
Chun Wang; Ping Chen; Shengyu Jiang – Journal of Educational Measurement, 2020
Many large-scale educational surveys have moved from linear form design to multistage testing (MST) design. One advantage of MST is that it can provide more accurate latent trait [theta] estimates using fewer items than required by linear tests. However, MST generates incomplete response data by design; hence, questions remain as to how to…
Descriptors: Test Construction, Test Items, Adaptive Testing, Maximum Likelihood Statistics
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022
Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing Examination. Feinberg compared four approaches for reporting pass-fail decisions to the examinees with incomplete data on credentialing…
Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items
Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022
Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…
Descriptors: Ability, Tests, Equated Scores, Testing Problems
Ruth Nelson; Kristen Nichols-Besel; Sarah Tahtinen-Pacheco – Journal of College Reading and Learning, 2024
The number of immigrant and international multilingual learners enrolling in postsecondary education is on the rise. With this growth, there remain difficulties in identifying and supporting multilingual learners moving from K-12 to college due to demographic data collection procedures at the postsecondary level. Postsecondary institutions are…
Descriptors: Multilingualism, Bilingual Students, College Students, Urban Universities
Salmani Nodoushan, Mohammad Ali – Online Submission, 2021
It has been argued in the literature on (language) testing that any act of testing/assessment can impact: (1) educators' curriculum design; (2) teachers' teaching practices; and (3) students' learning behaviors. This quality of any given testing situation or act of assessment has been called washback, or backwash if you will. Washback falls into…
Descriptors: Testing Problems, Language Tests, Second Language Learning, Second Language Instruction
Hastedt, Dirk; Rocher, Thierry – International Association for the Evaluation of Educational Achievement, 2020
International large-scale assessments (ILSAs) are one of the most important tools policymakers and other educational stakeholders have to inform evidence-based decision making for educational reform. Despite this, and the widespread use of ILSA data, results are sometimes misunderstood or misinterpreted. Here, we offer a brief guide to ILSAs and…
Descriptors: International Assessment, Educational Assessment, Educational Change, Achievement Tests
Esra Sözer Boz – Education and Information Technologies, 2025
International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…
Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students
Bramley, Tom; Crisp, Victoria – Assessment in Education: Principles, Policy & Practice, 2019
For many years, question choice has been used in some UK public examinations, with students free to choose which questions they answer from a selection (within certain parameters). There has been little published research on choice of exam questions in recent years in the UK. In this article we distinguish different scenarios in which choice…
Descriptors: Test Items, Test Construction, Difficulty Level, Foreign Countries

