Publication Date
| In 2026 | 8 |
| Since 2025 | 2276 |
| Since 2022 (last 5 years) | 12791 |
| Since 2017 (last 10 years) | 33916 |
| Since 2007 (last 20 years) | 68407 |
Descriptor
| Foreign Countries | 30560 |
| Test Validity | 21743 |
| Scores | 18256 |
| Academic Achievement | 16928 |
| Test Construction | 16756 |
| Test Reliability | 15028 |
| Achievement Tests | 14859 |
| Standardized Tests | 14720 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13042 |
| Language Tests | 12551 |
Audience
| Practitioners | 5034 |
| Teachers | 3393 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 978 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
Location
| Turkey | 2822 |
| Australia | 2426 |
| Canada | 2270 |
| California | 1854 |
| United States | 1726 |
| Texas | 1615 |
| China | 1578 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1122 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Beseiso, Majdi; Alzubi, Omar A.; Rashaideh, Hasan – Journal of Computing in Higher Education, 2021
E-learning is gradually gaining prominence in higher education, with universities enlarging provision and more students getting enrolled. The effectiveness of automated essay scoring (AES) is thus holding a strong appeal to universities for managing an increasing learning interest and reducing costs associated with human raters. The growth in…
Descriptors: Automation, Scoring, Essays, Writing Tests
Currin, Elizabeth; Schroeder, Stephanie; McCardle, Todd – Teachers College Record, 2021
Background/Context: Opting out of high-stakes standardized tests, a phenomenon so widespread in the United States as to be regarded as a movement, is nevertheless a misunderstood and often maligned force in educational politics. Purpose: This article offers a counter-narrative of opt-out activism--a more thorough and vivid account of what we view…
Descriptors: Activism, High Stakes Tests, Standardized Tests, Politics of Education
Volfson, Alexander; Eshach, Haim; Ben-Abu, Yuval – Physical Review Physics Education Research, 2021
Science knowledge is reflected in mental models that students tend to form when dealing with science phenomena. One way to identify students' mental models about scientific concepts is the use of diagnostic tests (inventories). Even though several statistical approaches and tools intended for the analysis of such inventories' results exist in the…
Descriptors: Schemata (Cognition), Diagnostic Tests, Scientific Concepts, Multiple Choice Tests
Vo, Thao T.; Ullrich-French, Sarah; French, Brian F. – Journal of Psychoeducational Assessment, 2021
The Academic Intrinsic Motivation Scale (AIMS) measures key components of student intrinsic motivation (IM). We investigate score validity and reliability of a downward extension of the AIMS developed for students in the high school context using a sample of students from the Pacific Northwest region of the United States. Through classical test…
Descriptors: Psychometrics, Student Motivation, High School Students, Learning Motivation
Yu, Qiaona – Applied Linguistics, 2021
Language complexity reveals the ability to use a wide and varied range of sophisticated structures and vocabulary. Although different languages compose complexity differently, complexity measures such as the T-unit have typically been based on clause subordination, which may underrepresent complexity and threaten the validity of studies. This…
Descriptors: Chinese, Difficulty Level, Syntax, Language Proficiency
Quinn, David M.; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2021
The estimation of test score "gaps" and gap trends plays an important role in monitoring educational inequality. Researchers decompose gaps and gap changes into within- and between-school portions to generate evidence on the role schools play in shaping these inequalities. However, existing decomposition methods assume an equal-interval…
Descriptors: Scores, Tests, Achievement Gap, Equal Education
Thirakunkovit, Suthathip; Rhee, Seongha – THAITESOL Journal, 2021
This study explores the extent to which the difficulty levels of grammar items in an English test can be predicted by the complexity of grammatical structures. The researchers carried out two sets of analyses. In the first analysis, the item facility and item discrimination indices of 175 multiple-choice items were examined. In the second…
Descriptors: Grammar, Test Items, Difficulty Level, English (Second Language)
Feinberg, Richard; Jurich, Daniel; Wise, Steven L. – Applied Measurement in Education, 2021
Previous research on rapid responding tends to implicitly consider examinees as either engaging in solution behavior or purely guessing. However, particularly in a high-stakes testing context, examinees perceiving that they are running out of time may consider the remaining items for less time than necessary to provide a fully informed response,…
Descriptors: High Stakes Tests, Reaction Time, Response Style (Tests), Licensing Examinations (Professions)
Xiao, Yue; He, Qiwei; Veldkamp, Bernard; Liu, Hongyun – Journal of Computer Assisted Learning, 2021
The response process of problem-solving items contains rich information about respondents' behaviours and cognitive process in the digital tasks, while the information extraction is a big challenge. The aim of the study is to use a data-driven approach to explore the latent states and state transitions underlying problem-solving process to reflect…
Descriptors: Problem Solving, Competence, Markov Processes, Test Wiseness
Barnes, Amy C. – New Directions for Student Leadership, 2021
This article explores the ethical use of assessments in leadership training, education, and development. From the importance of having well-trained facilitators to the consideration of power and social identity in the interpretation of individual results, this article advocates for approaching the use of leadership assessments and inventories with…
Descriptors: Leadership, Measures (Individuals), Ethics, Test Use
Tabuena, Almighty C.; Morales, Glinore S. – Online Submission, 2021
This study identified and annotated appropriate test items using the multiple-choice test item format in the cognitive domain of the taxonomy of educational objectives in assessing and evaluating musical learning through the descriptive-developmental research design. This assessment approach is one of the key skills needed of Music teachers to…
Descriptors: Multiple Choice Tests, Test Items, Cognitive Objectives, Taxonomy
Becker, Stephen P. – Grantee Submission, 2021
Objective: To conduct a systematic review of the measures designed to assess sluggish cognitive tempo (SCT) since the first SCT scale using careful test-construction procedures was published in 2009. Methods: The MEDLINE (PubMed), Embase, PsychINFO, and Web of Science databases were searched from September 2009 through December 2019. Articles…
Descriptors: Conceptual Tempo, Test Reliability, Test Validity, Attention Deficit Hyperactivity Disorder
Moon, Jung Aa; Keehner, Madeleine; Katz, Irvin R. – Educational Measurement: Issues and Practice, 2019
The current study investigated how item formats and their inherent affordances influence test-takers' cognition under uncertainty. Adult participants solved content-equivalent math items in multiple-selection multiple-choice and four alternative grid formats. The results indicated that participants' affirmative response tendency (i.e., judge the…
Descriptors: Affordances, Test Items, Test Format, Test Wiseness
Aryadoust, Vahid; Foo, Stacy; Ng, Li Ying – Language Testing, 2022
The aim of this study was to investigate how test methods affect listening test takers' performance and cognitive load. Test methods were defined and operationalized as while-listening performance (WLP) and post-listening performance (PLP) formats. To achieve the goal of the study, we examined test takers' (N = 80) brain activity patterns…
Descriptors: Listening Comprehension Tests, Language Tests, Eye Movements, Brain Hemisphere Functions
Coniam, David; Lee, Tony; Milanovic, Michael; Pike, Nigel; Zhao, Wen – Language Education & Assessment, 2022
The calibration of test materials generally involves the interaction between empirical analysis and expert judgement. This paper explores the extent to which scale familiarity might affect expert judgement as a component of test validation in the calibration process. It forms part of a larger study that investigates the alignment of the…
Descriptors: Specialists, Language Tests, Test Validity, College Faculty

