Publication Date
| In 2026 | 0 |
| Since 2025 | 451 |
| Since 2022 (last 5 years) | 2409 |
| Since 2017 (last 10 years) | 6589 |
| Since 2007 (last 20 years) | 17993 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 2140 |
| Teachers | 1216 |
| Researchers | 1054 |
| Administrators | 483 |
| Policymakers | 453 |
| Students | 176 |
| Parents | 147 |
| Counselors | 100 |
| Community | 61 |
| Media Staff | 17 |
| Support Staff | 15 |
| More ▼ | |
Location
| Canada | 784 |
| Australia | 690 |
| United States | 582 |
| California | 569 |
| United Kingdom | 479 |
| Texas | 413 |
| Florida | 403 |
| Germany | 391 |
| New York | 378 |
| United Kingdom (England) | 369 |
| China | 361 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 17 |
| Meets WWC Standards with or without Reservations | 22 |
| Does not meet standards | 21 |
Michael Bass; Scott Morris; Sheng Zhang – Measurement: Interdisciplinary Research and Perspectives, 2025
Administration of patient-reported outcome measures (PROs), using multidimensional computer adaptive tests (MCATs) has the potential to reduce patient burden, but the efficiency of MCAT depends on the degree to which an individual's responses fit the psychometric properties of the assessment. Assessing patients' symptom burden through the…
Descriptors: Adaptive Testing, Computer Assisted Testing, Patients, Outcome Measures
Nathaniel Owen; Ananda Senel – Review of Education, 2025
Transparency in high-stakes English language assessment has become crucial for ensuring fairness and maintaining assessment validity in language testing. However, our understanding of how transparency is conceptualised and implemented remains fragmented, particularly in relation to stakeholder experiences and technological innovations. This study…
Descriptors: Accountability, High Stakes Tests, Language Tests, Computer Assisted Testing
W. James Popham – Pearson, 2024
"Classroom Assessment" shows pre- and in-service teachers how to use classroom testing accurately and formatively to dramatically increase their teaching effectiveness and promote student learning. In addition to clear and concise guidelines on how to develop and use quality classroom assessments, the author also focuses on the teaching…
Descriptors: Student Evaluation, Testing, Teacher Effectiveness, Test Construction
Hwanggyu Lim; Kyung T. Han – Educational Measurement: Issues and Practice, 2024
Computerized adaptive testing (CAT) has gained deserved popularity in the administration of educational and professional assessments, but continues to face test security challenges. To ensure sustained quality assurance and testing integrity, it is imperative to establish and maintain multiple stable item pools that are consistent in terms of…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks
Samira Syal; Marcia Davis; Xiaodong Zhang; Jason Schoeneberger; Samantha Spinney; Douglas J. Mac Iver; Martha Mac Iver – Reading Psychology, 2024
Motivation to read is crucial to improving reading skill. While there is extensive research examining reading motivation among elementary students, with respect to adolescents, research is limited. Employing a person-centered approach can aid in developing a better understanding of adolescent reading motivation and would help address possible…
Descriptors: Reading Motivation, Adolescents, Reading Achievement, High School Students
Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024
To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…
Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design
Pan, Yiqin; Livne, Oren; Wollack, James A.; Sinharay, Sandip – Educational Measurement: Issues and Practice, 2023
In computerized adaptive testing, overexposure of items in the bank is a serious problem and might result in item compromise. We develop an item selection algorithm that utilizes the entire bank well and reduces the overexposure of items. The algorithm is based on collaborative filtering and selects an item in two stages. In the first stage, a set…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms
Rayne Bozeman; Robyn K. Mallett; Linas Mitchell; R. Scott Tindale – Active Learning in Higher Education, 2024
Two-phase testing assesses individual performance (phase 1) and then allows collaborative learning within small groups (phase 2). While groups typically outperform individuals, less is known about the social decision schemes that influence member collaboration. In a classroom setting, we compared individual and group performance on a standard test…
Descriptors: Testing, Group Testing, Cooperative Learning, Learning Experience
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Kirsten Lambert; Alison L. Hilton – Journal of Education Policy, 2025
This paper offers a brief yet evocative glimpse into marginalised pre-service teachers' (PST) experiences of teacher testing in Australia's High-stakes Literacy and Numeracy Test for Initial Teacher Education Students (LANTITE). Utilising Critical Disability Theory (CDT)in particular, Goodley's (2016) concept of neoliberal-ableism, we problematise…
Descriptors: Preservice Teachers, Foreign Countries, High Stakes Tests, Teacher Competency Testing
Wesley Maciejewski – Canadian Journal of Science, Mathematics and Technology Education, 2025
Mathematics is a rich and beautiful playground of ideas, the apex of human creativity and ingenuity, applicable to all the world around us: humankind's best effort at understanding and describing the universe. Mathematics examinations aren't this. They're uninspiring and trauma-inducing, with an emphasis on routine calculations and easy-to-grade…
Descriptors: Mathematics Tests, Testing, Evaluation Methods, Educational Quality
Li, Dongmei – Journal of Educational Measurement, 2022
Equating error is usually small relative to the magnitude of measurement error, but it could be one of the major sources of error contributing to mean scores of large groups in educational measurement, such as the year-to-year state mean score fluctuations. Though testing programs may routinely calculate the standard error of equating (SEE), the…
Descriptors: Error Patterns, Educational Testing, Group Testing, Statistical Analysis
ETS Research Institute, 2024
ETS experts are exploring and defining the standards for responsible AI use in assessments. A comprehensive framework and principles will be unveiled in the coming months. In the meantime, this document outlines the critical areas these standards will encompass, including the principles of: (1) Fairness and bias mitigation; (2) Privacy and…
Descriptors: Artificial Intelligence, Computer Assisted Testing, Educational Testing, Ethics
Jing Ma – ProQuest LLC, 2024
This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…
Descriptors: Scoring, Adaptive Testing, Test Items, Classification
V. N. Vimal Rao; Jeffrey K. Bye; Sashank Varma – Cognitive Research: Principles and Implications, 2024
The 0.05 boundary within Null Hypothesis Statistical Testing (NHST) "has made a lot of people very angry and been widely regarded as a bad move" (to quote Douglas Adams). Here, we move past meta-scientific arguments and ask an empirical question: What is the psychological standing of the 0.05 boundary for statistical significance? We…
Descriptors: Psychological Patterns, Statistical Analysis, Testing, Statistical Significance

Peer reviewed
Direct link
