Publication Date
| In 2026 | 0 |
| Since 2025 | 48 |
| Since 2022 (last 5 years) | 210 |
| Since 2017 (last 10 years) | 491 |
| Since 2007 (last 20 years) | 983 |
Descriptor
| Test Validity | 3907 |
| Test Reliability | 1517 |
| Testing | 1089 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 615 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 493 |
| Higher Education | 489 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
Josefina Sala-Roca; Aida Urrea-Monclús; Sara Rodríguez-Pérez – Electronic Journal of Research in Educational Psychology, 2025
Introduction: The Situational Judgment Test of Socioemotional Competence Development in Young People (DCSE-J) is a copyleft psychoeducational instrument for evaluating socioemotional competencies in adolescents. Different studies have analysed its psychometric validity, reliability and criterion validity. The present study aims to analyse…
Descriptors: Foreign Countries, Secondary School Students, College Freshmen, Social Emotional Learning
Jeff Allen; Ty Cruce – ACT Education Corp., 2025
This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…
Descriptors: College Entrance Examinations, Testing, Change, Scores
Rohr-Mentele, Silja; Forster-Heinzer, Sarah – Empirical Research in Vocational Education and Training, 2021
Competence development and measurement are of great interest to vocational education and training (VET). Although there are many instruments available for measuring competence in diverse settings, in many cases, the completed steps of validation are neither documented nor made transparent in a comprehensible manner. Understanding what an…
Descriptors: Foreign Countries, Vocational Education, Test Validity, Competence
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
Choi, Yun Deok – Language Testing in Asia, 2022
A much-debated question in the L2 assessment field is if computer familiarity should be considered a potential source of construct-irrelevant variance in computer-based writing (CBW) tests. This study aims to make a partial validity argument for an online source-based writing test (OSWT) designed for English placement testing (EPT), focusing on…
Descriptors: Test Validity, Scores, Computer Assisted Testing, English (Second Language)
Newton, Paul E. – Educational Measurement: Issues and Practice, 2020
Educational assessment involves eliciting, transmitting, and receiving information concerning the level of proficiency of a learner in a specified domain. With that in mind, it is perhaps surprising that the literature seems to make very little use of the signal processing metaphor. The present article begins by making a general case for greater…
Descriptors: Educational Assessment, Student Evaluation, Evaluative Thinking, Test Validity
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Yue Huang; Joshua Wilson – Journal of Computer Assisted Learning, 2025
Background: Automated writing evaluation (AWE) systems, used as formative assessment tools in writing classrooms, are promising for enhancing instruction and improving student performance. Although meta-analytic evidence supports AWE's effectiveness in various contexts, research on its effectiveness in the U.S. K-12 setting has lagged behind its…
Descriptors: Writing Evaluation, Writing Skills, Writing Tests, Writing Instruction
Farzaneh Saadati; Macarena Larrain; Anton Bastian; Patricio Felmer; Gabriele Kaiser – Journal of Curriculum Studies, 2024
Improving the effectiveness of teacher professional development programmes is crucial for enhancing education, and assessing teacher professional competence is vital. This study aimed at adapting and validating instruments originally developed in Germany as part of a follow-up study to TEDS-M (Teacher Education Development Study-Mathematics),…
Descriptors: Test Validity, Mathematics Teachers, Teacher Competency Testing, Foreign Countries
Tim Gill – Research Matters, 2024
Core Maths qualifications were introduced into the post-16 curriculum in England in 2014 to help students develop their quantitative and problem-solving skills. Taking the qualification should also give students confidence in understanding the mathematical content in other courses taken at the same time. In this article, we explore whether Core…
Descriptors: Foreign Countries, Mathematics Skills, Mathematics Tests, Minimum Competency Testing
Koretz, Daniel – American Educator, 2018
In "The Testing Charade: Pretending to Make Schools Better", the author's new book from which this article is drawn, the failures of test-based accountability are documented and some of the most egregious misuses and outright abuses of testing are described, along with some of the most serious negative effects. Neither good intentions…
Descriptors: Accountability, Testing, Testing Problems, Test Validity
Barry, Carol L.; Jones, Andrew T.; Ibáñez, Beatriz; Grambau, Marni; Buyske, Jo – Educational Measurement: Issues and Practice, 2022
In response to the COVID-19 pandemic, the American Board of Surgery (ABS) shifted from in-person to remote administrations of the oral certifying exam (CE). Although the overall exam architecture remains the same, there are a number of differences in administration and staffing costs, exam content, security concerns, and the tools used to give the…
Descriptors: COVID-19, Pandemics, Computer Assisted Testing, Verbal Tests
Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025
This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…
Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests
Collin Shepley; Anthony Setari; Amanda Leigh Duncan; Emily Webb – Assessment for Effective Intervention, 2025
Ongoing professional development is a critical component of high-quality early childhood education systems. To guide the content of such professional development, teacher and classroom quality assessments are often used. These assessments generally address universal or tier 1 instruction but omit information to guide teachers' practices to support…
Descriptors: Test Validity, Computer Assisted Testing, Teacher Evaluation, Performance Based Assessment
Che Lah, Noor Hidayah; Tasir, Zaidatun; Jumaat, Nurul Farhana – Educational Studies, 2023
The aim of the study was to evaluate the extended version of the Problem-Solving Inventory (PSI) via an online learning setting known as the Online Problem-Solving Inventory (OPSI) through the lens of Rasch Model analysis. To date, there is no extended version of the PSI for online settings even though many researchers have used it; thus, this…
Descriptors: Problem Solving, Measures (Individuals), Electronic Learning, Item Response Theory

