Publication Date
| In 2026 | 0 |
| Since 2025 | 48 |
| Since 2022 (last 5 years) | 210 |
| Since 2017 (last 10 years) | 491 |
| Since 2007 (last 20 years) | 983 |
Descriptor
| Test Validity | 3907 |
| Test Reliability | 1517 |
| Testing | 1089 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 615 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 493 |
| Higher Education | 489 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Xiong, Yao; Schunn, Christian D.; Wu, Yong – Journal of Computer Assisted Learning, 2023
Background: For peer assessment, reliability (i.e., consistency in ratings across peers) and validity (i.e., consistency of peer ratings with instructors or experts) are frequently examined in the research literature to address a central concern of instructors and students. Although the average levels are generally promising, both reliability and…
Descriptors: Peer Evaluation, Computer Assisted Testing, Test Reliability, Test Validity
Guzman-Orth, Danielle; Steinberg, Jonathan; Albee, Traci – Language Testing, 2023
Standardizing accessible test design and development to meet students' individual access needs is a complex task. The following study provides one approach to accessible test design and development using participatory design methods with school community members. Participatory research provides opportunities to empower collaborators by co-creating…
Descriptors: English Language Learners, Blindness, Visual Impairments, Testing Accommodations
Van Norman, Ethan R.; Forcht, Emily R. – Assessment for Effective Intervention, 2023
This study explored the validity of growth on two computer adaptive tests, Star Reading and Star Math, in explaining performance on an end-of-year achievement test for a sample of students in Grades 3 through 6. Results from quantile regression analyses indicate that growth on Star Reading explained a statistically significant amount of variance…
Descriptors: Test Validity, Computer Assisted Testing, Adaptive Testing, Grade Prediction
New York State Education Department, 2022
The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Paper-Based Field Tests. School administrators must be thoroughly familiar with the contents of the manual, and the policies and procedures must be followed as written…
Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing
Computerized Adaptive Assessment of Understanding of Programming Concepts in Primary School Children
Hogenboom, Sally A. M.; Hermans, Felienne F. J.; Van der Maas, Han L. J. – Computer Science Education, 2022
Background and Context: Valid assessment of understanding of programming concepts in primary school children is essential to implement and improve programming education. Objective: We developed and validated the Computerized Adaptive Programming Concepts Test (CAPCT) with a novel application of Item Response Theory. The CAPCT is a web-based and…
Descriptors: Computer Assisted Testing, Adaptive Testing, Programming, Knowledge Level
Ramsey Lee Cardwell – ProQuest LLC, 2022
The emergence of digital-first assessments is prompting reconsideration of, and innovation in, aspects of psychometrics, test validation, and test use. Using the Duolingo English Test (DET) as an example, this three-paper series seeks to address issues concerning the estimation of classification consistency and the reporting of results for such…
Descriptors: Classification, Reliability, Language Proficiency, Computer Assisted Testing
Mohd Norlizam Mohd Razali; Aida Hanim A. Hamid; Bity Salwana Alias; Azlin Norhaini Mansor – Journal of Education and Learning (EduLearn), 2025
A teacher competency instrument was developed to determine the level of teacher competency in small schools in Peninsular Malaysia. This study was conducted in Perak and Negeri Sembilan to determine the instrument's reliability and validity. Exploratory factor analysis (EFA) and item reliability analysis were used to determine the questionnaire's…
Descriptors: Foreign Countries, Elementary Secondary Education, Small Schools, Rural Schools
Yasuda, Jun-ichiro; Hull, Michael M.; Mae, Naohiro – Physical Review Physics Education Research, 2022
This paper presents improvements made to a computerized adaptive testing (CAT)-based version of the FCI (FCI-CAT) in regards to test security and test efficiency. First, we will discuss measures to enhance test security by controlling for item overexposure, decreasing the risk that respondents may (i) memorize the content of a pretest for use on…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Risk Management
Collin Shepley; Amanda Leigh Duncan; Anthony P. Setari – Journal of Early Intervention, 2025
The provision of progress monitoring within publicly funded early childhood classrooms is legally required, supported by empirical research, and recommended by early childhood professional organizations, for teachers providing Part B services under the Individuals with Disabilities Education Act. Despite the widespread recognition of progress…
Descriptors: Progress Monitoring, Measures (Individuals), Test Construction, Test Validity
Sukru Murat Cebeci; Selcuk Acar – Journal of Creative Behavior, 2025
This study presents the Cebeci Test of Creativity (CTC), a novel computerized assessment tool designed to address the limitations of traditional open-ended paper-and-pencil creativity tests. The CTC is designed to overcome the challenges associated with the administration and manual scoring of traditional paper and pencil creativity tests. In this…
Descriptors: Creativity, Creativity Tests, Test Construction, Test Validity
Andreea Dutulescu; Stefan Ruseti; Denis Iorga; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2025
Automated multiple-choice question (MCQ) generation is valuable for scalable assessment and enhanced learning experiences. How-ever, existing MCQ generation methods face challenges in ensuring plausible distractors and maintaining answer consistency. This paper intro-duces a method for MCQ generation that integrates reasoning-based explanations…
Descriptors: Automation, Computer Assisted Testing, Multiple Choice Tests, Natural Language Processing
New York State Education Department, 2022
The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Field Tests, and the Elementary-level (Grade 5) and Intermediate-level (Grade 8) Science Field Tests. School administrators must be thoroughly familiar with the…
Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing
James Soland – Journal of Research on Educational Effectiveness, 2024
When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…
Descriptors: Item Response Theory, Testing, Test Validity, Intervention
Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025
The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveal that a number of evaluation studies have been done by Indonesian scholars in the…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Goodwin, Amanda P.; Petscher, Yaacov; Tock, Jamie; McFadden, Sara; Reynolds, Dan; Lantos, Tess; Jones, Sara – Assessment for Effective Intervention, 2022
Assessment of language skills for upper elementary and middle schoolers is important due to the strong link between language and reading comprehension. Yet, currently few practical, reliable, valid, and instructionally informative assessments of language exist. This study provides validation evidence for Monster, P.I., which is a gamified,…
Descriptors: Adaptive Testing, Computer Assisted Testing, Language Tests, Vocabulary

Peer reviewed
Direct link
