Publication Date
| In 2026 | 0 |
| Since 2025 | 48 |
| Since 2022 (last 5 years) | 210 |
| Since 2017 (last 10 years) | 491 |
| Since 2007 (last 20 years) | 983 |
Descriptor
| Test Validity | 3907 |
| Test Reliability | 1517 |
| Testing | 1089 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 615 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 493 |
| Higher Education | 489 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Lenz, A. Stephen; Ault, Haley; Balkin, Richard S.; Barrio Minton, Casey; Erford, Bradley T.; Hays, Danica G.; Kim, Bryan S. K.; Li, Chi – Measurement and Evaluation in Counseling and Development, 2022
In April 2021, The Association for Assessment and Research in Counseling Executive Council commissioned a time-referenced task group to revise the Responsibilities of Users of Standardized Tests (RUST) Statement (3rd edition) published by the Association for Assessment in Counseling (AAC) in 2003. The task group developed a work plan to implement…
Descriptors: Responsibility, Standardized Tests, Counselor Training, Ethics
Daniella Winter; Yoram Braw – Journal of Attention Disorders, 2022
Background: The current study aimed to validate the utility of previously established validity indicators derived from MOXO-d-CPT's continuous performance test. Method: Healthy simulators feigned impairment after searching online for relevant information, an ecologically valid coaching condition (n = 39). They were compared to ADHD patients (n =…
Descriptors: Foreign Countries, Undergraduate Students, Attention Deficit Hyperactivity Disorder, Computer Simulation
Endang Susantini; Yurizka Melia Sari; Prima Vidya Asteria; Muhammad Ilyas Marzuqi – Journal of Education and Learning (EduLearn), 2025
Assessing preservice' higher order thinking skills (HOTS) in science and mathematics is essential. Teachers' HOTS ability is closely related to their ability to create HOTS-type science and mathematics problems. Among various types of HOTS, one is Bloomian HOTS. To facilitate the preservice teacher to create problems in those subjects, an Android…
Descriptors: Content Validity, Mathematics Instruction, Decision Making, Thinking Skills
Abedi, Jamal; Zhang, Yu; Rowe, Susan E.; Lee, Hansol – Educational Measurement: Issues and Practice, 2020
Research indicates that the performance-gap between English Language Learners (ELLs) and their non-ELL peers is partly due to ELLs' difficulty in understanding assessment language. Accommodations have been shown to narrow this performance-gap, but many accommodations studies have not used a randomized design and are based on relatively small…
Descriptors: English Language Learners, Achievement Gap, Mathematics Tests, Standards
Davis, Marcia H.; Wang, Wenhao; Kingston, Neal M.; Hock, Michael; Tonks, Stephen M.; Tiemann, Gail – Grantee Submission, 2020
Background: The importance of reading motivation has led to the development of a large number of self-report reading motivation measures; however, there is still a need for a usable measure of adolescent reading motivation that captures a large number of theoretically and empirically distinct constructs. Methods: The current paper details the…
Descriptors: Reading Motivation, Computer Assisted Testing, Adaptive Testing, Measures (Individuals)
Burge, Bethan; Benson, Louise – National Foundation for Educational Research, 2021
Since its introduction in 2017, NFER has been contracted by Ofqual to develop, deliver and analyse the results of the National Reference Test (NRT) in English and maths. The NRT is administered annually and shows if student performance in English and maths at GCSE level has changed from year to year. The NRT results are based on analysis of data…
Descriptors: National Competency Tests, Test Results, English, Mathematics Tests
Hartmann, Stefan; Güzel, Emre; Gschwendtner, Tobias – Empirical Research in Vocational Education and Training, 2023
We investigated the ecological validity of performance measures from a computer-based assessment tool that utilises scripted video vignettes. The intended purpose of this tool is to assess the maintenance and repair skills of automotive technician apprentices, complementing traditional hands-on assessment formats from the German journeymen's…
Descriptors: Performance Based Assessment, Computer Assisted Testing, Auto Mechanics, Job Skills
Qian, Meihua; Wang, Xianyong – Journal of Creative Behavior, 2020
Creativity has been well studied in the past several decades, and numerous measures have been developed to assess creativity. However, validity evidence associated with each measure is often mixed. In particular, the social consequence aspect of validity has received little attention. This is partly due to the difficulty of testing for…
Descriptors: Item Response Theory, Testing, Creativity Tests, Creative Thinking
Delnoij, Laurie E. C.; Janssen, José P. W; Dirkx, Kim J. H.; Martens, Rob L. – International Journal of Assessment Tools in Education, 2022
Informed study decisions are pivotal for student retention in higher online education. A self-assessment prior to enrolment has been proposed as a promising approach to enable informed decision-making and to build resources for retention. To determine whether such a self-assessment affects the decision-making process as intended, thorough and…
Descriptors: Computer Assisted Testing, Self Evaluation (Individuals), Decision Making, Test Validity
Divayana, Dewa Gede Hendra; Sudirtha, I. Gede; Suartama, I. Kadek – International Journal of Instruction, 2021
This research had the main objective to develop a new form/design in the development of test instruments that valid and reliable. The form of the test instrument that was developed adopted the Superitem concept and was integrated into software called Wondershare. Through the integration, digital format test instruments were realized with the…
Descriptors: Computer Assisted Testing, Distance Education, Assessment Literacy, Test Construction
Yoshioka, Sérgio R. I.; Ishitani, Lucila – Informatics in Education, 2018
Computerized Adaptive Testing (CAT) is now widely used. However, inserting new items into the question bank of a CAT requires a great effort that makes impractical the wide application of CAT in classroom teaching. One solution would be to use the tacit knowledge of the teachers or experts for a pre-classification and calibrate during the…
Descriptors: Student Motivation, Adaptive Testing, Computer Assisted Testing, Item Response Theory
Anne-Mai Meesak; Dmitri Rozgonjuk; Tiia Õun; Eve Kikas – Education 3-13, 2024
Children's development during early childhood affects their well-being and educational success, but there are few reliable assessment instruments available. The aim of the study was to develop, pilot and validate an e-assessment instrument for assessing five-year-old children's development in cognitive processes, learning, language and…
Descriptors: Test Validity, Computer Assisted Testing, Measures (Individuals), Child Development
Stefan O'Grady – International Journal of Listening, 2025
Language assessment is increasingly computermediated. This development presents opportunities with new task formats and equally a need for renewed scrutiny of established conventions. Recent recommendations to increase integrated skills assessment in lecture comprehension tests is premised on empirical research that demonstrates enhanced construct…
Descriptors: Language Tests, Lecture Method, Listening Comprehension Tests, Multiple Choice Tests
Jung Youn, Soo – Language Testing, 2023
As access to smartphones and emerging technologies has become ubiquitous in our daily lives and in language learning, technology-mediated social interaction has become common in teaching and assessing L2 speaking. The changing ecology of L2 spoken interaction provides language educators and testers with opportunities for renewed test design and…
Descriptors: Test Construction, Test Validity, Second Language Learning, Telecommunications
Istiyono, Edi; Dwandaru, Wipsar Sunu Brams; Lede, Yulita Adelfin; Rahayu, Farida; Nadapdap, Amipa – International Journal of Instruction, 2019
The objective of this study was to develop Physics critical thinking skill test using computerized adaptive test (CAT) based on item response theory (IRT). This research was a development research using 4-D (define, design, develop, and disseminate). The content validity of the items was proven using Aiken's V. The test trial involved 252 students…
Descriptors: Critical Thinking, Thinking Skills, Cognitive Tests, Physics

Peer reviewed
Direct link
