Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 7 |
| Since 2017 (last 10 years) | 16 |
| Since 2007 (last 20 years) | 29 |
Descriptor
| Computer Assisted Testing | 51 |
| Test Reliability | 34 |
| Test Validity | 23 |
| Test Construction | 17 |
| Scoring | 15 |
| Reliability | 13 |
| Foreign Countries | 12 |
| Computer Software | 10 |
| Student Evaluation | 10 |
| Elementary Secondary Education | 8 |
| Scores | 8 |
| More ▼ | |
Source
Author
| Darling-Hammond, Linda | 2 |
| Asilkalkan, Abdullah | 1 |
| Attali, Yigal | 1 |
| Ault, Haley | 1 |
| Badger, Julia R. | 1 |
| Balkin, Richard S. | 1 |
| Barrio Minton, Casey | 1 |
| Bateson, Gordon | 1 |
| Beck, Klaus | 1 |
| Bonner, Cavan V. | 1 |
| Brick, J. Michael | 1 |
| More ▼ | |
Publication Type
| Reports - Descriptive | 51 |
| Journal Articles | 35 |
| Speeches/Meeting Papers | 8 |
| Reports - Evaluative | 2 |
| Book/Product Reviews | 1 |
| Information Analyses | 1 |
| Opinion Papers | 1 |
Education Level
Audience
| Researchers | 4 |
| Practitioners | 2 |
| Policymakers | 1 |
Location
| Australia | 4 |
| Connecticut | 2 |
| New Hampshire | 2 |
| New York | 2 |
| Rhode Island | 2 |
| United Kingdom (England) | 2 |
| Vermont | 2 |
| China | 1 |
| Europe | 1 |
| Germany | 1 |
| Hong Kong | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Every Student Succeeds Act… | 2 |
| Elementary and Secondary… | 1 |
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
McCaffrey, Daniel F.; Casabianca, Jodi M.; Ricker-Pedley, Kathryn L.; Lawless, René R.; Wendler, Cathy – ETS Research Report Series, 2022
This document describes a set of best practices for developing, implementing, and maintaining the critical process of scoring constructed-response tasks. These practices address both the use of human raters and automated scoring systems as part of the scoring process and cover the scoring of written, spoken, performance, or multimodal responses.…
Descriptors: Best Practices, Scoring, Test Format, Computer Assisted Testing
Sonique Sailsman; Emma El-Shami – Quarterly Review of Distance Education, 2024
Nurse educators at the undergraduate level spend significant time developing and revising exam questions. Following the exam administration, course faculty have the opportunity to complete an item analysis and question revision to improve reliability and validity. A challenge faculty face is tracking these exam changes when teaching as part of a…
Descriptors: Nursing Education, Nursing Students, College Faculty, Test Construction
Liou, Gloria; Bonner, Cavan V.; Tay, Louis – International Journal of Testing, 2022
With the advent of big data and advances in technology, psychological assessments have become increasingly sophisticated and complex. Nevertheless, traditional psychometric issues concerning the validity, reliability, and measurement bias of such assessments remain fundamental in determining whether score inferences of human attributes are…
Descriptors: Psychometrics, Computer Assisted Testing, Adaptive Testing, Data
Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025
This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…
Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests
Lenz, A. Stephen; Ault, Haley; Balkin, Richard S.; Barrio Minton, Casey; Erford, Bradley T.; Hays, Danica G.; Kim, Bryan S. K.; Li, Chi – Measurement and Evaluation in Counseling and Development, 2022
In April 2021, The Association for Assessment and Research in Counseling Executive Council commissioned a time-referenced task group to revise the Responsibilities of Users of Standardized Tests (RUST) Statement (3rd edition) published by the Association for Assessment in Counseling (AAC) in 2003. The task group developed a work plan to implement…
Descriptors: Responsibility, Standardized Tests, Counselor Training, Ethics
Maddox, Bryan – OECD Publishing, 2023
The digital transition in educational testing has introduced many new opportunities for technology to enhance large-scale assessments. These include the potential to collect and use log data on test-taker response processes routinely, and on a large scale. Process data has long been recognised as a valuable source of validation evidence in…
Descriptors: Measurement, Inferences, Test Reliability, Computer Assisted Testing
Heng Lu – PASAA: Journal of Language Teaching and Learning in Thailand, 2023
The test view is on the Duolingo English Test (DET), an alternative online English proficiency test with a machine-driven characteristic. The review covers essential information of the DET such as test purpose, usage, score-mapping with CEFR scale, price, and publisher. Meanwhile, the test usefulness is discussed with focuses on reliability,…
Descriptors: Computer Software, Computer Assisted Instruction, Second Language Learning, Second Language Instruction
Bateson, Gordon – International Journal of Computer-Assisted Language Learning and Teaching, 2021
As a result of the Japanese Ministry of Education's recent edict that students' written and spoken English should be assessed in university entrance exams, there is an urgent need for tools to help teachers and students prepare for these exams. Although some commercial tools already exist, they are generally expensive and inflexible. To address…
Descriptors: Test Construction, Computer Assisted Testing, Internet, Writing Tests
Choi, Youn-Jeng; Asilkalkan, Abdullah – Measurement: Interdisciplinary Research and Perspectives, 2019
About 45 R packages to analyze data using item response theory (IRT) have been developed over the last decade. This article introduces these 45 R packages with their descriptions and features. It also describes possible advanced IRT models using R packages, as well as dichotomous and polytomous IRT models, and R packages that contain applications…
Descriptors: Item Response Theory, Data Analysis, Computer Software, Test Bias
Shavelson, Richard J.; Zlatkin-Troitschanskaia, Olga; Beck, Klaus; Schmidt, Susanne; Marino, Julian P. – International Journal of Testing, 2019
Following employers' criticisms and recent societal developments, policymakers and educators have called for students to develop a range of generic skills such as critical thinking ("twenty-first century skills"). So far, such skills have typically been assessed by student self-reports or with multiple-choice tests. An alternative…
Descriptors: Critical Thinking, Cognitive Tests, Performance Based Assessment, Student Evaluation
Semsar, Katharine; Brownell, Sara; Couch, Brian A.; Crowe, Alison J.; Smith, Michelle K.; Summers, Mindi M.; Wright, Christian D.; Knight, Jennifer K. – Advances in Physiology Education, 2019
We describe the development of a new, freely available, online, programmatic-level assessment tool, Measuring Achievement and Progress in Science in Physiology, or Phys-MAPS (http://cperl.lassp.cornell.edu/bio-maps). Aligned with the conceptual frameworks of Core Principles of Physiology, and Vision and Change Core Concepts, Phys-MAPS can be used…
Descriptors: Physiology, Science Instruction, Science Tests, Computer Assisted Testing
Kaufman, Alan S. – Journal of Intelligence, 2021
U.S. Supreme Court justices and other federal judges are, effectively, appointed for life, with no built-in check on their cognitive functioning as they approach old age. There is about a century of research on aging and intelligence that shows the vulnerability of processing speed, fluid reasoning, visual-spatial processing, and working memory to…
Descriptors: Judges, Federal Government, Aging (Individuals), Decision Making
Badger, Julia R.; Mellanby, Jane – British Journal of Educational Psychology, 2018
Background: School attainment tests and Cognitive Abilities Tests are used in the United Kingdom to set targets for educational outcome. Whilst these are good predictors, they depend not only on basic ability but also on learnt knowledge and skills, such as reading. Method and Aims: VESPARCH is an online group test of verbal and spatial reasoning,…
Descriptors: Foreign Countries, Intelligence Tests, Verbal Ability, Spatial Ability
Magno, Carlo – UNESCO Bangkok, 2020
The COVID-19 pandemic has disrupted education across the globe leading countries to adapt how they administer and manage high-stakes examinations and large-scale learning assessments. This thematic review describes the measures that countries have taken, in terms of policies and practices, when learning assessments are disrupted by emergencies and…
Descriptors: High Stakes Tests, COVID-19, Pandemics, Cross Cultural Studies
Davis, Michelle R. – Education Week, 2013
Widespread technical failures and interruptions of recent online testing in a number of states have shaken the confidence of educators and policymakers in high-tech assessment methods and raised serious concerns about schools' technological readiness for the coming common-core online tests. The glitches arose as many districts in the 46 states…
Descriptors: Computer Assisted Testing, Testing Problems, Reliability, Public Schools

Peer reviewed
Direct link
