Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 10 |
Descriptor
Computer Assisted Testing | 14 |
Test Validity | 14 |
Test Construction | 7 |
Psychometrics | 4 |
Test Items | 4 |
Mathematics Tests | 3 |
Scoring | 3 |
Achievement Tests | 2 |
Adults | 2 |
Automation | 2 |
Decision Making | 2 |
More ▼ |
Source
Educational Measurement:… | 14 |
Author
Wise, Steven L. | 2 |
Abedi, Jamal | 1 |
Agrimson, Jared | 1 |
Arslan, Burcu | 1 |
Barry, Carol L. | 1 |
Bhola, Dennison S. | 1 |
Boyer, Michelle | 1 |
Burkhardt, Amy | 1 |
Buyske, Jo | 1 |
Gierl, Mark J. | 1 |
Gong, Tao | 1 |
More ▼ |
Publication Type
Journal Articles | 14 |
Reports - Research | 6 |
Reports - Descriptive | 4 |
Reports - Evaluative | 3 |
Opinion Papers | 2 |
Information Analyses | 1 |
Education Level
Secondary Education | 3 |
Junior High Schools | 2 |
Middle Schools | 2 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
Grade 8 | 1 |
More ▼ |
Audience
Researchers | 1 |
Location
Texas | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 1 |
What Works Clearinghouse Rating
Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025
Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…
Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation
Barry, Carol L.; Jones, Andrew T.; Ibáñez, Beatriz; Grambau, Marni; Buyske, Jo – Educational Measurement: Issues and Practice, 2022
In response to the COVID-19 pandemic, the American Board of Surgery (ABS) shifted from in-person to remote administrations of the oral certifying exam (CE). Although the overall exam architecture remains the same, there are a number of differences in administration and staffing costs, exam content, security concerns, and the tools used to give the…
Descriptors: COVID-19, Pandemics, Computer Assisted Testing, Verbal Tests
Lottridge, Sue; Burkhardt, Amy; Boyer, Michelle – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Sue Lottridge, Amy Burkhardt, and Dr. Michelle Boyer provide an overview of automated scoring. Automated scoring is the use of computer algorithms to score unconstrained open-ended test items by mimicking human scoring. The use of automated scoring is increasing in educational assessment programs because it allows…
Descriptors: Computer Assisted Testing, Scoring, Automation, Educational Assessment
Arslan, Burcu; Jiang, Yang; Keehner, Madeleine; Gong, Tao; Katz, Irvin R.; Yan, Fred – Educational Measurement: Issues and Practice, 2020
Computer-based educational assessments often include items that involve drag-and-drop responses. There are different ways that drag-and-drop items can be laid out and different choices that test developers can make when designing these items. Currently, these decisions are based on experts' professional judgments and design constraints, rather…
Descriptors: Test Items, Computer Assisted Testing, Test Format, Decision Making
Schneider, M. Christina; Agrimson, Jared; Veazey, Mary – Educational Measurement: Issues and Practice, 2022
This paper presents results of a score interpretation study for a computer adaptive mathematics assessment. The study purpose was to test the efficacy of item developers' alignment of items to Range Achievement-Level Descriptors (RALDs; Egan et al.) against the empirical achievement-level alignment of items to investigate the use of RALDs as the…
Descriptors: Computer Assisted Testing, Mathematics Tests, Scores, Grade 3
Yamamoto, Kentaro; Shin, Hyo Jeong; Khorramdel, Lale – Educational Measurement: Issues and Practice, 2018
A multistage adaptive testing (MST) design was implemented for the Programme for the International Assessment of Adult Competencies (PIAAC) starting in 2012 for about 40 countries and has been implemented for the 2018 cycle of the Programme for International Student Assessment (PISA) for more than 80 countries. Using examples from PISA and PIAAC,…
Descriptors: International Assessment, Foreign Countries, Achievement Tests, Test Validity
Wise, Steven L. – Educational Measurement: Issues and Practice, 2017
The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time
Abedi, Jamal; Zhang, Yu; Rowe, Susan E.; Lee, Hansol – Educational Measurement: Issues and Practice, 2020
Research indicates that the performance-gap between English Language Learners (ELLs) and their non-ELL peers is partly due to ELLs' difficulty in understanding assessment language. Accommodations have been shown to narrow this performance-gap, but many accommodations studies have not used a randomized design and are based on relatively small…
Descriptors: English Language Learners, Achievement Gap, Mathematics Tests, Standards
Gierl, Mark J.; Lai, Hollis – Educational Measurement: Issues and Practice, 2016
Testing organization needs large numbers of high-quality items due to the proliferation of alternative test administration methods and modern test designs. But the current demand for items far exceeds the supply. Test items, as they are currently written, evoke a process that is both time-consuming and expensive because each item is written,…
Descriptors: Test Items, Test Construction, Psychometrics, Models

Oosterhof, Albert C.; Salisbury, David F. – Educational Measurement: Issues and Practice, 1985
The computer assisted testing (CAT) program at Florida State University's Assessment Resource Center is described. Three measurement issues (test quality, confidence in mastery decisions, and maintenance of test validity) and two instructional design issues (quality of instructional objectives and extended feedback following testing) important for…
Descriptors: Computer Assisted Testing, Educational Objectives, Feedback, Higher Education

Rudner, Lawrence M. – Educational Measurement: Issues and Practice, 1990
Three major pragmatic issues in computerized testing are addressed: (1) encouraging teacher use; (2) reporting of information; and (3) test construction. Reference is made to four related articles. Additional areas for research include reporting of test information; item bank standards; validity; and rules for stopping in computerized testing.…
Descriptors: Computer Assisted Testing, Evaluation Utilization, Item Banks, Research Needs
Wise, Steven L.; Bhola, Dennison S.; Yang, Sheng-Ta – Educational Measurement: Issues and Practice, 2006
The attractiveness of computer-based tests (CBTs) is due largely to their capability to expand the ways we conduct testing. A relatively unexplored application, however, is actively using the computer to reduce construct-irrelevant variance while a test is being administered. This investigation introduces the effort-monitoring CBT, in which the…
Descriptors: Computer Assisted Testing, Test Validity, Reaction Time, Guessing (Tests)

Nelson, Larry R. – Educational Measurement: Issues and Practice, 1984
The author argues that scoring, reporting, and deriving final grades can be considerably assisted by using a computer. He also contends that the savings in time and the computer database formed will allow instructors to determine test quality and reflect on the quality of instruction. (BW)
Descriptors: Achievement Tests, Affective Objectives, Computer Assisted Testing, Educational Testing

Olsen, James B.; And Others – Educational Measurement: Issues and Practice, 1990
The development, school district implementation, and initial validation of the Wicat Skills Assessment Test--a computerized testing system for state assessment objectives--are described. Testing with 10,000 students in grades 3, 5, 7, and 9 began in Texas in 1986-87 and 1987-88. Results support the feasibility for districtwide computerized…
Descriptors: Computer Assisted Testing, Educational Assessment, Elementary School Students, Elementary Secondary Education