ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	10

Descriptor

Computer Assisted Testing	14
Test Validity	14
Test Construction	7
Psychometrics	4
Test Items	4
Mathematics Tests	3
Scoring	3
Achievement Tests	2
Adults	2
Automation	2
Decision Making	2
Educational Assessment	2
Grade 3	2
Grade 5	2
Grade 7	2
Grade 9	2
Guessing (Tests)	2
Reaction Time	2
Surgery	2
Test Interpretation	2
Test Use	2
Academic Language	1
Achievement Gap	1
Adaptive Testing	1
Affective Objectives	1
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	14
Reports - Research	6
Reports - Descriptive	4
Reports - Evaluative	3
Opinion Papers	2
Information Analyses	1

Education Level

Secondary Education	3
Junior High Schools	2
Middle Schools	2
Early Childhood Education	1
Elementary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
High Schools	1
Higher Education	1
Intermediate Grades	1
Postsecondary Education	1
Primary Education	1
More ▼

Audience

Researchers

Location

Texas

Laws, Policies, & Programs

Assessments and Surveys

Program for International…

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Instruction-Tuned Large-Language Models for Quality Control in Automatic Item Generation: A Feasibility Study

Peer reviewed

Direct link

Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025

Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…

Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation

The Impact of the COVID-19 Pandemic on American Board of Surgery's Oral Certifying Exams

Peer reviewed

Direct link

Barry, Carol L.; Jones, Andrew T.; Ibáñez, Beatriz; Grambau, Marni; Buyske, Jo – Educational Measurement: Issues and Practice, 2022

In response to the COVID-19 pandemic, the American Board of Surgery (ABS) shifted from in-person to remote administrations of the oral certifying exam (CE). Although the overall exam architecture remains the same, there are a number of differences in administration and staffing costs, exam content, security concerns, and the tools used to give the…

Descriptors: COVID-19, Pandemics, Computer Assisted Testing, Verbal Tests

Digital Module 18: Automated Scoring

Peer reviewed

Direct link

Lottridge, Sue; Burkhardt, Amy; Boyer, Michelle – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Sue Lottridge, Amy Burkhardt, and Dr. Michelle Boyer provide an overview of automated scoring. Automated scoring is the use of computer algorithms to score unconstrained open-ended test items by mimicking human scoring. The use of automated scoring is increasing in educational assessment programs because it allows…

Descriptors: Computer Assisted Testing, Scoring, Automation, Educational Assessment

The Effect of Drag-and-Drop Item Features on Test-Taker Performance and Response Strategies

Peer reviewed

Direct link

Arslan, Burcu; Jiang, Yang; Keehner, Madeleine; Gong, Tao; Katz, Irvin R.; Yan, Fred – Educational Measurement: Issues and Practice, 2020

Computer-based educational assessments often include items that involve drag-and-drop responses. There are different ways that drag-and-drop items can be laid out and different choices that test developers can make when designing these items. Currently, these decisions are based on experts' professional judgments and design constraints, rather…

Descriptors: Test Items, Computer Assisted Testing, Test Format, Decision Making

The Relationship between Item Developer Alignment of Items to Range Achievement-Level Descriptors and Item Difficulty: Implications for Validating Intended Score Interpretations

Peer reviewed

Direct link

Schneider, M. Christina; Agrimson, Jared; Veazey, Mary – Educational Measurement: Issues and Practice, 2022

This paper presents results of a score interpretation study for a computer adaptive mathematics assessment. The study purpose was to test the efficacy of item developers' alignment of items to Range Achievement-Level Descriptors (RALDs; Egan et al.) against the empirical achievement-level alignment of items to investigate the use of RALDs as the…

Descriptors: Computer Assisted Testing, Mathematics Tests, Scores, Grade 3

Multistage Adaptive Testing Design in International Large-Scale Assessments

Peer reviewed

Direct link

Yamamoto, Kentaro; Shin, Hyo Jeong; Khorramdel, Lale – Educational Measurement: Issues and Practice, 2018

A multistage adaptive testing (MST) design was implemented for the Programme for the International Assessment of Adult Competencies (PIAAC) starting in 2012 for about 40 countries and has been implemented for the 2018 cycle of the Programme for International Student Assessment (PISA) for more than 80 countries. Using examples from PISA and PIAAC,…

Descriptors: International Assessment, Foreign Countries, Achievement Tests, Test Validity

Rapid-Guessing Behavior: Its Identification, Interpretation, and Implications

Peer reviewed

Direct link

Wise, Steven L. – Educational Measurement: Issues and Practice, 2017

The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…

Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time

Examining Effectiveness and Validity of Accommodations for English Language Learners in Mathematics: An Evidence-Based Computer Accommodation Decision System

Peer reviewed

Direct link

Abedi, Jamal; Zhang, Yu; Rowe, Susan E.; Lee, Hansol – Educational Measurement: Issues and Practice, 2020

Research indicates that the performance-gap between English Language Learners (ELLs) and their non-ELL peers is partly due to ELLs' difficulty in understanding assessment language. Accommodations have been shown to narrow this performance-gap, but many accommodations studies have not used a randomized design and are based on relatively small…

Descriptors: English Language Learners, Achievement Gap, Mathematics Tests, Standards

A Process for Reviewing and Evaluating Generated Test Items

Peer reviewed

Direct link

Gierl, Mark J.; Lai, Hollis – Educational Measurement: Issues and Practice, 2016

Testing organization needs large numbers of high-quality items due to the proliferation of alternative test administration methods and modern test designs. But the current demand for items far exceeds the supply. Test items, as they are currently written, evoke a process that is both time-consuming and expensive because each item is written,…

Descriptors: Test Items, Test Construction, Psychometrics, Models

Some Measurement and Instruction Related Considerations Regarding Computer-Assisted Testing.

Peer reviewed

Oosterhof, Albert C.; Salisbury, David F. – Educational Measurement: Issues and Practice, 1985

The computer assisted testing (CAT) program at Florida State University's Assessment Resource Center is described. Three measurement issues (test quality, confidence in mastery decisions, and maintenance of test validity) and two instructional design issues (quality of instructional objectives and extended feedback following testing) important for…

Descriptors: Computer Assisted Testing, Educational Objectives, Feedback, Higher Education

Computer Testing: Pragmatic Issues and Research Needs.

Peer reviewed

Rudner, Lawrence M. – Educational Measurement: Issues and Practice, 1990

Three major pragmatic issues in computerized testing are addressed: (1) encouraging teacher use; (2) reporting of information; and (3) test construction. Reference is made to four related articles. Additional areas for research include reporting of test information; item bank standards; validity; and rules for stopping in computerized testing.…

Descriptors: Computer Assisted Testing, Evaluation Utilization, Item Banks, Research Needs

Taking the Time to Improve the Validity of Low-Stakes Tests: The Effort-Monitoring CBT

Peer reviewed

Direct link

Wise, Steven L.; Bhola, Dennison S.; Yang, Sheng-Ta – Educational Measurement: Issues and Practice, 2006

The attractiveness of computer-based tests (CBTs) is due largely to their capability to expand the ways we conduct testing. A relatively unexplored application, however, is actively using the computer to reduce construct-irrelevant variance while a test is being administered. This investigation introduces the effort-monitoring CBT, in which the…

Descriptors: Computer Assisted Testing, Test Validity, Reaction Time, Guessing (Tests)

Using Microcomputers to Assess Achievement and Instruction.

Peer reviewed

Nelson, Larry R. – Educational Measurement: Issues and Practice, 1984

The author argues that scoring, reporting, and deriving final grades can be considerably assisted by using a computer. He also contends that the savings in time and the computer database formed will allow instructors to determine test quality and reflect on the quality of instruction. (BW)

Descriptors: Achievement Tests, Affective Objectives, Computer Assisted Testing, Educational Testing

Development, Implementation, and Validation of a Computerized Test for Statewide Assessment.

Peer reviewed

Olsen, James B.; And Others – Educational Measurement: Issues and Practice, 1990

The development, school district implementation, and initial validation of the Wicat Skills Assessment Test--a computerized testing system for state assessment objectives--are described. Testing with 10,000 students in grades 3, 5, 7, and 9 began in Texas in 1986-87 and 1987-88. Results support the feasibility for districtwide computerized…

Descriptors: Computer Assisted Testing, Educational Assessment, Elementary School Students, Elementary Secondary Education

Wise, Steven L.	2
Abedi, Jamal	1
Agrimson, Jared	1
Arslan, Burcu	1
Barry, Carol L.	1
Bhola, Dennison S.	1
Boyer, Michelle	1
Burkhardt, Amy	1
Buyske, Jo	1
Gierl, Mark J.	1
Gong, Tao	1
Grambau, Marni	1
Guher Gorgun	1
Ibáñez, Beatriz	1
Jiang, Yang	1
Jones, Andrew T.	1
Katz, Irvin R.	1
Keehner, Madeleine	1
Khorramdel, Lale	1
Lai, Hollis	1
Lee, Hansol	1
Lottridge, Sue	1
Nelson, Larry R.	1
Okan Bulut	1
More ▼