ERIC - Search Results

Showing 1 to 15 of 29 results

Online Calibration in Multidimensional Computerized Adaptive Testing with Polytomously Scored Items

Peer reviewed

Yuan, Lu; Huang, Yingshi; Li, Shuhang; Chen, Ping – Journal of Educational Measurement, 2023

Online calibration is a key technology for item calibration in computerized adaptive testing (CAT) and has been widely used in various forms of CAT, including unidimensional CAT, multidimensional CAT (MCAT), CAT with polytomously scored items, and cognitive diagnostic CAT. However, as multidimensional and polytomous assessment data become more…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Test Items

A Comparison of Final Scoring Methods under the Multistage Adaptive Testing Framework

Karamese, Hacer – ProQuest LLC, 2022

Multistage adaptive testing (MST) has become popular in the testing industry because the research has shown that it combines the advantages of both linear tests and item-level computer adaptive testing (CAT). The previous research efforts primarily focused on MST design issues such as panel design, module length, test length, distribution of test…

Descriptors: Adaptive Testing, Scoring, Computer Assisted Testing, Design

A Fair Comparison of the Performance of Computerized Adaptive Testing and Multistage Adaptive Testing

Wang, Keyin – ProQuest LLC, 2017

The comparison of item-level computerized adaptive testing (CAT) and multistage adaptive testing (MST) has been researched extensively (e.g., Kim & Plake, 1993; Luecht et al., 1996; Patsula, 1999; Jodoin, 2003; Hambleton & Xing, 2006; Keng, 2008; Zheng, 2012). Various CAT and MST designs have been investigated and compared under the same…

Descriptors: Comparative Analysis, Computer Assisted Testing, Adaptive Testing, Test Items

An Item-Driven Adaptive Design for Calibrating Pretest Items. Research Report. ETS RR-14-38

Peer reviewed

Ali, Usama S.; Chang, Hua-Hua – ETS Research Report Series, 2014

Adaptive testing is advantageous in that it provides more efficient ability estimates with fewer items than linear testing does. Item-driven adaptive pretesting may also offer similar advantages, and verification of such a hypothesis about item calibration was the main objective of this study. A suitability index (SI) was introduced to adaptively…

Descriptors: Adaptive Testing, Simulation, Pretests Posttests, Test Items

Curtailment and Stochastic Curtailment to Shorten the CES-D

Peer reviewed

Finkelman, Matthew D.; Smits, Niels; Kim, Wonsuk; Riley, Barth – Applied Psychological Measurement, 2012

The Center for Epidemiologic Studies-Depression (CES-D) scale is a well-known self-report instrument that is used to measure depressive symptomatology. Respondents who take the full-length version of the CES-D are administered a total of 20 items. This article investigates the use of curtailment and stochastic curtailment (SC), two sequential…

Descriptors: Measures (Individuals), Depression (Psychology), Test Length, Computer Assisted Testing

Computerized Adaptive Testing with the Zinnes and Griggs Pairwise Preference Ideal Point Model

Peer reviewed

Stark, Stephen; Chernyshenko, Oleksandr S. – International Journal of Testing, 2011

This article delves into a relatively unexplored area of measurement by focusing on adaptive testing with unidimensional pairwise preference items. The use of such tests is becoming more common in applied non-cognitive assessment because research suggests that this format may help to reduce certain types of rater error and response sets commonly…

Descriptors: Test Length, Simulation, Adaptive Testing, Item Analysis

From Biology to Education: Scoring and Clustering Multilingual Text Sequences and Other Sequential Tasks. Research Report. ETS RR-12-25

Peer reviewed

Sukkarieh, Jane Z.; von Davier, Matthias; Yamamoto, Kentaro – ETS Research Report Series, 2012

This document describes a solution to a problem in the automatic content scoring of the multilingual character-by-character highlighting item type. This solution is language independent and represents a significant enhancement. This solution not only facilitates automatic scoring but plays an important role in clustering students' responses;…

Descriptors: Scoring, Multilingualism, Test Items, Role

Investigating the Suitability of Implementing the "e-rater"® Scoring Engine in a Large-Scale English Language Testing Program. Research Report. ETS RR-13-36

Peer reviewed

Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013

In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…

Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests

Handbook of Research on Technology Tools for Real-World Skill Development (2 Volumes)

Peer reviewed

Rosen, Yigel, Ed.; Ferrara, Steve, Ed.; Mosharraf, Maryam, Ed. – IGI Global, 2016

Education is expanding to include a stronger focus on the practical application of classroom lessons in an effort to prepare the next generation of scholars for a changing world economy centered on collaborative and problem-solving skills for the digital age. "The Handbook of Research on Technology Tools for Real-World Skill Development"…

Descriptors: Technological Literacy, Technology Uses in Education, Problem Solving, Skill Development

Implementing ICAO Language Proficiency Requirements in the Versant Aviation English Test

Peer reviewed

Van Moere, Alistair; Suzuki, Masanori; Downey, Ryan; Cheng, Jian – Australian Review of Applied Linguistics, 2009

This paper discusses the development of an assessment to satisfy the International Civil Aviation Organization (ICAO) Language Proficiency Requirements. The Versant Aviation English Test utilizes speech recognition technology and a computerized testing platform, such that test administration and scoring are fully automated. Developed in…

Descriptors: Scoring, Test Construction, Language Proficiency, Standards

A Review of Item Exposure Control Strategies for Computerized Adaptive Testing Developed from 1983 to 2005

Peer reviewed

Georgiadou, Elissavet; Triantafillou, Evangelos; Economides, Anastasios A. – Journal of Technology, Learning, and Assessment, 2007

Since researchers acknowledged the several advantages of computerized adaptive testing (CAT) over traditional linear test administration, the issue of item exposure control has received increased attention. Due to CAT's underlying philosophy, particular items in the item pool may be presented too often and become overexposed, while other items are…

Descriptors: Adaptive Testing, Computer Assisted Testing, Scoring, Test Items

Polytomous Modeling of Cognitive Errors in Computer Adaptive Testing.

Peer reviewed

Wang, LihShing; Li, Chun-Shan – Journal of Applied Measurement, 2001

Used Monte Carlo simulation to compare the relative measurement efficiency of polytomous modeling and dichotomous modeling under different scoring schemes and termination criteria. Results suggest that polytomous computerized adaptive testing (CAT) yields marginal gains over dichotomous CAT when termination criteria are more stringent. Discusses…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Monte Carlo Methods

"Mental Model" Comparison of Automated and Human Scoring.

Peer reviewed

Williamson, David M.; Bejar, Isaac I.; Hone, Anne S. – Journal of Educational Measurement, 1999

Contrasts "mental models" used by automated scoring for the simulation division of the computerized Architect Registration Examination with those used by experienced human graders for 3,613 candidate solutions. Discusses differences in the models used and the potential of automated scoring to enhance the validity evidence of scores. (SLD)

Descriptors: Architects, Comparative Analysis, Computer Assisted Testing, Judges

A Sharing Item Response Theory Model for Computerized Adaptive Testing

Peer reviewed

Segall, Daniel O. – Journal of Educational and Behavioral Statistics, 2004

A new sharing item response theory (SIRT) model is presented that explicitly models the effects of sharing item content between informants and test takers. This model is used to construct adaptive item selection and scoring rules that provide increased precision and reduced score gains in instances where sharing occurs. The adaptive item selection…

Descriptors: Scoring, Item Analysis, Item Response Theory, Adaptive Testing

Development of Automated Scoring Algorithms for Complex Performance Assessments: A Comparison of Two Approaches.

Peer reviewed

Clauser, Brian E.; Margolis, Melissa J.; Clyman, Stephen G.; Ross, Linette P. – Journal of Educational Measurement, 1997

Research on automated scoring is extended by comparing alternative automated systems for scoring a computer simulation of physicians' patient management skills. A regression-based system is more highly correlated with experts' evaluations than a system that uses complex rules to map performances into score levels, but both approaches are feasible.…

Descriptors: Algorithms, Automation, Comparative Analysis, Computer Assisted Testing
