Publication Date
  In 2025: 0
  Since 2024: 0
  Since 2021 (last 5 years): 0
  Since 2016 (last 10 years): 4
  Since 2006 (last 20 years): 8
Descriptor
  Computer Assisted Testing: 10
  Item Response Theory: 10
  Adaptive Testing: 5
  Test Items: 5
  Psychometrics: 4
  Scoring: 4
  Simulation: 4
  Evaluation Methods: 3
  Comparative Analysis: 2
  Correlation: 2
  Item Analysis: 2
Source
  International Journal of…: 10
Publication Type
  Journal Articles: 10
  Reports - Research: 6
  Reports - Evaluative: 3
  Reports - Descriptive: 1
Education Level
  Secondary Education: 1
Morris, Scott B.; Bass, Michael; Howard, Elizabeth; Neapolitan, Richard E. – International Journal of Testing, 2020
The standard error (SE) stopping rule, which terminates a computer adaptive test (CAT) when the "SE" is less than a threshold, is effective when there are informative questions for all trait levels. However, in domains such as patient-reported outcomes, the items in a bank might all target one end of the trait continuum (e.g., negative…
Descriptors: Computer Assisted Testing, Adaptive Testing, Item Banks, Item Response Theory
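The SE stopping rule referred to in the Morris et al. abstract can be stated in a few lines. The following is a minimal Python sketch; the threshold, length cap, and function name are illustrative assumptions, not the authors' implementation:

```python
import math

def se_stopping_rule(item_informations, se_threshold=0.3, max_items=30):
    """Sketch of an SE stopping rule for a CAT: stop once the standard error
    of the provisional trait estimate drops below `se_threshold`, with a
    length cap as a safety net.

    `item_informations` holds the Fisher information of each administered
    item at the current trait estimate; for a maximum-likelihood estimate,
    SE(theta) is approximately 1 / sqrt(sum of item information).
    """
    total_information = sum(item_informations)
    if total_information <= 0.0:
        return False  # no informative items administered yet, keep testing
    standard_error = 1.0 / math.sqrt(total_information)
    return standard_error < se_threshold or len(item_informations) >= max_items
```

When the item bank only covers one end of the trait continuum, the SE may never reach the threshold for some examinees, which is the situation the article addresses.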
Moon, Jung Aa; Sinharay, Sandip; Keehner, Madeleine; Katz, Irvin R. – International Journal of Testing, 2020
The current study examined the relationship between test-taker cognition and psychometric item properties in multiple-selection multiple-choice and grid items. In a study with content-equivalent mathematics items in alternative item formats, adult participants' tendency to respond to an item was affected by the presence of a grid and variations of…
Descriptors: Computer Assisted Testing, Multiple Choice Tests, Test Wiseness, Psychometrics
Wind, Stefanie A.; Wolfe, Edward W.; Engelhard, George, Jr.; Foltz, Peter; Rosenstein, Mark – International Journal of Testing, 2018
Automated essay scoring engines (AESEs) are becoming increasingly popular as an efficient method for performance assessments in writing, including many language assessments that are used worldwide. Before they can be used operationally, AESEs must be "trained" using machine-learning techniques that incorporate human ratings. However, the…
Descriptors: Computer Assisted Testing, Essay Tests, Writing Evaluation, Scoring
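The "training" step the Wind et al. abstract mentions amounts to fitting a model from essay features to human ratings. A toy sketch follows (ordinary least squares in NumPy), offered only as an illustration of the idea, not any particular engine's method:

```python
import numpy as np

def train_scoring_model(features, human_scores):
    """Least-squares fit from essay features to human ratings -- a toy stand-in
    for the machine-learning step used to train an automated essay scoring
    engine. `features` is an (n_essays, n_features) array and `human_scores`
    is a length-n_essays vector of rater scores."""
    X = np.column_stack([np.ones(len(features)), np.asarray(features)])
    weights, *_ = np.linalg.lstsq(X, np.asarray(human_scores), rcond=None)
    return weights  # intercept followed by one weight per feature

def predict_score(weights, essay_features):
    """Score a new essay with the fitted weights."""
    return float(weights[0] + np.dot(weights[1:], essay_features))
```

The quality of such a model is bounded by the quality of the human ratings used to train it, which is the concern the article raises.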
Shermis, Mark D.; Mao, Liyang; Mulholland, Matthew; Kieftenbeld, Vincent – International Journal of Testing, 2017
This study uses the feature sets employed by two automated scoring engines to determine if a "linguistic profile" could be formulated that would help identify items that are likely to exhibit differential item functioning (DIF) based on linguistic features. Sixteen items were administered to 1200 students where demographic information…
Descriptors: Computer Assisted Testing, Scoring, Hypothesis Testing, Essays
Stark, Stephen; Chernyshenko, Oleksandr S. – International Journal of Testing, 2011
This article delves into a relatively unexplored area of measurement by focusing on adaptive testing with unidimensional pairwise preference items. The use of such tests is becoming more common in applied non-cognitive assessment because research suggests that this format may help to reduce certain types of rater error and response sets commonly…
Descriptors: Test Length, Simulation, Adaptive Testing, Item Analysis
Makransky, Guido; Glas, Cees A. W. – International Journal of Testing, 2013
Cognitive ability tests are widely used in organizations around the world because they have high predictive validity in selection contexts. Although these tests typically measure several subdomains, testing is usually carried out for a single subdomain at a time. This can be ineffective when the subdomains assessed are highly correlated. This…
Descriptors: Foreign Countries, Cognitive Ability, Adaptive Testing, Feedback (Response)
Schmitt, T. A.; Sass, D. A.; Sullivan, J. R.; Walker, C. M. – International Journal of Testing, 2010
Imposed time limits on computer adaptive tests (CATs) can result in examinees having difficulty completing all items, thus compromising the validity and reliability of ability estimates. In this study, the effects of speededness were explored in a simulated CAT environment by varying examinee response patterns to end-of-test items. Expectedly,…
Descriptors: Monte Carlo Methods, Simulation, Computer Assisted Testing, Adaptive Testing
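The speededness manipulation Schmitt et al. describe (altering responses to end-of-test items) can be illustrated with a short sketch; the replacement scheme and guessing probability below are assumptions for illustration, not the study's actual design:

```python
import random

def apply_speededness(responses, n_speeded, guess_prob=0.25, seed=None):
    """Replace the last `n_speeded` item responses with random guesses that are
    correct with probability `guess_prob`, mimicking examinees who run out of
    time and guess on end-of-test items in a timed CAT."""
    rng = random.Random(seed)
    speeded = list(responses)
    start = max(0, len(speeded) - n_speeded)
    for i in range(start, len(speeded)):
        speeded[i] = 1 if rng.random() < guess_prob else 0
    return speeded

# Example: a 20-item response vector with the last 5 items answered under speed.
original = [1] * 20
print(apply_speededness(original, n_speeded=5, seed=42))
```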
Brown, Richard S.; Villarreal, Julio C. – International Journal of Testing, 2007
There has been considerable research regarding the extent to which psychometrically sound assessments sometimes yield individual score estimates that are inconsistent with the response patterns of the individual. It has been suggested that individual response patterns may differ from expectations for a number of reasons, including subject motivation,…
Descriptors: Psychometrics, Test Bias, Testing, Simulation
Rupp, Andre A. – International Journal of Testing, 2003
Item response theory (IRT) has become one of the most popular scoring frameworks for measurement data. IRT models are used frequently in computerized adaptive testing, cognitively diagnostic assessment, and test equating. This article reviews two of the most popular software packages for IRT model estimation, BILOG-MG (Zimowski, Muraki, Mislevy, &…
Descriptors: Test Items, Adaptive Testing, Item Response Theory, Computer Software
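For readers unfamiliar with the models these packages estimate, the three-parameter logistic (3PL) item response function is a compact reference point. The sketch below is the generic textbook form, not anything specific to BILOG-MG or the other software reviewed:

```python
import math

def prob_correct_3pl(theta, a, b, c, scale=1.0):
    """Three-parameter logistic (3PL) IRT model: the probability that an
    examinee with trait level `theta` answers an item correctly, given its
    discrimination `a`, difficulty `b`, and pseudo-guessing parameter `c`.
    Set `scale` to 1.7 if the normal-ogive scaling constant is wanted."""
    return c + (1.0 - c) / (1.0 + math.exp(-scale * a * (theta - b)))

# Example: an average-ability examinee on a moderately difficult item.
print(round(prob_correct_3pl(theta=0.0, a=1.2, b=0.5, c=0.2), 3))
```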
Levy, Roy; Mislevy, Robert J. – International Journal of Testing, 2004
The challenges of modeling students' performance in computer-based interactive assessments include accounting for multiple aspects of knowledge and skill that arise in different situations and the conditional dependencies among multiple aspects of performance. This article describes a Bayesian approach to modeling and estimating cognitive models…
Descriptors: Computer Assisted Testing, Markov Processes, Computer Networks, Bayesian Statistics
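As a hedged illustration of the conditional dependencies such Bayesian models encode (a toy example, not Levy and Mislevy's network), the probability of a correct observable response can be made to depend jointly on several latent skills:

```python
# Toy conditional probability table: P(correct response | latent skill states).
# The skill labels and probabilities are invented for illustration only.
P_CORRECT = {
    (True, True): 0.90,   # both skills mastered
    (True, False): 0.55,  # only skill 1 mastered
    (False, True): 0.50,  # only skill 2 mastered
    (False, False): 0.15, # neither skill mastered
}

def p_correct(skill_1_mastered: bool, skill_2_mastered: bool) -> float:
    """Look up the conditional probability of a correct response."""
    return P_CORRECT[(skill_1_mastered, skill_2_mastered)]

# Marginal probability of a correct response if each skill is mastered with
# probability 0.5, independently (a Bayesian network can relax this independence).
marginal = sum(0.25 * p for p in P_CORRECT.values())
print(p_correct(True, False), round(marginal, 3))
```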