ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	3

Descriptor

Computer Assisted Testing	5
Scoring	5
Item Response Theory	4
Test Items	3
Adaptive Testing	2
Evaluation Methods	2
Item Analysis	2
Psychometrics	2
Accuracy	1
Computer Software	1
Computer Use	1
Correlation	1
Design	1
Earth Science	1
Equations (Mathematics)	1
Essay Tests	1
Essays	1
Evaluation Research	1
Evaluators	1
Formative Evaluation	1
Gender Differences	1
Hypothesis Testing	1
Interrater Reliability	1
Language Usage	1
Language Variation	1
More ▼

Source

International Journal of…

Author

Bauer, Malcolm	1
Behrens, John T.	1
Chernyshenko, Oleksandr S.	1
DeMark, Sarah F.	1
Engelhard, George, Jr.	1
Foltz, Peter	1
Kieftenbeld, Vincent	1
Mao, Liyang	1
Mislevy, Robert J.	1
Mulholland, Matthew	1
Rosenstein, Mark	1
Rupp, Andre A.	1
Shermis, Mark D.	1
Stark, Stephen	1
Steinberg, Linda S.	1
Williamson, David M.	1
Wind, Stefanie A.	1
Wolfe, Edward W.	1
More ▼

Publication Type

Journal Articles	5
Reports - Research	3
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Secondary Education

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 5 results Save | Export

The Influence of Rater Effects in Training Sets on the Psychometric Quality of Automated Scoring for Writing Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Wolfe, Edward W.; Engelhard, George, Jr.; Foltz, Peter; Rosenstein, Mark – International Journal of Testing, 2018

Automated essay scoring engines (AESEs) are becoming increasingly popular as an efficient method for performance assessments in writing, including many language assessments that are used worldwide. Before they can be used operationally, AESEs must be "trained" using machine-learning techniques that incorporate human ratings. However, the…

Descriptors: Computer Assisted Testing, Essay Tests, Writing Evaluation, Scoring

Use of Automated Scoring Features to Generate Hypotheses Regarding Language-Based DIF

Peer reviewed

Direct link

Shermis, Mark D.; Mao, Liyang; Mulholland, Matthew; Kieftenbeld, Vincent – International Journal of Testing, 2017

This study uses the feature sets employed by two automated scoring engines to determine if a "linguistic profile" could be formulated that would help identify items that are likely to exhibit differential item functioning (DIF) based on linguistic features. Sixteen items were administered to 1200 students where demographic information…

Descriptors: Computer Assisted Testing, Scoring, Hypothesis Testing, Essays

Computerized Adaptive Testing with the Zinnes and Griggs Pairwise Preference Ideal Point Model

Peer reviewed

Direct link

Stark, Stephen; Chernyshenko, Oleksandr S. – International Journal of Testing, 2011

This article delves into a relatively unexplored area of measurement by focusing on adaptive testing with unidimensional pairwise preference items. The use of such tests is becoming more common in applied non-cognitive assessment because research suggests that this format may help to reduce certain types of rater error and response sets commonly…

Descriptors: Test Length, Simulation, Adaptive Testing, Item Analysis

Item Response Modeling with BILOG-MG and MULTILOG for Windows

Peer reviewed

Direct link

Rupp, Andre A. – International Journal of Testing, 2003

Item response theory (IRT) has become one of the most popular scoring frameworks for measurement data. IRT models are used frequently in computerized adaptive testing, cognitively diagnostic assessment, and test equating. This article reviews two of the most popular software packages for IRT model estimation, BILOG-MG (Zimowski, Muraki, Mislevy, &…

Descriptors: Test Items, Adaptive Testing, Item Response Theory, Computer Software

Design Rationale for a Complex Performance Assessment

Peer reviewed

Direct link

Williamson, David M.; Bauer, Malcolm; Steinberg, Linda S.; Mislevy, Robert J.; Behrens, John T.; DeMark, Sarah F. – International Journal of Testing, 2004

In computer-based interactive environments meant to support learning, students must bring a wide range of relevant knowledge, skills, and abilities to bear jointly as they solve meaningful problems in a learning domain. To function effectively as an assessment, a computer system must additionally be able to evoke and interpret observable evidence…

Descriptors: Computer Assisted Testing, Psychometrics, Task Analysis, Performance Based Assessment