ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	7

Descriptor

Computer Assisted Testing	10
Simulation	10
Test Items	6
Comparative Analysis	4
Item Response Theory	4
Models	4
Scores	4
Evaluation Methods	3
Scoring	3
Statistical Analysis	3
Test Format	3
Adaptive Testing	2
Data Collection	2
English (Second Language)	2
Games	2
Generalization	2
Item Analysis	2
Language Tests	2
Markov Processes	2
Multivariate Analysis	2
Second Language Learning	2
Test Content	2
Algebra	1
Antisocial Behavior	1
Bayesian Statistics	1

Source

ETS Research Report Series

Publication Type

Journal Articles	10
Reports - Research	10
Tests/Questionnaires	1

Assessments and Surveys

Test of English as a Foreign…

Showing all 10 results

Taming Log Files from Game/Simulation-Based Assessments: Data Models and Data Analysis Tools. Research Report. ETS RR-16-10

Peer reviewed
PDF on ERIC

Hao, Jiangang; Smith, Lawrence; Mislevy, Robert; von Davier, Alina; Bauer, Malcolm – ETS Research Report Series, 2016

Extracting information efficiently from game/simulation-based assessment (G/SBA) logs requires two things: a well-structured log file and a set of analysis methods. In this report, we propose a generic data model specified as an extensible markup language (XML) schema for the log files of G/SBAs. We also propose a set of analysis methods for…

Descriptors: Evaluation Methods, Games, Computer Assisted Testing, Data Collection

Statistical Methods for Assessments in Simulations and Serious Games. Research Report. ETS RR-14-12

Peer reviewed
PDF on ERIC

Fu, Jianbin; Zapata, Diego; Mavronikolas, Elia – ETS Research Report Series, 2014

Simulation or game-based assessments produce outcome data and process data. In this article, some statistical models that can potentially be used to analyze data from simulation or game-based assessments are introduced. Specifically, cognitive diagnostic models that can be used to estimate latent skills from outcome data so as to scale these…

Descriptors: Simulation, Evaluation Methods, Games, Data Collection

An Investigation of the Impact of Misrouting under Two-Stage Multistage Testing: A Simulation Study. Research Report. ETS RR-14-01

Peer reviewed
PDF on ERIC

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2014

The purpose of this study was to investigate the potential impact of misrouting under a 2-stage multistage test (MST) design, which includes 1 routing and 3 second-stage modules. Simulations were used to create a situation in which a large group of examinees took each of the 3 possible MST paths (high, middle, and low). We compared differences in…

Descriptors: Comparative Analysis, Difficulty Level, Scores, Test Wiseness

An Item-Driven Adaptive Design for Calibrating Pretest Items. Research Report. ETS RR-14-38

Peer reviewed
PDF on ERIC

Ali, Usama S.; Chang, Hua-Hua – ETS Research Report Series, 2014

Adaptive testing is advantageous in that it provides more efficient ability estimates with fewer items than linear testing does. Item-driven adaptive pretesting may also offer similar advantages, and verification of such a hypothesis about item calibration was the main objective of this study. A suitability index (SI) was introduced to adaptively…

Descriptors: Adaptive Testing, Simulation, Pretests Posttests, Test Items

From Biology to Education: Scoring and Clustering Multilingual Text Sequences and Other Sequential Tasks. Research Report. ETS RR-12-25

Peer reviewed
PDF on ERIC

Sukkarieh, Jane Z.; von Davier, Matthias; Yamamoto, Kentaro – ETS Research Report Series, 2012

This document describes a solution to a problem in the automatic content scoring of the multilingual character-by-character highlighting item type. This solution is language independent and represents a significant enhancement. This solution not only facilitates automatic scoring but plays an important role in clustering students' responses;…

Descriptors: Scoring, Multilingualism, Test Items, Role

Investigating the Suitability of Implementing the "e-rater"® Scoring Engine in a Large-Scale English Language Testing Program. Research Report. ETS RR-13-36

Peer reviewed
PDF on ERIC

Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013

In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…

Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests

Severity of Organized Item Theft in Computerized Adaptive Testing: An Empirical Study. Research Report. ETS RR-06-22

Peer reviewed
PDF on ERIC

Yi, Qing; Zhang, Jinming; Chang, Hua-Hua – ETS Research Report Series, 2006

Chang and Zhang (2002, 2003) proposed several baseline criteria for assessing the severity of possible test security violations for computerized tests with high-stakes outcomes. However, these criteria were obtained from theoretical derivations that assumed uniformly randomized item selection. The current study investigated potential damage caused…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Computer Security

Comparison of Multistage Tests with Computerized Adaptive and Paper-and-Pencil Tests. Research Report. ETS RR-07-04

Peer reviewed
PDF on ERIC

Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007

Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…

Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models

A Comparison of Two Procedures for Constrained Adaptive Test Construction. Research Report. ETS RR-04-39

Peer reviewed
PDF on ERIC

Robin, Frédéric; van der Linden, Wim J.; Eignor, Daniel R.; Steffen, Manfred; Stocking, Martha L. – ETS Research Report Series, 2005

The relatively new shadow test approach (STA) to computerized adaptive testing (CAT) proposed by Wim van der Linden is a potentially attractive alternative to the weighted deviation algorithm (WDA) implemented at ETS. However, it has not been evaluated under testing conditions representative of current ETS testing programs. Of interest was whether…

Descriptors: Test Construction, Computer Assisted Testing, Simulation, Evaluation Methods

A General Diagnostic Model Applied to Language Testing Data. Research Report. ETS RR-05-16

Peer reviewed
PDF on ERIC

von Davier, Matthias – ETS Research Report Series, 2005

Probabilistic models with more than one latent variable are designed to report profiles of skills or cognitive attributes. Testing programs want to offer additional information beyond what a single test score can provide using these skill profiles. Many recent approaches to skill profile models are limited to dichotomous data and have made use of…

Descriptors: Models, Diagnostic Tests, Language Tests, Language Proficiency

Author

Chang, Hua-Hua	2
Steffen, Manfred	2
von Davier, Matthias	2
Ali, Usama S.	1
Bauer, Malcolm	1
Breyer, F. Jay	1
Eignor, Daniel R.	1
Fu, Jianbin	1
Hao, Jiangang	1
Kim, Sooyeon	1
Lorenz, Florian	1
Mavronikolas, Elia	1
Mislevy, Robert	1
Moses, Tim	1
Patsula, Liane	1
Rizavi, Saba	1
Robin, Frédéric	1
Rotou, Ourania	1
Smith, Lawrence	1
Stocking, Martha L.	1
Sukkarieh, Jane Z.	1
Yamamoto, Kentaro	1
Yi, Qing	1
Zapata, Diego	1
Zhang, Jinming	1