ERIC - Search Results

Showing 1 to 15 of 37 results

Dynamic Multistage Testing: A Highly Efficient and Regulated Adaptive Testing Method

Peer reviewed

Luo, Xiao; Wang, Xinrui – International Journal of Testing, 2019

This study introduced dynamic multistage testing (dy-MST) as an improvement to existing adaptive testing methods. dy-MST combines the advantages of computerized adaptive testing (CAT) and computerized adaptive multistage testing (ca-MST) to create a highly efficient and regulated adaptive testing method. In the test construction phase, multistage…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Construction, Psychometrics

Item Parameter Drift in Computer Adaptive Testing Due to Lack of Content Knowledge

Peer reviewed

Aksu Dunya, Beyza – International Journal of Testing, 2018

This study was conducted to analyze potential item parameter drift (IPD) impact on person ability estimates and classification accuracy when drift affects an examinee subgroup. Using a series of simulations, three factors were manipulated: (a) percentage of IPD items in the CAT exam, (b) percentage of examinees affected by IPD, and (c) item pool…

Descriptors: Adaptive Testing, Classification, Accuracy, Computer Assisted Testing

A Comparison of Methods for Detecting Examinee Preknowledge of Items

Peer reviewed

Wang, Xi; Liu, Yang; Robin, Frederic; Guo, Hongwen – International Journal of Testing, 2019

In an on-demand testing program, some items are repeatedly used across test administrations. This poses a risk to test security. In this study, we considered a scenario wherein a test was divided into two subsets: one consisting of secure items and the other consisting of possibly compromised items. In a simulation study of multistage adaptive…

Descriptors: Identification, Methods, Test Items, Cheating

Generating Reading Comprehension Items Using Automated Processes

Peer reviewed

Shin, Jinnie; Gierl, Mark J. – International Journal of Testing, 2022

Over the last five years, tremendous strides have been made in advancing the AIG methodology required to produce items in diverse content areas. However, the one content area where enormous problems remain unsolved is language arts, generally, and reading comprehension, more specifically. While reading comprehension test items can be created using…

Descriptors: Reading Comprehension, Test Construction, Test Items, Natural Language Processing

Investigating Technology-Enhanced Item Formats Using Cognitive and Item Response Theory Approaches

Peer reviewed

Moon, Jung Aa; Sinharay, Sandip; Keehner, Madeleine; Katz, Irvin R. – International Journal of Testing, 2020

The current study examined the relationship between test-taker cognition and psychometric item properties in multiple-selection multiple-choice and grid items. In a study with content-equivalent mathematics items in alternative item formats, adult participants' tendency to respond to an item was affected by the presence of a grid and variations of…

Descriptors: Computer Assisted Testing, Multiple Choice Tests, Test Wiseness, Psychometrics

ITC Guidelines for Translating and Adapting Tests (Second Edition)

Peer reviewed

International Journal of Testing, 2018

The second edition of the International Test Commission Guidelines for Translating and Adapting Tests was prepared between 2005 and 2015 to improve upon the first edition, and to respond to advances in testing technology and practices. The 18 guidelines are organized into six categories to facilitate their use: pre-condition (3), test development…

Descriptors: Translation, Test Construction, Testing, Scoring

FIPC Linking across Multidimensional Test Forms: Effects of Confounding Difficulty within Dimensions

Peer reviewed

Kim, Sohee; Cole, Ki Lynn; Mwavita, Mwarumba – International Journal of Testing, 2018

This study investigated the effects of linking potentially multidimensional test forms using the fixed item parameter calibration. Forms had equal or unequal total test difficulty with and without confounding difficulty. The mean square errors and bias of estimated item and ability parameters were compared across the various confounding tests. The…

Descriptors: Test Items, Item Response Theory, Test Format, Difficulty Level

ITC Guidelines for the Large-Scale Assessment of Linguistically and Culturally Diverse Populations

Peer reviewed

International Journal of Testing, 2019

These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…

Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage

Examining Provision and Sufficiency of Testing Accommodations for English Learners

Peer reviewed

Roschmann, Sarina; Witmer, Sara E.; Volker, Martin A. – International Journal of Testing, 2021

Accommodations are commonly provided to address language-related barriers students may experience during testing. Research on the validity of scores from accommodated test administrations remains somewhat inconclusive. The current study investigated item response patterns to understand whether accommodations, as used in practice among English…

Descriptors: Testing Accommodations, English Language Learners, Scores, Item Response Theory

Use of Automated Scoring Features to Generate Hypotheses Regarding Language-Based DIF

Peer reviewed

Shermis, Mark D.; Mao, Liyang; Mulholland, Matthew; Kieftenbeld, Vincent – International Journal of Testing, 2017

This study uses the feature sets employed by two automated scoring engines to determine if a "linguistic profile" could be formulated that would help identify items that are likely to exhibit differential item functioning (DIF) based on linguistic features. Sixteen items were administered to 1200 students where demographic information…

Descriptors: Computer Assisted Testing, Scoring, Hypothesis Testing, Essays

Using Out-of-Level Items in Computerized Adaptive Testing

Peer reviewed

Wei, Hua; Lin, Jie – International Journal of Testing, 2015

Out-of-level testing refers to the practice of assessing a student with a test that is intended for students at a higher or lower grade level. Although the appropriateness of out-of-level testing for accountability purposes has been questioned by educators and policymakers, incorporating out-of-level items in formative assessments for accurate…

Descriptors: Test Items, Computer Assisted Testing, Adaptive Testing, Instructional Program Divisions

Examining Test Speededness by Native Language

Peer reviewed

Talento-Miller, Eileen; Guo, Fanmin; Han, Kyung T. – International Journal of Testing, 2013

When power tests include a time limit, it is important to assess the possibility of speededness for examinees. Past research on differential speededness has examined gender and ethnic subgroups in the United States on paper and pencil tests. When considering the needs of a global audience, research regarding different native language speakers is…

Descriptors: Adaptive Testing, Computer Assisted Testing, English, Scores

Item Calibration Samples and the Stability of Achievement Estimates and System Rankings: Another Look at the PISA Model

Peer reviewed

Rutkowski, Leslie; Rutkowski, David; Zhou, Yan – International Journal of Testing, 2016

Using an empirically-based simulation study, we show that typically used methods of choosing an item calibration sample have significant impacts on achievement bias and system rankings. We examine whether recent PISA accommodations, especially for lower performing participants, can mitigate some of this bias. Our findings indicate that standard…

Descriptors: Simulation, International Programs, Adolescents, Student Evaluation

Investigating Test-Taking Behaviors Using Timing and Process Data

Peer reviewed

Lee, Yi-Hsuan; Haberman, Shelby J. – International Journal of Testing, 2016

The use of computer-based assessments makes the collection of detailed data that capture examinees' progress in the tests and time spent on individual actions possible. This article presents a study using process and timing data to aid understanding of an international language assessment and the examinees. Issues regarding test-taking strategies,…

Descriptors: Computer Assisted Testing, Test Wiseness, Language Tests, International Assessment

Multiple-Group Noncompensatory Differential Item Functioning in Raju's Differential Functioning of Items and Tests

Peer reviewed

Oshima, T. C.; Wright, Keith; White, Nick – International Journal of Testing, 2015

Raju, van der Linden, and Fleer (1995) introduced a framework for differential functioning of items and tests (DFIT) for unidimensional dichotomous models. Since then, DFIT has been shown to be a quite versatile framework as it can handle polytomous as well as multidimensional models both at the item and test levels. However, DFIT is still limited…

Descriptors: Test Bias, Item Response Theory, Test Items, Simulation
