ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	13

Descriptor

Statistical Analysis	13
Test Items	13
Foreign Countries	6
Test Bias	5
Scores	4
College Entrance Examinations	3
English	3
Item Analysis	3
Item Response Theory	3
Models	3
Psychometrics	3
Achievement Tests	2
Cheating	2
Classification	2
Comparative Analysis	2
Computer Software	2
Correlation	2
Educational Assessment	2
Goodness of Fit	2
High School Students	2
High Stakes Tests	2
Identification	2
Mathematics Tests	2
Measurement	2
Methods	2
More ▼

Source

International Journal of…

Publication Type

Journal Articles	13
Reports - Research	11
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Higher Education	6
Postsecondary Education	4
Elementary Secondary Education	2
Elementary Education	1
High Schools	1
Preschool Education	1
Secondary Education	1

Audience

Location

Canada	2
Israel	1
Saudi Arabia	1
Sweden	1
United Kingdom (England)	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)

What Works Clearinghouse Rating

Showing all 13 results Save | Export

A Comparison of Methods for Detecting Examinee Preknowledge of Items

Peer reviewed

Direct link

Wang, Xi; Liu, Yang; Robin, Frederic; Guo, Hongwen – International Journal of Testing, 2019

In an on-demand testing program, some items are repeatedly used across test administrations. This poses a risk to test security. In this study, we considered a scenario wherein a test was divided into two subsets: one consisting of secure items and the other consisting of possibly compromised items. In a simulation study of multistage adaptive…

Descriptors: Identification, Methods, Test Items, Cheating

Examining the Impact of Covariates on Anchor Tests to Ascertain Quality over Time in a College Admissions Test

Peer reviewed

Direct link

Wiberg, Marie; von Davier, Alina A. – International Journal of Testing, 2017

We propose a comprehensive procedure for the implementation of a quality control process of anchor tests for a college admissions test with multiple consecutive administrations. We propose to examine the anchor tests and their items in connection with covariates to investigate if there was any unusual behavior in the anchor test results over time…

Descriptors: College Entrance Examinations, Test Items, Equated Scores, Quality Control

Differential Distractor Functioning as a Method for Explaining DIF: The Case of a National Admissions Test in Saudi Arabia

Peer reviewed

Direct link

Tsaousis, Ioannis; Sideridis, Georgios; Al-Saawi, Fahad – International Journal of Testing, 2018

The aim of the present study was to examine Differential Distractor Functioning (DDF) as a means of improving the quality of a measure through understanding biased responses across groups. A DDF analysis could shed light on the potential sources of construct-irrelevant variance by examining whether the differential selection of incorrect choices…

Descriptors: Foreign Countries, College Entrance Examinations, Test Bias, Chemistry

An Algorithm to Improve Test Answer Copying Detection Using the Omega Statistic

Peer reviewed

Direct link

Maeda, Hotaka; Zhang, Bo – International Journal of Testing, 2017

The omega (?) statistic is reputed to be one of the best indices for detecting answer copying on multiple choice tests, but its performance relies on the accurate estimation of copier ability, which is challenging because responses from the copiers may have been contaminated. We propose an algorithm that aims to identify and delete the suspected…

Descriptors: Cheating, Test Items, Mathematics, Statistics

An Illustration of Diagnostic Classification Modeling in Student Learning Outcomes Assessment

Peer reviewed

Direct link

Jurich, Daniel P.; Bradshaw, Laine P. – International Journal of Testing, 2014

The assessment of higher-education student learning outcomes is an important component in understanding the strengths and weaknesses of academic and general education programs. This study illustrates the application of diagnostic classification models, a burgeoning set of statistical models, in assessing student learning outcomes. To facilitate…

Descriptors: College Outcomes Assessment, Classification, Statistical Analysis, Models

Differential Item Functioning Detection with the Mantel-Haenszel Procedure: The Effects of Matching Types and Other Factors

Peer reviewed

Direct link

Socha, Alan; DeMars, Christine E.; Zilberberg, Anna; Phan, Ha – International Journal of Testing, 2015

The Mantel-Haenszel (MH) procedure is commonly used to detect items that function differentially for groups of examinees from various demographic and linguistic backgrounds--for example, in international assessments. As in some other DIF methods, the total score is used to match examinees on ability. In thin matching, each of the total score…

Descriptors: Test Items, Educational Testing, Evaluation Methods, Ability Grouping

Multiple Mini-Interviews in the Age of the Internet: Does Preparation Help Applicants to Medical School?

Peer reviewed

Direct link

Moshinsky, Avital; Ziegler, David; Gafni, Naomi – International Journal of Testing, 2017

Many medical schools have adopted multiple mini-interviews (MMI) as an advanced selection tool. MMIs are expensive and used to test only a few dozen candidates per day, making it infeasible to develop a different test version for each test administration. Therefore, some items are reused both within and across years. This study investigated the…

Descriptors: Interviews, Medical Schools, Test Validity, Test Reliability

Exploring Crossing Differential Item Functioning by Gender in Mathematics Assessment

Peer reviewed

Direct link

Ong, Yoke Mooi; Williams, Julian; Lamprianou, Iasonas – International Journal of Testing, 2015

The purpose of this article is to explore crossing differential item functioning (DIF) in a test drawn from a national examination of mathematics for 11-year-old pupils in England. An empirical dataset was analyzed to explore DIF by gender in a mathematics assessment. A two-step process involving the logistic regression (LR) procedure for…

Descriptors: Mathematics Tests, Gender Differences, Test Bias, Test Items

Exploring Differential Subgroup Functioning on SAT Writing Items: What Happens When English Is Not a Test Taker's Best Language?

Peer reviewed

Direct link

Engelhard, George, Jr.; Kobrin, Jennifer L.; Wind, Stefanie A. – International Journal of Testing, 2014

The purpose of this study is to explore patterns in model-data fit related to subgroups of test takers from a large-scale writing assessment. Using data from the SAT, a calibration group was randomly selected to represent test takers who reported that English was their best language from the total population of test takers (N = 322,011). A…

Descriptors: College Entrance Examinations, Writing Tests, Goodness of Fit, English

The Role of Item Models in Automatic Item Generation

Peer reviewed

Direct link

Gierl, Mark J.; Lai, Hollis – International Journal of Testing, 2012

Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…

Descriptors: Foreign Countries, Psychometrics, Test Construction, Test Items

Modeling Item-Level and Step-Level Invariance Effects in Polytomous Items Using the Partial Credit Model

Peer reviewed

Direct link

Gattamorta, Karina A.; Penfield, Randall D.; Myers, Nicholas D. – International Journal of Testing, 2012

Measurement invariance is a common consideration in the evaluation of the validity and fairness of test scores when the tested population contains distinct groups of examinees, such as examinees receiving different forms of a translated test. Measurement invariance in polytomous items has traditionally been evaluated at the item-level,…

Descriptors: Foreign Countries, Psychometrics, Test Bias, Test Items

A Range-Null Hypothesis Approach for Testing DIF under the Rasch Model

Peer reviewed

Direct link

Wells, Craig S.; Cohen, Allan S.; Patton, Jeffrey – International Journal of Testing, 2009

A primary concern with testing differential item functioning (DIF) using a traditional point-null hypothesis is that a statistically significant result does not imply that the magnitude of DIF is of practical interest. Similarly, for a given sample size, a non-significant result does not allow the researcher to conclude the item is free of DIF. To…

Descriptors: Test Bias, Test Items, Statistical Analysis, Hypothesis Testing

Item Equivalence in English and Chinese Translation of a Cognitive Development Test for Preschoolers

Peer reviewed

Direct link

He, Wei; Wolfe, Edward W. – International Journal of Testing, 2010

This article reports the results of a study of potential sources of item nonequivalence between English and Chinese language versions of a cognitive development test for preschool-aged children. Items were flagged for potential nonequivalence through statistical and judgment-based procedures, and the relationship between flag status and item…

Descriptors: Preschool Children, Mandarin Chinese, Cognitive Development, Item Analysis

Al-Saawi, Fahad	1
Bradshaw, Laine P.	1
Cohen, Allan S.	1
DeMars, Christine E.	1
Engelhard, George, Jr.	1
Gafni, Naomi	1
Gattamorta, Karina A.	1
Gierl, Mark J.	1
Guo, Hongwen	1
He, Wei	1
Jurich, Daniel P.	1
Kobrin, Jennifer L.	1
Lai, Hollis	1
Lamprianou, Iasonas	1
Liu, Yang	1
Maeda, Hotaka	1
Moshinsky, Avital	1
Myers, Nicholas D.	1
Ong, Yoke Mooi	1
Patton, Jeffrey	1
Penfield, Randall D.	1
Phan, Ha	1
Robin, Frederic	1
Sideridis, Georgios	1
Socha, Alan	1
More ▼