Showing all 14 results
Peer reviewed
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
To ensure content validity by covering a broad range of content domains, some educational large-scale assessments have total testing times of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
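The position-effect idea in this abstract lends itself to a small illustration. Below is a minimal sketch, on simulated data, of estimating a per-position drop in the log-odds of a correct response with logistic regression; the effect size, sample sizes, and the use of a known ability value are assumptions for the demo, not details from the article.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n_persons, n_items = 500, 60
ability = rng.normal(0, 1, n_persons)
position = np.arange(n_items)              # 0 = first item administered
decline = -0.02                            # assumed drop in log-odds per position

# Simulate responses whose success probability falls with item position.
logit = ability[:, None] + decline * position[None, :]
y = (rng.random((n_persons, n_items)) < 1 / (1 + np.exp(-logit))).astype(int)

# Long format: regress correctness on an ability proxy and item position.
X = sm.add_constant(np.column_stack([
    np.repeat(ability, n_items),           # known here; estimated in practice
    np.tile(position, n_persons),
]))
fit = sm.Logit(y.ravel(), X).fit(disp=False)
print(fit.params)                          # position coefficient should be near -0.02
```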
Peer reviewed
Man, Kaiwen; Harring, Jeffery R.; Ouyang, Yunbo; Thomas, Sarah L. – International Journal of Testing, 2018
Many important high-stakes decisions--college admission, academic performance evaluation, and even job promotion--depend on accurate and reliable scores from valid large-scale assessments. However, examinees sometimes cheat by copying answers from other test-takers or practicing with test items ahead of time, which can undermine the effectiveness…
Descriptors: Reaction Time, High Stakes Tests, Test Wiseness, Cheating
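As a rough illustration of response-time screening (not the authors' model), the sketch below standardizes log response times within items and flags examinees who are unusually fast on items they answer correctly; the data, cutoff, and flagging statistic are all invented for the example.

```python
import numpy as np

rng = np.random.default_rng(1)
n_persons, n_items = 200, 40
log_rt = rng.normal(3.5, 0.5, (n_persons, n_items))   # simulated log seconds
correct = rng.random((n_persons, n_items)) < 0.7

# Standardize log response times within each item.
z = (log_rt - log_rt.mean(axis=0)) / log_rt.std(axis=0)

# Mean z over correctly answered items; strongly negative = fast-and-correct.
flag_stat = np.nanmean(np.where(correct, z, np.nan), axis=1)
flagged = np.where(flag_stat < -1.0)[0]               # assumed cutoff
print(f"{len(flagged)} examinees flagged for review")
```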
Peer reviewed
Kim, Yoon Jeon; Almond, Russell G.; Shute, Valerie J. – International Journal of Testing, 2016
Game-based assessment (GBA) is a specific use of educational games that employs game activities to elicit evidence for educationally valuable skills and knowledge. While this approach can provide individualized and diagnostic information about students, the design and development of assessment mechanics for a GBA is a nontrivial task. In this…
Descriptors: Design, Evidence Based Practice, Test Construction, Physics
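The evidence-identification idea behind GBA can be illustrated with a toy Bayesian update of a single binary skill from game-task outcomes. This is a generic evidence-accumulation sketch; the conditional probabilities are invented, and the article's actual assessment mechanics are not reproduced here.

```python
def update_skill(prior, outcome, p_succ_mastered=0.8, p_succ_not=0.3):
    """Posterior P(mastered) after one observed game-task outcome.
    The conditional probabilities are invented for illustration."""
    like_m = p_succ_mastered if outcome else 1 - p_succ_mastered
    like_n = p_succ_not if outcome else 1 - p_succ_not
    return prior * like_m / (prior * like_m + (1 - prior) * like_n)

p = 0.5                                      # neutral prior on mastery
for outcome in (True, True, False, True):    # hypothetical task results
    p = update_skill(p, outcome)
print(f"P(skill mastered) = {p:.3f}")
```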
Peer reviewed
Briggs, Derek C.; Circi, Ruhan – International Journal of Testing, 2017
Artificial Neural Networks (ANNs) have been proposed as a promising approach for the classification of students into different levels of a psychological attribute hierarchy. Unfortunately, because such classifications typically rely upon internally produced item response patterns that have not been externally validated, the instability of ANN…
Descriptors: Artificial Intelligence, Classification, Student Evaluation, Tests
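One way to see the instability concern is to classify the same response patterns with networks that differ only in random initialization and measure how often they agree. The sketch below does this on simulated data with scikit-learn; the architecture, labels, and seeds are arbitrary choices for the demonstration.

```python
import numpy as np
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(2)
X = rng.integers(0, 2, (300, 20)).astype(float)               # item response patterns
y = (X.sum(axis=1) + rng.normal(0, 2, 300) > 10).astype(int)  # noisy proficiency label

# Train the same architecture under different random initializations.
preds = [
    MLPClassifier(hidden_layer_sizes=(8,), max_iter=2000,
                  random_state=seed).fit(X, y).predict(X)
    for seed in range(5)
]

# Classification agreement between runs differing only in initialization.
agree = np.mean([np.mean(preds[0] == p) for p in preds[1:]])
print(f"mean agreement with first run: {agree:.3f}")
```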
Peer reviewed
Maeda, Hotaka; Zhang, Bo – International Journal of Testing, 2017
The omega (ω) statistic is reputed to be one of the best indices for detecting answer copying on multiple choice tests, but its performance relies on the accurate estimation of copier ability, which is challenging because responses from the copiers may have been contaminated. We propose an algorithm that aims to identify and delete the suspected…
Descriptors: Cheating, Test Items, Mathematics, Statistics
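For dichotomous items, the core of an omega-style index is a standardized comparison of observed answer matches against the matches expected given the copier's ability. The simplified sketch below illustrates that logic; the published omega operates on multiple-choice options via a nominal response model, so this is an analogy, not the exact statistic.

```python
import numpy as np

def omega_like(copier, source, p_correct):
    """copier, source: 0/1 response vectors; p_correct: copier's model-implied
    probability of answering each item correctly (from an IRT calibration)."""
    # Probability the copier matches the source by chance on each item.
    p_match = np.where(source == 1, p_correct, 1 - p_correct)
    observed = np.sum(copier == source)
    expected = p_match.sum()
    return (observed - expected) / np.sqrt((p_match * (1 - p_match)).sum())

rng = np.random.default_rng(3)
p = rng.uniform(0.3, 0.9, 50)                 # copier's item success probabilities
source = (rng.random(50) < 0.6).astype(int)
copier = source.copy()                        # extreme case: full copying
print(f"omega-like z = {omega_like(copier, source, p):.2f}")  # large positive value
```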
Peer reviewed
Lee, HyeSun; Geisinger, Kurt F. – International Journal of Testing, 2014
Differential item functioning (DIF) analysis is important in terms of test fairness. While DIF analyses have mainly been conducted with manifest grouping variables, such as gender or race/ethnicity, it has been recently claimed that not only the grouping variables but also contextual variables pertaining to examinees should be considered in DIF…
Descriptors: Test Bias, Gender Differences, Regression (Statistics), Statistical Analysis
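A common way to bring a contextual variable into DIF analysis is to add it to a logistic regression alongside the matching variable and the manifest group. The sketch below simulates an item whose functioning depends on the contextual variable rather than the group; the variable names and effect sizes are illustrative.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(4)
n = 1000
total = rng.normal(0, 1, n)        # matching variable (e.g., rest score)
group = rng.integers(0, 2, n)      # manifest group (e.g., gender)
context = rng.normal(0, 1, n)      # contextual variable (e.g., opportunity to learn)

# Simulate an item whose functioning depends on the contextual variable.
logit = 0.8 * total - 0.5 * context
y = (rng.random(n) < 1 / (1 + np.exp(-logit))).astype(int)

X = sm.add_constant(np.column_stack([total, group, context]))
fit = sm.Logit(y, X).fit(disp=False)
print(fit.params)                  # group term near 0, context term near -0.5
```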
Peer reviewed
Finch, W. Holmes; Hernández Finch, Maria E.; French, Brian F. – International Journal of Testing, 2016
Differential item functioning (DIF) assessment is key in score validation. When DIF is present, scores may not accurately reflect the construct of interest for some groups of examinees, leading to incorrect conclusions from the scores. Given rising immigration and the increased reliance of educational policymakers on cross-national assessments…
Descriptors: Test Bias, Scores, Native Language, Language Usage
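A standard screening tool in this literature is the Mantel-Haenszel common odds ratio computed within total-score strata. The sketch below implements that computation on simulated data; it is a generic MH DIF check, not necessarily the procedure used in the article.

```python
import numpy as np

def mantel_haenszel_or(correct, group, strata):
    """Common odds ratio across score strata (group 0 = reference, 1 = focal)."""
    num = den = 0.0
    for s in np.unique(strata):
        m = strata == s
        a = np.sum(correct[m] & (group[m] == 0))     # reference correct
        b = np.sum(~correct[m] & (group[m] == 0))    # reference incorrect
        c = np.sum(correct[m] & (group[m] == 1))     # focal correct
        d = np.sum(~correct[m] & (group[m] == 1))    # focal incorrect
        n = a + b + c + d
        if n:
            num += a * d / n
            den += b * c / n
    return num / den

rng = np.random.default_rng(5)
n = 2000
strata = rng.integers(0, 5, n)                       # total-score bands
group = rng.integers(0, 2, n)
correct = rng.random(n) < 0.4 + 0.1 * strata - 0.1 * group   # focal disadvantaged
print(f"MH odds ratio: {mantel_haenszel_or(correct, group, strata):.2f}")  # > 1 flags DIF
```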
Peer reviewed
Banks, Kathleen; Jeddeeni, Ahmad; Walker, Cindy M. – International Journal of Testing, 2016
Differential bundle functioning (DBF) analyses were conducted to determine whether seventh- and eighth-grade second language learners (SLLs) had lower probabilities of correctly answering bundles of math word problems with heavy language demands, compared to non-SLLs of equal math proficiency. Math word problems on each of four test forms…
Descriptors: Middle School Students, English Language Learners, Second Language Learning, Grade 7
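A back-of-the-envelope version of a DBF comparison: match SLLs and non-SLLs on their scores over the language-light items, then compare mean scores on the language-heavy bundle within each matched stratum. Everything below (bundle membership, effect size, matching scheme) is simulated and simplified relative to formal DBF procedures such as SIBTEST.

```python
import numpy as np

rng = np.random.default_rng(6)
n_persons, n_items = 800, 30
bundle = np.arange(10)                        # indices of language-heavy items
rest = np.arange(10, n_items)
sll = rng.integers(0, 2, n_persons).astype(bool)

ability = rng.normal(0, 1, n_persons)
p = 1 / (1 + np.exp(-(ability[:, None] - rng.normal(0, 1, n_items))))
p[np.ix_(sll, bundle)] *= 0.8                 # language demand hurts SLLs on the bundle
y = (rng.random((n_persons, n_items)) < p).astype(int)

rest_score = y[:, rest].sum(axis=1)           # matching variable
gaps = []
for s in np.unique(rest_score):
    m = rest_score == s
    if m[sll].sum() and m[~sll].sum():        # stratum has both groups
        gaps.append(y[np.ix_(m & ~sll, bundle)].sum(axis=1).mean()
                    - y[np.ix_(m & sll, bundle)].sum(axis=1).mean())
print(f"mean matched bundle-score gap: {np.mean(gaps):.2f}")   # > 0 favors non-SLLs
```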
Peer reviewed
Ong, Yoke Mooi; Williams, Julian; Lamprianou, Iasonas – International Journal of Testing, 2015
The purpose of this article is to explore crossing differential item functioning (DIF) in a test drawn from a national examination of mathematics for 11-year-old pupils in England. An empirical dataset was analyzed to explore DIF by gender in a mathematics assessment. A two-step process involving the logistic regression (LR) procedure for…
Descriptors: Mathematics Tests, Gender Differences, Test Bias, Test Items
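Crossing DIF is commonly probed by adding a group-by-ability interaction to the logistic regression DIF model: a sizable interaction with a negligible uniform group term suggests the item favors different groups at different ability levels. A minimal sketch on simulated data:

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(7)
n = 2000
theta = rng.normal(0, 1, n)                   # matching variable
group = rng.integers(0, 2, n)

# Simulate crossing DIF: slopes differ by group, no uniform shift.
logit = (1.0 + 0.6 * group) * theta
y = (rng.random(n) < 1 / (1 + np.exp(-logit))).astype(int)

X = sm.add_constant(np.column_stack([theta, group, theta * group]))
fit = sm.Logit(y, X).fit(disp=False)
print(fit.params)   # interaction term near 0.6, uniform group term near 0
```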
Peer reviewed
Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2012
Personnel selection shows an enduring need for short stand-alone tests consisting of, say, 5 to 15 items. Despite their efficiency, short tests are more vulnerable to measurement error than longer test versions. Consequently, the question arises to what extent reducing test length degrades decision quality due to the increased impact of…
Descriptors: Measurement, Personnel Selection, Decision Making, Error of Measurement
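The vulnerability of short tests can be made concrete with the Spearman-Brown formula, rho_k = k*rho / (1 + (k - 1)*rho), which projects reliability when test length is scaled by a factor k. The base reliability and test lengths below are illustrative, not taken from the article:

```python
def spearman_brown(rho_full, k):
    """Projected reliability when test length is scaled by factor k."""
    return k * rho_full / (1 + (k - 1) * rho_full)

rho_40 = 0.90                       # assumed reliability of a 40-item test
for n_items in (40, 15, 10, 5):
    rho = spearman_brown(rho_40, n_items / 40)
    print(f"{n_items:>2} items: reliability = {rho:.2f}")
# 40 -> 0.90, 15 -> 0.77, 10 -> 0.69, 5 -> 0.53
```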
Peer reviewed
Gierl, Mark J.; Alves, Cecilia; Majeau, Renate Taylor – International Journal of Testing, 2010
The purpose of this study is to apply the attribute hierarchy method in an operational diagnostic mathematics program at Grades 3 and 6 to promote cognitive inferences about students' problem-solving skills. The attribute hierarchy method is a psychometric procedure for classifying examinees' test item responses into a set of structured attribute…
Descriptors: Test Items, Student Reaction, Diagnostic Tests, Psychometrics
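The attribute hierarchy method's central move, restricting which attribute patterns are permissible and mapping them to expected item responses through a Q-matrix, can be sketched in a few lines. The hierarchy, Q-matrix, and conjunctive scoring rule below are invented for illustration:

```python
import numpy as np
from itertools import product

# Linear hierarchy A1 -> A2 -> A3: each attribute requires its predecessors,
# so a pattern is permissible only if it is non-increasing.
def permissible(pattern):
    return all(pattern[i] >= pattern[i + 1] for i in range(len(pattern) - 1))

patterns = [p for p in product([1, 0], repeat=3) if permissible(p)]

# Q-matrix: rows = items, columns = attributes the item requires.
Q = np.array([[1, 0, 0],
              [1, 1, 0],
              [1, 1, 1],
              [0, 1, 0]])

# Conjunctive rule: item answered correctly iff all required attributes held.
for p in patterns:
    expected = (Q @ np.array(p) == Q.sum(axis=1)).astype(int)
    print(p, "->", expected)
```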
Peer reviewed
Liu, Ou Lydia; Wilson, Mark – International Journal of Testing, 2009
Differential gender performance in standardized mathematics assessment has long been a heated topic. Gender gaps of varied magnitude have been identified on large-scale assessments in the United States. To continue the investigation, this study examined male and female performance on the Programme for International Student Assessment (PISA) 2003…
Descriptors: Foreign Countries, Probability, Gender Differences, Standardized Tests
Peer reviewed
Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009
Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…
Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment
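Since Rasch item information is I(theta) = P(theta) * (1 - P(theta)), one can compare an item's information functions under two group-specific calibrations and summarize their similarity. The overlap measure below is an illustrative stand-in, not necessarily the published ISI formula:

```python
import numpy as np

def rasch_info(theta, b):
    """Rasch item information at ability theta for item difficulty b."""
    p = 1 / (1 + np.exp(-(theta - b)))
    return p * (1 - p)

theta = np.linspace(-4, 4, 401)
info_ref = rasch_info(theta, b=0.0)        # reference-group calibration
info_foc = rasch_info(theta, b=0.8)        # focal-group calibration

# Overlap of the two information curves, normalized to [0, 1].
overlap = np.trapz(np.minimum(info_ref, info_foc), theta) / np.trapz(info_ref, theta)
print(f"information-curve similarity: {overlap:.2f}")   # 1.0 means identical
```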
Peer reviewed
Veldkamp, Bernard P. – International Journal of Testing, 2008
Integrity™, an online application for testing both the statistical integrity of the test and the academic integrity of the examinees, was evaluated for this review. Program features and the program output are described. An overview of the statistics in Integrity™ is provided, and the application is illustrated with a small simulation study.…
Descriptors: Simulation, Integrity, Statistics, Computer Assisted Testing