Showing all 7 results
Peer reviewed
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
To ensure content validity by covering a broad range of content domains, the total testing time of some educational large-scale assessments runs to two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
Peer reviewed
Man, Kaiwen; Harring, Jeffery R.; Ouyang, Yunbo; Thomas, Sarah L. – International Journal of Testing, 2018
Many important high-stakes decisions--college admission, academic performance evaluation, and even job promotion--depend on accurate and reliable scores from valid large-scale assessments. However, examinees sometimes cheat by copying answers from other test-takers or practicing with test items ahead of time, which can undermine the effectiveness…
Descriptors: Reaction Time, High Stakes Tests, Test Wiseness, Cheating
Peer reviewed
Kim, Yoon Jeon; Almond, Russell G.; Shute, Valerie J. – International Journal of Testing, 2016
Game-based assessment (GBA) is a specific use of educational games that employs game activities to elicit evidence for educationally valuable skills and knowledge. While this approach can provide individualized and diagnostic information about students, the design and development of assessment mechanics for a GBA is a nontrivial task. In this…
Descriptors: Design, Evidence Based Practice, Test Construction, Physics
Peer reviewed
Briggs, Derek C.; Circi, Ruhan – International Journal of Testing, 2017
Artificial Neural Networks (ANNs) have been proposed as a promising approach for the classification of students into different levels of a psychological attribute hierarchy. Unfortunately, because such classifications typically rely upon internally produced item response patterns that have not been externally validated, the instability of ANN…
Descriptors: Artificial Intelligence, Classification, Student Evaluation, Tests
Peer reviewed
Maeda, Hotaka; Zhang, Bo – International Journal of Testing, 2017
The omega (ω) statistic is reputed to be one of the best indices for detecting answer copying on multiple choice tests, but its performance relies on the accurate estimation of copier ability, which is challenging because responses from the copiers may have been contaminated. We propose an algorithm that aims to identify and delete the suspected…
Descriptors: Cheating, Test Items, Mathematics, Statistics
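
For context on the method this entry extends: Wollack's (1997) ω statistic (a hedged sketch; the notation below is assumed for illustration and is not quoted from the article) standardizes the observed number of matching answers between a suspected copier c and a source s against what an IRT model predicts for an independent test-taker of the copier's ability:

\[
\omega = \frac{h_{cs} - E(h_{cs} \mid \hat{\theta}_c)}{\sigma(h_{cs} \mid \hat{\theta}_c)},
\qquad
E(h_{cs} \mid \hat{\theta}_c) = \sum_{i=1}^{n} P_i(u_{is} \mid \hat{\theta}_c)
\]

Here h_cs is the number of items on which the two answer strings match, and P_i(u_is | θ̂_c) is the model-implied probability that the copier, answering independently, would select the source's observed response to item i. Under no copying, ω is approximately standard normal. The abstract's premise follows directly: if copied responses contaminate the ability estimate θ̂_c, the expected match count is biased, which is what the proposed identify-and-delete algorithm targets.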
Peer reviewed
Finch, W. Holmes; Hernández Finch, Maria E.; French, Brian F. – International Journal of Testing, 2016
Differential item functioning (DIF) assessment is key in score validation. When DIF is present, scores may not accurately reflect the construct of interest for some groups of examinees, leading to incorrect conclusions from the scores. Given rising immigration and the increased reliance of educational policymakers on cross-national assessments…
Descriptors: Test Bias, Scores, Native Language, Language Usage
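
To make the kind of screening this entry describes concrete, here is a minimal sketch of a logistic-regression DIF check in the spirit of Swaminathan and Rogers (1990). Everything below (the simulated data, variable names, and the choice of matching score) is an assumption for illustration, not the article's procedure or materials.

import numpy as np
import statsmodels.api as sm
from scipy.stats import chi2

rng = np.random.default_rng(0)
n = 2000
group = rng.integers(0, 2, n)            # 0 = reference, 1 = focal (e.g., language group)
theta = rng.normal(0.0, 1.0, n)          # latent proficiency
total = theta * 4 + 20                   # stand-in matching score
# Simulate one studied item with uniform DIF against the focal group
p = 1.0 / (1.0 + np.exp(-(theta - 0.5 * group)))
item = rng.binomial(1, p)

def fit_llf(X):
    # Log-likelihood of a logistic model predicting the studied item
    return sm.Logit(item, sm.add_constant(X)).fit(disp=0).llf

ll_base = fit_llf(np.column_stack([total]))                        # matching score only
ll_unif = fit_llf(np.column_stack([total, group]))                 # + group: uniform DIF
ll_nonu = fit_llf(np.column_stack([total, group, total * group]))  # + interaction: nonuniform DIF

# Likelihood-ratio chi-square tests, one degree of freedom each
print("uniform DIF    p =", chi2.sf(2 * (ll_unif - ll_base), df=1))
print("nonuniform DIF p =", chi2.sf(2 * (ll_nonu - ll_unif), df=1))

A significant group term flags uniform DIF; a significant interaction flags nonuniform DIF. Flagged items then go to content review, since DIF is statistical evidence of, not proof of, bias.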
Peer reviewed
Banks, Kathleen; Jeddeeni, Ahmad; Walker, Cindy M. – International Journal of Testing, 2016
Differential bundle functioning (DBF) analyses were conducted to determine whether seventh and eighth grade second language learners (SLLs) had lower probabilities of answering bundles of math word problems correctly that had heavy language demands, when compared to non-SLLs of equal math proficiency. Math word problems on each of four test forms…
Descriptors: Middle School Students, English Language Learners, Second Language Learning, Grade 7
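
DBF applies the item-level DIF logic above to a bundle of items scored together. As a hedged sketch of one common way to quantify it (a standardization-style contrast; this is an illustrative assumption, not necessarily the procedure used in the study above):

import numpy as np

def std_p_dbf(bundle_scores, total_scores, focal):
    # Standardized difference in mean bundle score between focal
    # and reference examinees matched on total test score; negative
    # values suggest the bundle disadvantages the focal group.
    bundle_scores = np.asarray(bundle_scores, dtype=float)
    total_scores = np.asarray(total_scores)
    focal = np.asarray(focal, dtype=bool)
    diff, n_focal = 0.0, focal.sum()
    for k in np.unique(total_scores[focal]):
        at_k = total_scores == k
        f, r = focal & at_k, ~focal & at_k
        if f.any() and r.any():
            weight = f.sum() / n_focal     # focal-group weight at score k
            diff += weight * (bundle_scores[f].mean() - bundle_scores[r].mean())
    return diff

Applied to a bundle of language-heavy word problems, a sizable negative value for SLLs matched on overall math proficiency would be consistent with the language-demand hypothesis the study tests.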