ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	9
Since 2017 (last 10 years)	16
Since 2007 (last 20 years)	24

Descriptor

Item Response Theory	38
Test Items	38
Testing Problems	38
Test Construction	11
Difficulty Level	8
Foreign Countries	8
Simulation	8
Achievement Tests	7
Computer Assisted Testing	7
Estimation (Mathematics)	7
Error of Measurement	6
International Assessment	6
Adaptive Testing	5
Models	5
Multiple Choice Tests	5
Psychometrics	5
Scores	5
Secondary School Students	5
Comparative Analysis	4
Educational Assessment	4
Equated Scores	4
Item Bias	4
Mathematical Models	4
Responses	4
Bayesian Statistics	3
More ▼

Publication Type

Journal Articles	25
Reports - Research	22
Reports - Evaluative	12
Speeches/Meeting Papers	10
Numerical/Quantitative Data	2
Reports - Descriptive	2
Dissertations/Theses -…	1
Information Analyses	1
Tests/Questionnaires	1

Education Level

Secondary Education	5
Higher Education	4
Postsecondary Education	3
Adult Education	1
Elementary Education	1
Elementary Secondary Education	1
Grade 4	1
Intermediate Grades	1

Audience

Location

Kentucky	1
Latin America	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	5
Graduate Management Admission…	1
National Assessment of…	1
Progress in International…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 38 results Save | Export

The Development of a Standardized Effect Size for the SIBTEST Procedure

Peer reviewed

Direct link

James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024

In this study a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that are appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…

Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size

A Robust Method for Detecting Item Misfit in Large-Scale Assessments

Peer reviewed

Direct link

von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023

Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…

Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit

IRTrees for Skipping Items in PIRLS

Peer reviewed

Direct link

Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024

In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…

Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment

Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection

Peer reviewed

Direct link

Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022

In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…

Descriptors: Standardized Tests, Test Items, Test Validity, Scores

Better Remedies for Bad Exams: Correcting for Difficult Questions in a Fair and Systematic Way

Peer reviewed
PDF on ERIC

Download full text

Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022

Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…

Descriptors: College Students, Student Evaluation, Tests, Test Items

Hybrid Threshold-Based Sequential Procedures for Detecting Compromised Items in a Computerized Adaptive Testing Licensure Exam

Peer reviewed

Direct link

Lee, Chansoon; Qian, Hong – Educational and Psychological Measurement, 2022

Using classical test theory and item response theory, this study applied sequential procedures to a real operational item pool in a variable-length computerized adaptive testing (CAT) to detect items whose security may be compromised. Moreover, this study proposed a hybrid threshold approach to improve the detection power of the sequential…

Descriptors: Computer Assisted Testing, Adaptive Testing, Licensing Examinations (Professions), Item Response Theory

Effect of Item Parameter Drift in Mixed Format Common Items on Test Equating

Peer reviewed
PDF on ERIC

Download full text

Uysal, Ibrahim; Sahin-Kürsad, Merve; Kiliç, Abdullah Faruk – Participatory Educational Research, 2022

The aim of the study was to examine the common items in the mixed format (e.g., multiple-choices and essay items) contain parameter drifts in the test equating processes performed with the common item nonequivalent groups design. In this study, which was carried out using Monte Carlo simulation with a fully crossed design, the factors of test…

Descriptors: Test Items, Test Format, Item Response Theory, Equated Scores

Comparing Different Trend Estimation Approaches in Country Means and Standard Deviations in International Large-Scale Assessment Studies

Peer reviewed

Direct link

Robitzsch, Alexander; Lüdtke, Oliver – Large-scale Assessments in Education, 2023

One major aim of international large-scale assessments (ILSA) like PISA is to monitor changes in student performance over time. To accomplish this task, a set of common items (i.e., link items) is repeatedly administered in each assessment. Linking methods based on item response theory (IRT) models are used to align the results from the different…

Descriptors: Educational Trends, Trend Analysis, International Assessment, Achievement Tests

Simultaneously Modeling Differential Testlet Functioning and Differential Item Functioning: Addressing Variance Heterogeneity with a Multigroup One-Parameter Testlet Model

Peer reviewed

Direct link

Luo, Yong; Liang, Xinya – Measurement: Interdisciplinary Research and Perspectives, 2019

Current methods that simultaneously model differential testlet functioning (DTLF) and differential item functioning (DIF) constrain the variances of latent ability and testlet effects to be equal between the focal and the reference groups. Such a constraint can be stringent and unrealistic with real data. In this study, we propose a multigroup…

Descriptors: Test Items, Item Response Theory, Test Bias, Models

A Review of Subscore Estimation Methods. ETS RR-18-17

Peer reviewed
PDF on ERIC

Download full text

Fu, Jianbin; Qu, Yanxuan – ETS Research Report Series, 2018

Various subscore estimation methods that use auxiliary information to improve subscore accuracy and stability have been developed. This report provides a review of various subscore estimation methods described in the literature. The methodology of each method is described, then research studies on these subscore estimation methods are summarized.…

Descriptors: Scores, Evaluation Methods, Item Response Theory, Test Items

A Response Time Process Model for Not-Reached and Omitted Items

Peer reviewed

Direct link

Lu, Jing; Wang, Chun – Journal of Educational Measurement, 2020

Item nonresponses are prevalent in standardized testing. They happen either when students fail to reach the end of a test due to a time limit or quitting, or when students choose to omit some items strategically. Oftentimes, item nonresponses are nonrandom, and hence, the missing data mechanism needs to be properly modeled. In this paper, we…

Descriptors: Item Response Theory, Test Items, Standardized Tests, Responses

FIPC Linking across Multidimensional Test Forms: Effects of Confounding Difficulty within Dimensions

Peer reviewed

Direct link

Kim, Sohee; Cole, Ki Lynn; Mwavita, Mwarumba – International Journal of Testing, 2018

This study investigated the effects of linking potentially multidimensional test forms using the fixed item parameter calibration. Forms had equal or unequal total test difficulty with and without confounding difficulty. The mean square errors and bias of estimated item and ability parameters were compared across the various confounding tests. The…

Descriptors: Test Items, Item Response Theory, Test Format, Difficulty Level

Detection of Item Preknowledge Using Likelihood Ratio Test and Score Test

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2017

An increasing concern of producers of educational assessments is fraudulent behavior during the assessment (van der Linden, 2009). Benefiting from item preknowledge (e.g., Eckerly, 2017; McLeod, Lewis, & Thissen, 2003) is one type of fraudulent behavior. This article suggests two new test statistics for detecting individuals who may have…

Descriptors: Test Items, Cheating, Testing Problems, Identification

The Retrofit of an English Language Placement Test Used for Large-Scale Assessments in Higher Education

Peer reviewed
PDF on ERIC

Download full text

Mendoza, Arturo; Martínez, Joaquín – International Journal of Language Testing, 2023

Language placement tests (LPTs) are used to assess students' proficiency in a progressive manner in the target language. Based on their performance, students are assigned to stepped language courses. These tests are usually considered low stakes because they do not have significant consequences in students' lives, which is perhaps the reason why…

Descriptors: Language Tests, English (Second Language), Second Language Learning, Second Language Instruction

Spoilt for Choice? Issues around the Use and Comparability of Optional Exam Questions

Peer reviewed

Direct link

Bramley, Tom; Crisp, Victoria – Assessment in Education: Principles, Policy & Practice, 2019

For many years, question choice has been used in some UK public examinations, with students free to choose which questions they answer from a selection (within certain parameters). There has been little published research on choice of exam questions in recent years in the UK. In this article we distinguish different scenarios in which choice…

Descriptors: Test Items, Test Construction, Difficulty Level, Foreign Countries

Previous Page | Next Page »

Pages: 1 | 2 | 3

Journal of Educational…	5
Educational and Psychological…	3
Journal of Educational and…	3
ETS Research Report Series	2
AERA Online Paper Repository	1
Applied Measurement in…	1
Assessment in Education:…	1
Educational Assessment,…	1
Electronic Journal of Science…	1
International Journal for the…	1
International Journal of…	1
International Journal of…	1
Journal of Experimental…	1
Large-scale Assessments in…	1
Measurement:…	1
Online Submission	1
Participatory Educational…	1
ProQuest LLC	1
Psychometrika	1
More ▼

Debeer, Dries	2
Hambleton, Ronald K.	2
Janssen, Rianne	2
Robitzsch, Alexander	2
von Davier, Matthias	2
Allison Ames	1
Andrés Christiansen	1
Baghi, Heibatollah	1
Bezirhan, Ummugul	1
Bramley, Tom	1
Brandon Crawford	1
Camenares, Devin	1
Chen, Haiwen H.	1
Chen, Yunxiao	1
Cohen, Allan S.	1
Cole, Ki Lynn	1
Crisp, Victoria	1
Cui, Ying	1
Dancer, L. Suzanne	1
De Ayala, R. J.	1
De Boeck, Paul	1
Dorans, Neil J.	1
Ferrara, Steven F.	1
Fu, Jianbin	1
More ▼