ERIC - Search Results

Publication Date

In 2025	5
Since 2024	12
Since 2021 (last 5 years)	40
Since 2016 (last 10 years)	65
Since 2006 (last 20 years)	113

Descriptor

Item Response Theory	151
Test Format	151
Test Items	151
Difficulty Level	41
Foreign Countries	40
Test Construction	39
Comparative Analysis	35
Multiple Choice Tests	35
Scores	31
Models	27
Item Analysis	26
Computer Assisted Testing	24
Equated Scores	24
Mathematics Tests	23
Science Tests	18
Simulation	18
Statistical Analysis	18
Psychometrics	16
Achievement Tests	14
Sample Size	14
Test Bias	13
Test Reliability	13
Test Validity	13
Accuracy	12
Language Tests	12

Publication Type

Reports - Research	105
Journal Articles	103
Reports - Evaluative	27
Speeches/Meeting Papers	21
Dissertations/Theses -…	11
Reports - Descriptive	6
Numerical/Quantitative Data	5
Tests/Questionnaires	3
Collected Works - General	1
Guides - General	1
Information Analyses	1
Non-Print Media	1
Reference Materials - General	1

Education Level

Higher Education	27
Secondary Education	25
Postsecondary Education	23
Elementary Education	15
Grade 8	11
Middle Schools	11
Elementary Secondary Education	10
Junior High Schools	10
High Schools	8
Grade 4	7
Intermediate Grades	6
Grade 3	4
Grade 6	4
Grade 5	3
Grade 7	3
Early Childhood Education	2
Grade 12	2
Grade 9	2
Primary Education	2
Grade 1	1
Grade 2	1

Location

Turkey	7
Germany	5
Australia	4
Canada	3
Iran	2
Japan	2
Malaysia	2
United States	2
Belgium	1
China	1
Florida	1
France	1
Hong Kong	1
Illinois	1
Indonesia	1
Iowa	1
Italy	1
Mexico	1
Netherlands	1
Nigeria	1
Oman	1
Oregon	1
Philippines	1
Saudi Arabia	1
Singapore	1

Showing 1 to 15 of 151 results

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

Constructing a Robust Score Scale from IRT Scores with Informed Boundaries

Peer reviewed

Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022

In operational testing, item response theory (IRT) models for dichotomous responses are popular for measuring a single latent construct [theta], such as cognitive ability in a content domain. Estimates of [theta], also called IRT scores or [theta hat], can be computed using estimators based on the likelihood function, such as maximum likelihood…

Descriptors: Scores, Item Response Theory, Test Items, Test Format

From Likert to Forced Choice: Statement Parameter Invariance and Context Effects in Personality Assessment

Peer reviewed

Jianbin Fu; Patrick C. Kyllonen; Xuan Tan – Measurement: Interdisciplinary Research and Perspectives, 2024

Users of forced-choice questionnaires (FCQs) to measure personality commonly assume statement parameter invariance across contexts -- between Likert and forced-choice (FC) items and between different FC items that share a common statement. In this paper, an empirical study was designed to check these two assumptions for an FCQ assessment measuring…

Descriptors: Measurement Techniques, Questionnaires, Personality Measures, Interpersonal Competence

Application of Two-Parameter Item Response Theory for Determining Form-Dependent Items on Exams Using Different Item Orders

Peer reviewed

Pentecost, Thomas C.; Raker, Jeffery R.; Murphy, Kristen L. – Practical Assessment, Research & Evaluation, 2023

Using multiple versions of an assessment has the potential to introduce item environment effects. These types of effects result in version dependent item characteristics (i.e., difficulty and discrimination). Methods to detect such effects and resulting implications are important for all levels of assessment where multiple forms of an assessment…

Descriptors: Item Response Theory, Test Items, Test Format, Science Tests

A Systematic Review of Differential Item Functioning in Second Language Assessment

Peer reviewed

Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025

The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…

Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis

The Effect of Question Positioning on Data Quality in Web Surveys

Peer reviewed

Cornelia Eva Neuert – Sociological Methods & Research, 2024

The quality of data in surveys is affected by response burden and questionnaire length. With an increasing number of questions, respondents can become bored, tired, and annoyed and may take shortcuts to reduce the effort needed to complete the survey. In this article, direct evidence is presented on how the position of items within a web…

Descriptors: Online Surveys, Test Items, Test Format, Test Construction

Impact of Differential Item Functioning on Item Model Fit Using Concurrent Equating Method

Peer reviewed

Zeynep Uzun; Tuncay Ögretmen – Large-scale Assessments in Education, 2025

This study aimed to evaluate the item model fit by equating the forms of the PISA 2018 mathematics subtest with concurrent common items equating in samples from Türkiye, the UK, and Italy. The answers given in mathematics subtest Forms 2, 8, and 12 were used in this context. Analyses were performed using the Dichotomous Rasch Model in the WINSTEPS…

Descriptors: Item Response Theory, Test Items, Foreign Countries, Mathematics Tests

A Comparison of Three Designs for List-Style Open-Ended Questions in Web Surveys

Peer reviewed

Kunz, Tanja; Meitinger, Katharina – Field Methods, 2022

Although list-style open-ended questions generally help us gain deeper insights into respondents' thoughts, opinions, and behaviors, the quality of responses is often compromised. We tested a dynamic and a follow-up design to motivate respondents to give higher quality responses than with a static design, but without overburdening them. Our…

Descriptors: Online Surveys, Item Response Theory, Test Items, Test Format

Modeling Slipping Effects in a Large-Scale Assessment with Innovative Item Formats

Peer reviewed

Cuhadar, Ismail; Binici, Salih – Educational Measurement: Issues and Practice, 2022

This study employs the 4-parameter logistic item response theory model to account for the unexpected incorrect responses or slipping effects observed in a large-scale Algebra 1 End-of-Course assessment, including several innovative item formats. It investigates whether modeling the misfit at the upper asymptote has any practical impact on the…

Descriptors: Item Response Theory, Measurement, Student Evaluation, Algebra

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

A Comparison of IRT Linking Approaches under the Nonequivalent Groups Anchor Test Design

Jiajing Huang – ProQuest LLC, 2022

The nonequivalent-groups anchor-test (NEAT) data-collection design is commonly used in large-scale assessments. Under this design, different test groups take different test forms. Each test form has its own unique items and all test forms share a set of common items. If item response theory (IRT) models are applied to analyze the test data, the…

Descriptors: Item Response Theory, Test Format, Test Items, Test Construction

The Impact of Local Item Dependence on Computer Adaptive Testing Given between and within Testlet Adaptivity

Ozge Ersan Cinar – ProQuest LLC, 2022

In educational tests, a group of questions related to a shared stimulus is called a testlet (e.g., a reading passage with multiple related questions). Use of testlets is very common in educational tests. Additionally, computerized adaptive testing (CAT) is a mode of testing where the test forms are created in real time tailoring to the test…

Descriptors: Test Items, Computer Assisted Testing, Adaptive Testing, Educational Testing

A Comparative Study of AI-Human-Made and Human-Made Test Forms for a University TESOL Theory Course

Peer reviewed

Kyung-Mi O. – Language Testing in Asia, 2024

This study examines the efficacy of artificial intelligence (AI) in creating parallel test items compared to human-made ones. Two test forms were developed: one consisting of 20 existing human-made items and another with 20 new items generated with ChatGPT assistance. Expert reviews confirmed the content parallelism of the two test forms.…

Descriptors: Comparative Analysis, Artificial Intelligence, Computer Software, Test Items

Can High-Dimensional Questionnaires Resolve the Ipsativity Issue of Forced-Choice Response Formats?

Peer reviewed

Schulte, Niklas; Holling, Heinz; Bürkner, Paul-Christian – Educational and Psychological Measurement, 2021

Forced-choice questionnaires can prevent faking and other response biases typically associated with rating scales. However, the derived trait scores are often unreliable and ipsative, making interindividual comparisons in high-stakes situations impossible. Several studies suggest that these problems vanish if the number of measured traits is high.…

Descriptors: Questionnaires, Measurement Techniques, Test Format, Scoring

Source

Educational and Psychological…	14
ProQuest LLC	11
Applied Measurement in…	9
ETS Research Report Series	7
Journal of Educational…	7
Language Testing	7
Applied Psychological…	6
Practical Assessment,…	6
International Journal of…	5
Grantee Submission	4
Educational Measurement:…	3
International Journal of…	3
College Board	2
International Journal of…	2
Journal of Educational and…	2
Measurement:…	2
ACT, Inc.	1
Advances in Health Sciences…	1
Behavioral Research and…	1
Discover Education	1
Educational Assessment	1
Educational Psychology	1
Field Methods	1
Intelligence	1
International Association for…	1

Author

Lee, Won-Chan	3
Nicewander, W. Alan	3
Sykes, Robert C.	3
Wang, Lin	3
Wang, Wen-Chung	3
Ackerman, Terry	2
Baghaei, Purya	2
Bulut, Okan	2
DeBoer, George E.	2
Frey, Andreas	2
Han, Kyung T.	2
Hardcastle, Joseph	2
Herrmann-Abell, Cari F.	2
Hohensinn, Christine	2
Jianbin Fu	2
Kubinger, Klaus D.	2
Lunz, Mary E.	2
Mellenbergh, Gideon J.	2
Patrick C. Kyllonen	2
Pommerich, Mary	2
Schaeffer, Gary A.	2
Sinharay, Sandip	2
Sireci, Stephen G.	2
Wainer, Howard	2

Assessments and Surveys

Program for International…	6
Advanced Placement…	3
Graduate Record Examinations	3
Trends in International…	3
ACT Assessment	2
Peabody Picture Vocabulary…	2
SAT (College Admission Test)	2
Test of English as a Foreign…	2
Armed Services Vocational…	1
College Level Examination…	1
Defining Issues Test	1
Gates MacGinitie Reading Tests	1
International English…	1
Iowa Tests of Basic Skills	1
Law School Admission Test	1
National Assessment Program…	1
National Assessment of…	1
Peabody Individual…	1
Raven Progressive Matrices	1
Remote Associates Test	1
Wechsler Individual…	1