Showing 1 to 15 of 58 results
Peer reviewed
Engelhard, George – Educational and Psychological Measurement, 2023
The purpose of this study is to introduce a functional approach for modeling unfolding response data. Functional data analysis (FDA) has been used for examining cumulative item response data, but a functional approach has not been systematically used with unfolding response processes. A brief overview of FDA is presented and illustrated within the…
Descriptors: Data Analysis, Models, Responses, Test Items
Peer reviewed
Miguel A. García-Pérez – Educational and Psychological Measurement, 2024
A recurring question regarding Likert items is whether the discrete steps that this response format allows represent constant increments along the underlying continuum. This question appears unsolvable because Likert responses carry no direct information to this effect. Yet, any item administered in Likert format can identically be administered…
Descriptors: Likert Scales, Test Construction, Test Items, Item Analysis
Peer reviewed
Huang, Sijia; Luo, Jinwen; Cai, Li – Educational and Psychological Measurement, 2023
Random item effects item response theory (IRT) models, which treat both person and item effects as random, have received much attention for more than a decade. The random item effects approach has several advantages in many practical settings. The present study introduced an explanatory multidimensional random item effects rating scale model. The…
Descriptors: Rating Scales, Item Response Theory, Models, Test Items
Peer reviewed
Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025
Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…
Descriptors: Scores, Test Theory, Test Items, Testing
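Haberman's classical-test-theory criterion compares how well the observed subscore and the observed total score each predict the true subscore, via proportional reduction in mean squared error (PRMSE). A minimal sketch follows; the function names are illustrative, and the inputs (subscore reliability, observed subscore–total correlation) are assumed to be estimated elsewhere:

```python
import math

def prmse_from_subscore(subscore_reliability):
    # Under classical test theory, the PRMSE of predicting the true
    # subscore from the observed subscore equals the subscore's reliability.
    return subscore_reliability

def prmse_from_total(corr_sub_total, subscore_reliability):
    # Squared correlation between the TRUE subscore and the observed total:
    # disattenuate the observed correlation for subscore unreliability, then square.
    return (corr_sub_total / math.sqrt(subscore_reliability)) ** 2

def has_added_value(subscore_reliability, corr_sub_total):
    # Haberman's rule: report the subscore only if it beats the total score.
    return prmse_from_subscore(subscore_reliability) > prmse_from_total(
        corr_sub_total, subscore_reliability)
```

For instance, a subscore with reliability 0.80 that correlates 0.85 with the total score has no added value, because the total score alone predicts the true subscore better.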
Peer reviewed
Weese, James D.; Turner, Ronna C.; Ames, Allison; Crawford, Brandon; Liang, Xinya – Educational and Psychological Measurement, 2022
A simulation study was conducted to investigate the heuristics of the SIBTEST procedure and how it compares with ETS classification guidelines used with the Mantel-Haenszel procedure. Prior heuristics have been used for nearly 25 years, but they are based on a simulation study that was restricted due to computer limitations and that modeled item…
Descriptors: Test Bias, Heuristics, Classification, Statistical Analysis
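The ETS classification guidelines referenced here sort items into DIF categories A, B, and C on the Mantel-Haenszel delta scale. A simplified sketch, using only the magnitude cutoffs (the full guidelines also require significance tests, which are omitted here):

```python
import math

def mh_d_dif(odds_ratio):
    # ETS delta metric: D-DIF = -2.35 * ln(alpha_MH),
    # where alpha_MH is the Mantel-Haenszel common odds ratio.
    return -2.35 * math.log(odds_ratio)

def ets_category(d_dif):
    # Simplified A/B/C rules on |D-DIF| magnitude only.
    a = abs(d_dif)
    if a < 1.0:
        return "A"   # negligible DIF
    if a < 1.5:
        return "B"   # slight-to-moderate DIF
    return "C"       # moderate-to-large DIF
```

An odds ratio of 1.0 (no DIF) maps to D-DIF of 0 and category A; an odds ratio of 2.0 maps to roughly -1.63 and category C.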
Peer reviewed
Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Educational and Psychological Measurement, 2022
This study offers an approach to testing for differential item functioning (DIF) in a recently developed measurement framework, referred to as "D"-scoring method (DSM). Under the proposed approach, called "P-Z" method of testing for DIF, the item response functions of two groups (reference and focal) are compared by…
Descriptors: Test Bias, Methods, Test Items, Scoring
Peer reviewed
Jiang, Zhehan; Han, Yuting; Xu, Lingling; Shi, Dexin; Liu, Ren; Ouyang, Jinying; Cai, Fen – Educational and Psychological Measurement, 2023
The responses that are absent in the nonequivalent groups with anchor test (NEAT) design can be treated as a planned missing-data scenario. In the context of small sample sizes, we present a machine learning (ML)-based imputation technique called chaining random forests (CRF) to perform equating tasks within the NEAT design. Specifically, seven…
Descriptors: Test Items, Equated Scores, Sample Size, Artificial Intelligence
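The chaining idea behind CRF follows the general MICE pattern: initialize missing cells, then repeatedly revisit each incomplete column and re-predict its missing entries from the other columns. The sketch below shows only that loop structure; a trivial 1-nearest-neighbour lookup stands in for the per-column random forests the paper actually fits:

```python
def column_means(data):
    # Mean of observed (non-None) entries per column.
    means = []
    for j in range(len(data[0])):
        obs = [row[j] for row in data if row[j] is not None]
        means.append(sum(obs) / len(obs))
    return means

def chained_impute(data, n_iter=3):
    # MICE-style chaining sketch: fill missing cells (None) with column
    # means, then iteratively re-predict each missing cell from the other
    # columns using the closest complete-looking row.
    means = column_means(data)
    filled = [[v if v is not None else means[j]
               for j, v in enumerate(row)] for row in data]
    missing = [(i, j) for i, row in enumerate(data)
               for j, v in enumerate(row) if v is None]
    for _ in range(n_iter):
        for i, j in missing:
            best, best_d = None, float("inf")
            for k, other in enumerate(filled):
                if k == i:
                    continue
                # Distance on all columns except the one being imputed.
                d = sum((other[c] - filled[i][c]) ** 2
                        for c in range(len(other)) if c != j)
                if d < best_d:
                    best, best_d = other, d
            filled[i][j] = best[j]
    return filled
```

In a NEAT design the anchor items play the role of the always-observed columns linking the two groups' item responses.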
Peer reviewed
Kuan-Yu Jin; Thomas Eckes – Educational and Psychological Measurement, 2024
Insufficient effort responding (IER) refers to a lack of effort when answering survey or questionnaire items. Such items typically offer more than two ordered response categories, with Likert-type scales as the most prominent example. The underlying assumption is that the successive categories reflect increasing levels of the latent variable…
Descriptors: Item Response Theory, Test Items, Test Wiseness, Surveys
Peer reviewed
Finch, W. Holmes – Educational and Psychological Measurement, 2023
Psychometricians have devoted much research and attention to categorical item responses, leading to the development and widespread use of item response theory for the estimation of model parameters and identification of items that do not perform in the same way for examinees from different population subgroups (e.g., differential item functioning…
Descriptors: Test Bias, Item Response Theory, Computation, Methods
Peer reviewed
Wu, Tong; Kim, Stella Y.; Westine, Carl – Educational and Psychological Measurement, 2023
For large-scale assessments, data are often collected with missing responses. Despite the wide use of item response theory (IRT) in many testing programs, however, the existing literature offers little insight into the effectiveness of various approaches to handling missing responses in the context of scale linking. Scale linking is commonly used…
Descriptors: Data Analysis, Responses, Statistical Analysis, Measurement
Peer reviewed
Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024
Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…
Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales
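Before fitting an IRT model for ERS, a simple descriptive index is often computed: the share of a respondent's answers that fall in the endpoint categories. A minimal sketch (the models compared in the abstract instead estimate ERS as a latent trait):

```python
def extreme_response_rate(responses, n_categories=5):
    # Proportion of answers in the lowest or highest category of a
    # Likert-type scale coded 1..n_categories; a crude ERS indicator.
    extremes = sum(1 for r in responses if r == 1 or r == n_categories)
    return extremes / len(responses)
```

A respondent answering [1, 5, 3, 5, 2] on a 5-point scale gets a rate of 0.6, suggesting possible ERS worth modeling.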
Peer reviewed
Franz Classe; Christoph Kern – Educational and Psychological Measurement, 2024
We develop a "latent variable forest" (LV Forest) algorithm for the estimation of latent variable scores with one or more latent variables. LV Forest estimates unbiased latent variable scores based on "confirmatory factor analysis" (CFA) models with ordinal and/or numerical response variables. Through parametric model…
Descriptors: Algorithms, Item Response Theory, Artificial Intelligence, Factor Analysis
Peer reviewed
Kam, Chester Chun Seng – Educational and Psychological Measurement, 2023
When constructing measurement scales, regular and reversed items are often used (e.g., "I am satisfied with my job"/"I am not satisfied with my job"). Some methodologists recommend excluding reversed items because they are more difficult to understand and therefore engender a second, artificial factor distinct from the…
Descriptors: Test Items, Difficulty Level, Test Construction, Construct Validity
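Reversed items like the example in the abstract are typically recoded before scoring so that higher values consistently indicate more of the construct. The standard recode on a bounded Likert scale:

```python
def reverse_code(response, low=1, high=5):
    # Map a reversed item onto the regular direction:
    # on a 1-5 scale, 1 <-> 5, 2 <-> 4, 3 stays 3.
    return low + high - response
```

A "strongly agree" (5) on "I am not satisfied with my job" thus scores the same as a "strongly disagree" (1) on the regular item.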
Peer reviewed
Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024
Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…
Descriptors: Influences, Models, Measurement Techniques, Reliability
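For a one-factor model with uncorrelated errors and factor variance fixed at 1, coefficient omega has a closed form in the factor loadings and error variances. A minimal sketch of that standard formula (model estimation itself is assumed done elsewhere):

```python
def coefficient_omega(loadings, error_variances):
    # omega = (sum of loadings)^2 /
    #         ((sum of loadings)^2 + sum of error variances)
    # Valid for a unidimensional congeneric model with uncorrelated errors.
    num = sum(loadings) ** 2
    return num / (num + sum(error_variances))
```

As the abstract notes, this estimate inherits the model's assumptions: if the one-factor model is misspecified, the loadings and error variances, and hence omega, are off.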
Peer reviewed
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025
To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…
Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory
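Effort-moderated scoring treats responses faster than an item-level rapid-guessing time threshold as not administered rather than as incorrect. A sketch of that core step, with illustrative function names (threshold-setting methods vary and are not shown):

```python
def em_score_items(responses, response_times, thresholds):
    # Responses given faster than the item's rapid-guessing threshold
    # are flagged as not administered (None) instead of scored 0/1.
    return [None if t < th else r
            for r, t, th in zip(responses, response_times, thresholds)]

def em_proportion_correct(scored):
    # Proportion correct over effortful (non-flagged) responses only.
    kept = [r for r in scored if r is not None]
    return sum(kept) / len(kept)
```

The simulation in the abstract probes what happens when this unidimensional procedure meets rapid guessing that is itself multidimensional.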