ERIC - Search Results

Publication Date

In 2025	3
Since 2024	26
Since 2021 (last 5 years)	85
Since 2016 (last 10 years)	178
Since 2006 (last 20 years)	362

Descriptor

Item Response Theory	390
Test Items	181
Models	133
Statistical Analysis	94
Simulation	93
Error of Measurement	89
Scores	86
Comparative Analysis	84
Computation	84
Psychometrics	76
Correlation	70
Test Bias	69
Item Analysis	63
Test Theory	63
Test Reliability	62
Foreign Countries	60
Sample Size	58
Factor Analysis	55
Goodness of Fit	54
Monte Carlo Methods	52
Accuracy	50
Bayesian Statistics	50
Difficulty Level	47
Measurement Techniques	46
Evaluation Methods	45

Source

Educational and Psychological…	564

Publication Type

Journal Articles	540
Reports - Research	387
Reports - Evaluative	118
Reports - Descriptive	34
Speeches/Meeting Papers	9
Opinion Papers	4
Guides - Non-Classroom	3
Book/Product Reviews	1
Historical Materials	1
Information Analyses	1

Education Level

Higher Education	30
Postsecondary Education	22
Secondary Education	22
Elementary Education	18
Middle Schools	15
Junior High Schools	13
Grade 4	11
Elementary Secondary Education	10
Grade 7	10
High Schools	9
Intermediate Grades	8
Grade 5	7
Grade 6	7
Grade 8	7
Grade 3	6
Grade 9	6
Early Childhood Education	5
Grade 10	4
Primary Education	3
Grade 2	2
Preschool Education	2
Grade 11	1
Kindergarten	1

Audience

Practitioners	2
Researchers	1
Students	1
Teachers	1

Location

Germany	8
Taiwan	7
Australia	6
Canada	6
United Kingdom	5
Georgia	4
United States	4
Florida	3
Netherlands	3
South Korea	3
California	2
China	2
Colombia	2
Hong Kong	2
Illinois (Chicago)	2
India	2
Ireland	2
Japan	2
Pennsylvania	2
Saudi Arabia	2
Spain	2
Belgium	1
California (Los Angeles)	1
Colorado (Denver)	1
Costa Rica	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

What Works Clearinghouse Rating

Educational and Psychological Measurement

Showing 1 to 15 of 564 results

The Accuracy of Bayesian Model Fit Indices in Selecting among Multidimensional Item Response Theory Models

Peer reviewed

Ken A. Fujimoto; Carl F. Falk – Educational and Psychological Measurement, 2024

Item response theory (IRT) models are often compared with respect to predictive performance to determine the dimensionality of rating scale data. However, such model comparisons could be biased toward nested-dimensionality IRT models (e.g., the bifactor model) when comparing those models with non-nested-dimensionality IRT models (e.g., a…

Descriptors: Item Response Theory, Rating Scales, Predictive Measurement, Bayesian Statistics

Detecting Rating Scale Malfunctioning with the Partial Credit Model and Generalized Partial Credit Model

Peer reviewed

Wind, Stefanie A. – Educational and Psychological Measurement, 2023

Rating scale analysis techniques provide researchers with practical tools for examining the degree to which ordinal rating scales (e.g., Likert-type scales or performance assessment rating scales) function in psychometrically useful ways. When rating scales function as expected, researchers can interpret ratings in the intended direction (i.e.,…

Descriptors: Rating Scales, Testing Problems, Item Response Theory, Models

Are the Steps on Likert Scales Equidistant? Responses on Visual Analog Scales Allow Estimating Their Distances

Peer reviewed

Miguel A. García-Pérez – Educational and Psychological Measurement, 2024

A recurring question regarding Likert items is whether the discrete steps that this response format allows represent constant increments along the underlying continuum. This question appears unsolvable because Likert responses carry no direct information to this effect. Yet, any item administered in Likert format can identically be administered…

Descriptors: Likert Scales, Test Construction, Test Items, Item Analysis

A Comparison of Response Time Threshold Scoring Procedures in Mitigating Bias from Rapid Guessing Behavior

Peer reviewed

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2024

Rapid guessing (RG) is a form of non-effortful responding that is characterized by short response latencies. This construct-irrelevant behavior has been shown in previous research to bias inferences concerning measurement properties and scores. To mitigate these deleterious effects, a number of response time threshold scoring procedures have been…

Descriptors: Reaction Time, Scores, Item Response Theory, Guessing (Tests)

Artificial Neural Networks for Short-Form Development of Psychometric Tests: A Study on Synthetic Populations Using Autoencoders

Peer reviewed

Monica Casella; Pasquale Dolce; Michela Ponticorvo; Nicola Milano; Davide Marocco – Educational and Psychological Measurement, 2024

Short-form development is an important topic in psychometric research, which requires researchers to face methodological choices at different steps. The statistical techniques traditionally used for shortening tests, which belong to the so-called exploratory model, make assumptions not always verified in psychological data. This article proposes a…

Descriptors: Artificial Intelligence, Test Construction, Test Format, Psychometrics

An Explanatory Multidimensional Random Item Effects Rating Scale Model

Peer reviewed

Huang, Sijia; Luo, Jinwen; Cai, Li – Educational and Psychological Measurement, 2023

Random item effects item response theory (IRT) models, which treat both person and item effects as random, have received much attention for more than a decade. The random item effects approach has several advantages in many practical settings. The present study introduced an explanatory multidimensional random item effects rating scale model. The…

Descriptors: Rating Scales, Item Response Theory, Models, Test Items

Added Value of Subscores for Tests with Polytomous Items

Peer reviewed

Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025

Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…

Descriptors: Scores, Test Theory, Test Items, Testing

The Impact of Insufficient Effort Responses on the Order of Category Thresholds in the Polytomous Rasch Model

Peer reviewed

Kuan-Yu Jin; Thomas Eckes – Educational and Psychological Measurement, 2024

Insufficient effort responding (IER) refers to a lack of effort when answering survey or questionnaire items. Such items typically offer more than two ordered response categories, with Likert-type scales as the most prominent example. The underlying assumption is that the successive categories reflect increasing levels of the latent variable…

Descriptors: Item Response Theory, Test Items, Test Wiseness, Surveys

The Impact and Detection of Uniform Differential Item Functioning for Continuous Item Response Models

Peer reviewed

Finch, W. Holmes – Educational and Psychological Measurement, 2023

Psychometricians have devoted much research and attention to categorical item responses, leading to the development and widespread use of item response theory for the estimation of model parameters and identification of items that do not perform in the same way for examinees from different population subgroups (e.g., differential item functioning…

Descriptors: Test Bias, Item Response Theory, Computation, Methods

Evaluating the Effects of Missing Data Handling Methods on Scale Linking Accuracy

Peer reviewed

Wu, Tong; Kim, Stella Y.; Westine, Carl – Educational and Psychological Measurement, 2023

For large-scale assessments, data are often collected with missing responses. Despite the wide use of item response theory (IRT) in many testing programs, however, the existing literature offers little insight into the effectiveness of various approaches to handling missing responses in the context of scale linking. Scale linking is commonly used…

Descriptors: Data Analysis, Responses, Statistical Analysis, Measurement

Correcting for Extreme Response Style: Model Choice Matters

Peer reviewed

Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024

Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…

Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales

Latent Variable Forests for Latent Variable Score Estimation

Peer reviewed

Franz Classe; Christoph Kern – Educational and Psychological Measurement, 2024

We develop a "latent variable forest" (LV Forest) algorithm for the estimation of latent variable scores with one or more latent variables. LV Forest estimates unbiased latent variable scores based on "confirmatory factor analysis" (CFA) models with ordinal and/or numerical response variables. Through parametric model…

Descriptors: Algorithms, Item Response Theory, Artificial Intelligence, Factor Analysis

Wald χ² Test for Differential Item Functioning Detection with Polytomous Items in Multilevel Data

Peer reviewed

Sijia Huang; Dubravka Svetina Valdivia – Educational and Psychological Measurement, 2024

Identifying items with differential item functioning (DIF) in an assessment is a crucial step for achieving equitable measurement. One critical issue that has not been fully addressed with existing studies is how DIF items can be detected when data are multilevel. In the present study, we introduced a Lord's Wald χ² test-based…

Descriptors: Item Analysis, Item Response Theory, Algorithms, Accuracy

Item Parameter Recovery: Sensitivity to Prior Distribution

Peer reviewed

Christine E. DeMars; Paulius Satkus – Educational and Psychological Measurement, 2024

Marginal maximum likelihood, a common estimation method for item response theory models, is not inherently a Bayesian procedure. However, due to estimation difficulties, Bayesian priors are often applied to the likelihood when estimating 3PL models, especially with small samples. Little focus has been placed on choosing the priors for marginal…

Descriptors: Item Response Theory, Statistical Distributions, Error of Measurement, Bayesian Statistics

A Monte Carlo Study of Confidence Interval Methods for Generalizability Coefficient

Peer reviewed

Jiang, Zhehan; Raymond, Mark; DiStefano, Christine; Shi, Dexin; Liu, Ren; Sun, Junhua – Educational and Psychological Measurement, 2022

Computing confidence intervals around generalizability coefficients has long been a challenging task in generalizability theory. This is a serious practical problem because generalizability coefficients are often computed from designs where some facets have small sample sizes, and researchers have little guide regarding the trustworthiness of the…

Descriptors: Monte Carlo Methods, Intervals, Generalizability Theory, Error of Measurement

Author

Wang, Wen-Chung	17
Marcoulides, George A.	12
Dimitrov, Dimiter M.	10
Raykov, Tenko	9
Cai, Li	8
DeMars, Christine E.	8
Ferrando, Pere J.	8
Engelhard, George, Jr.	7
Wind, Stefanie A.	7
Andrich, David	6
Strobl, Carolin	6
Zumbo, Bruno D.	6
Dodd, Barbara G.	5
Finch, W. Holmes	5
Huang, Hung-Yu	5
Huggins-Manley, Anne Corinne	5
Jiang, Zhehan	5
Liu, Ren	5
Ludlow, Larry H.	5
Powers, Stephen	5
Rupp, Andre A.	5
Smith, Richard M.	5
Stone, Clement A.	5
Wilson, Mark	5

Assessments and Surveys

Program for International…	7
SAT (College Admission Test)	5
Trends in International…	5
Law School Admission Test	4
Graduate Record Examinations	3
Eysenck Personality Inventory	2
Georgia Criterion Referenced…	2
Holland Vocational Preference…	2
Iowa Tests of Educational…	2
Learning Style Inventory	2
Raven Progressive Matrices	2
Rotter Internal External…	2
Test of English as a Foreign…	2
Wechsler Adult Intelligence…	2
ACT Assessment	1
Advanced Placement…	1
Attribution Style…	1
Brief Symptom Inventory	1
California Psychological…	1
Center for Epidemiologic…	1
Childrens Depression Inventory	1
Childrens Manifest Anxiety…	1
Cognitive Abilities Test	1
Conners Teacher Rating Scale	1
Dynamic Indicators of Basic…	1