ERIC - Search Results

Publication Date

In 2025	3
Since 2024	13
Since 2021 (last 5 years)	38
Since 2016 (last 10 years)	75
Since 2006 (last 20 years)	125

Descriptor

Item Analysis	177
Simulation	177
Test Items	92
Item Response Theory	68
Comparative Analysis	42
Computer Assisted Testing	39
Models	37
Evaluation Methods	36
Statistical Analysis	35
Error of Measurement	34
Sample Size	32
Correlation	27
Adaptive Testing	26
Goodness of Fit	26
Difficulty Level	23
Test Construction	21
Mathematical Models	19
Test Bias	19
Bayesian Statistics	18
Scores	18
Latent Trait Theory	17
Scoring	17
Factor Analysis	16
Foreign Countries	16
Test Reliability	16

Publication Type

Journal Articles	135
Reports - Research	131
Reports - Evaluative	20
Reports - Descriptive	12
Speeches/Meeting Papers	10
Dissertations/Theses -…	6
Tests/Questionnaires	3
Information Analyses	2
Numerical/Quantitative Data	1
Opinion Papers	1
Reports - General	1

Education Level

Secondary Education	9
Higher Education	6
Elementary Secondary Education	5
Elementary Education	3
Postsecondary Education	3
High Schools	2
Adult Education	1
Grade 12	1
Grade 4	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1

Audience

Researchers	3
Practitioners	2

Location

Canada	2
Canada (Montreal)	1
Florida	1
Israel	1
Japan	1
Minnesota	1
South Korea	1
Turkey	1
United States	1

Assessments and Surveys

Program for International…	5
Trends in International…	4
National Assessment of…	2
Test of English as a Foreign…	2
Big Five Inventory	1
California Achievement Tests	1
Comprehensive Tests of Basic…	1
Florida Comprehensive…	1
National Longitudinal Study…	1
SAT (College Admission Test)	1
Stanford Binet Intelligence…	1
Teaching and Learning…	1
Wechsler Adult Intelligence…	1

Showing 1 to 15 of 177 results

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Estimating Reliability for Response-Time Difference Measures: Toward a Standardized, Model-Based Approach

Peer reviewed

Bronson Hui; Zhiyi Wu – Studies in Second Language Acquisition, 2024

A slowdown or a speedup in response times across experimental conditions can be taken as evidence of online deployment of knowledge. However, response-time difference measures are rarely evaluated on their reliability, and there is no standard practice to estimate it. In this article, we used three open data sets to explore an approach to…

Descriptors: Reliability, Reaction Time, Psychometrics, Criticism

Conceptualizing Correlated Residuals as Item-Level Method Effects in Confirmatory Factor Analysis

Peer reviewed

Karl Schweizer; Andreas Gold; Dorothea Krampen; Stefan Troche – Educational and Psychological Measurement, 2024

Conceptualizing two-variable disturbances preventing good model fit in confirmatory factor analysis as item-level method effects instead of correlated residuals avoids violating the principle that residual variation is unique for each item. The possibility of representing such a disturbance by a method factor of a bifactor measurement model was…

Descriptors: Correlation, Factor Analysis, Measurement Techniques, Item Analysis

Why Forced-Choice and Likert Items Provide the Same Information on Personality, including Social Desirability

Peer reviewed

Martin Bäckström; Fredrik Björklund – Educational and Psychological Measurement, 2024

The forced-choice response format is often considered superior to the standard Likert-type format for controlling social desirability in personality inventories. We performed simulations and found that the trait information based on the two formats converges when the number of items is high and forced-choice items are mixed with regard to…

Descriptors: Likert Scales, Item Analysis, Personality Traits, Personality Measures

Bayesian Diagnostic Classification Models for a Partially Known Q-Matrix

Peer reviewed

Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025

This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…

Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods

Wald χ² Test for Differential Item Functioning Detection with Polytomous Items in Multilevel Data

Peer reviewed

Sijia Huang; Dubravka Svetina Valdivia – Educational and Psychological Measurement, 2024

Identifying items with differential item functioning (DIF) in an assessment is a crucial step for achieving equitable measurement. One critical issue that has not been fully addressed with existing studies is how DIF items can be detected when data are multilevel. In the present study, we introduced a Lord's Wald χ² test-based…

Descriptors: Item Analysis, Item Response Theory, Algorithms, Accuracy

Latent Class Analysis with Measurement Invariance Testing: Simulation Study to Compare Overall Likelihood Ratio vs Residual Fit Statistics Based Model Selection

Peer reviewed

Zsuzsa Bakk – Structural Equation Modeling: A Multidisciplinary Journal, 2024

A standard assumption of latent class (LC) analysis is conditional independence; that is, the items of the LC are independent of the covariates given the LCs. Several approaches have been proposed for identifying violations of this assumption. The recently proposed likelihood ratio approach is compared to residual statistics (bivariate residuals…

Descriptors: Goodness of Fit, Error of Measurement, Comparative Analysis, Models

An Investigation of the Nature and Consequence of the Relationship between IRT Difficulty and Discrimination

Peer reviewed

Sweeney, Sandra M.; Sinharay, Sandip; Johnson, Matthew S.; Steinhauer, Eric W. – Educational Measurement: Issues and Practice, 2022

The focus of this paper is on the empirical relationship between item difficulty and item discrimination. Two studies--an empirical investigation and a simulation study--were conducted to examine the association between item difficulty and item discrimination under classical test theory and item response theory (IRT), and the effects of the…

Descriptors: Correlation, Item Response Theory, Item Analysis, Difficulty Level

Rotation Local Solutions in Multidimensional Item Response Theory Models

Peer reviewed

Hoang V. Nguyen; Niels G. Waller – Educational and Psychological Measurement, 2024

We conducted an extensive Monte Carlo study of factor-rotation local solutions (LS) in multidimensional, two-parameter logistic (M2PL) item response models. In this study, we simulated more than 19,200 data sets that were drawn from 96 model conditions and performed more than 7.6 million rotations to examine the influence of (a) slope parameter…

Descriptors: Monte Carlo Methods, Item Response Theory, Correlation, Error of Measurement

Identifying Response Styles Using Person Fit Analysis and Response-Styles Models

Peer reviewed

Wind, Stefanie A.; Ge, Yuan – Measurement: Interdisciplinary Research and Perspectives, 2023

In selected-response assessments such as attitude surveys with Likert-type rating scales, examinees often select from rating scale categories to reflect their locations on a construct. Researchers have observed that some examinees exhibit "response styles," which are systematic patterns of responses in which examinees are more likely to…

Descriptors: Goodness of Fit, Responses, Likert Scales, Models

Small-Variance Priors in Bayesian Factor Analysis with Ordinal Data

Peer reviewed

Liang, Xinya; Cao, Chunhua – Journal of Experimental Education, 2023

To evaluate multidimensional factor structure, a popular method that combines features of confirmatory and exploratory factor analysis is Bayesian structural equation modeling with small-variance normal priors (BSEM-N). This simulation study evaluated BSEM-N as a variable selection and parameter estimation tool in factor analysis with sparse…

Descriptors: Factor Analysis, Bayesian Statistics, Structural Equation Models, Simulation

Comparison of Item Response Theory Ability and Item Parameters According to Classical and Bayesian Estimation Methods

Peer reviewed

Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024

This research aims to compare the ability and item parameter estimations of Item Response Theory according to maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on the changes in the prior distribution type, sample size, test length, and logistic model, the ability and item…

Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation

Identifying Problematic Item Characteristics with Small Samples Using Mokken Scale Analysis

Peer reviewed

Wind, Stefanie A. – Educational and Psychological Measurement, 2022

Researchers frequently use Mokken scale analysis (MSA), which is a nonparametric approach to item response theory, when they have relatively small samples of examinees. Researchers have provided some guidance regarding the minimum sample size for applications of MSA under various conditions. However, these studies have not focused on item-level…

Descriptors: Nonparametric Statistics, Item Response Theory, Sample Size, Test Items

Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses

Peer reviewed

Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023

Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines

There Are Many Greater Lower Bounds than Cronbach's α: A Monte Carlo Simulation Study

Peer reviewed

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of α, λ2, λ_4, λ_2, ω_T, GLB_MRFA, and GLB_Algebraic coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

Source

Educational and Psychological…	23
Journal of Educational…	22
Applied Psychological…	12
Journal of Educational and…	10
ETS Research Report Series	7
International Journal of…	6
Psychometrika	6
Multivariate Behavioral…	5
ProQuest LLC	5
Applied Measurement in…	4
Structural Equation Modeling:…	4
Measurement:…	3
Educational Sciences: Theory…	2
Grantee Submission	2
IEEE Transactions on Learning…	2
International Journal of…	2
Journal of Educational Data…	2
Journal of Educational…	2
Journal of Experimental…	2
Journal of Experimental…	2
Large-scale Assessments in…	2
Practical Assessment,…	2
Advances in Health Sciences…	1
Athletic Training Education…	1
Autism: The International…	1

Author

Wang, Wen-Chung	5
Weiss, David J.	5
Reckase, Mark D.	4
Rutkowski, Leslie	3
Wilson, Mark	3
Wind, Stefanie A.	3
Chang, Hua-Hua	2
Cho, Sun-Joo	2
Chun Wang	2
Dinero, Thomas E.	2
Guo, Hongwen	2
Haertel, Edward	2
Huang, Hung-Yu	2
Ishii, Takatoshi	2
Liaw, Yuan-Ling	2
Paek, Insu	2
Pelánek, Radek	2
Pine, Steven M.	2
Roussos, Louis A.	2
Rutkowski, David	2
Sinharay, Sandip	2
Svetina, Dubravka	2
Ueno, Maomi	2
Wainer, Howard	2