ERIC - Search Results

Publication Date

In 2025	4
Since 2024	10

Descriptor

Psychometrics	10
Simulation	10
Item Response Theory	6
Computation	4
Models	4
Accuracy	3
Algorithms	3
Error of Measurement	3
Item Analysis	3
Measurement	3
Test Items	3
Data Analysis	2
Evaluation Methods	2
Factor Analysis	2
Foreign Countries	2
Matrices	2
Measurement Techniques	2
Reliability	2
Sample Size	2
Sampling	2
Achievement Tests	1
Construct Validity	1
Correlation	1
Cost Effectiveness	1
Creativity	1
More ▼

Source

Grantee Submission	3
Creativity Research Journal	1
Educational Process:…	1
Educational and Psychological…	1
Measurement:…	1
Psychology Learning and…	1
Structural Equation Modeling:…	1
Studies in Second Language…	1

Publication Type

Reports - Research	10
Journal Articles	7

Education Level

Elementary Secondary Education	1
Higher Education	1
Postsecondary Education	1

Audience

Location

Australia	1
Germany	1
North America	1
Sweden	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

Big Five Inventory	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Improving the Use of Parallel Analysis by Accounting for Sampling Variability of the Observed Correlation Matrix

Peer reviewed

Direct link

Yan Xia; Xinchang Zhou – Educational and Psychological Measurement, 2025

Parallel analysis has been considered one of the most accurate methods for determining the number of factors in factor analysis. One major advantage of parallel analysis over traditional factor retention methods (e.g., Kaiser's rule) is that it addresses the sampling variability of eigenvalues obtained from the identity matrix, representing the…

Descriptors: Factor Analysis, Statistical Analysis, Evaluation Methods, Sampling

Estimating Reliability for Response-Time Difference Measures: Toward a Standardized, Model-Based Approach

Peer reviewed

Direct link

Bronson Hui; Zhiyi Wu – Studies in Second Language Acquisition, 2024

A slowdown or a speedup in response times across experimental conditions can be taken as evidence of online deployment of knowledge. However, response-time difference measures are rarely evaluated on their reliability, and there is no standard practice to estimate it. In this article, we used three open data sets to explore an approach to…

Descriptors: Reliability, Reaction Time, Psychometrics, Criticism

The Accuracy of Estimating Parameters of Multiple-Choice Test Items, Following Item-Response Theory: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025

Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…

Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items

The Psychometric Quality of Objective Structured Clinical Examinations within Psychology Programs: A Systematic Review

Peer reviewed

Direct link

Azaan Vhora; Ryan L. Davies; Kylie Rice – Psychology Learning and Teaching, 2024

Background: Objective Structured Clinical Examinations (OSCEs) are a simulation-based assessment tool used extensively in medical education for evaluating clinical competence. OSCEs are widely regarded as more valid, reliable, and valuable compared to traditional assessment measures, and are now emerging within professional psychology training…

Descriptors: Psychology, Higher Education, Psychometrics, Objective Tests

Exploring the Effects of Collapsing Rating Scale Categories in Polytomous Item Response Theory Analyses: An Illustration and Simulation Study

Peer reviewed

Direct link

Chia-Lin Tsai; Stefanie Wind; Samantha Estrada – Measurement: Interdisciplinary Research and Perspectives, 2025

Researchers who work with ordinal rating scales sometimes encounter situations where the scale categories do not function in the intended or expected way. For example, participants' use of scale categories may result in an empirical difficulty ordering for the categories that does not match what was intended. Likewise, the level of distinction…

Descriptors: Rating Scales, Item Response Theory, Psychometrics, Self Efficacy

Planning Missing Data Designs for Human Ratings in Creativity Research: A Practical Guide

Peer reviewed

Direct link

Boris Forthmann; Benjamin Goecke; Roger E. Beaty – Creativity Research Journal, 2025

Human ratings are ubiquitous in creativity research. Yet, the process of rating responses to creativity tasks -- typically several hundred or thousands of responses, per rater -- is often time-consuming and expensive. Planned missing data designs, where raters only rate a subset of the total number of responses, have been recently proposed as one…

Descriptors: Creativity, Research, Researchers, Research Methodology

Multi-Group Regularized Gaussian Variational Estimation: Fast Detection of DIF

Peer reviewed

Direct link

Weicong Lyu; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Data harmonization is an emerging approach to strategically combining data from multiple independent studies, enabling addressing new research questions that are not answerable by a single contributing study. A fundamental psychometric challenge for data harmonization is to create commensurate measures for the constructs of interest across…

Descriptors: Data Analysis, Test Items, Psychometrics, Item Response Theory

Does Acquiescence Disagree with Measurement Invariance Testing?

Peer reviewed

Direct link

E. Damiano D'Urso; Jesper Tijmstra; Jeroen K. Vermunt; Kim De Roover – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Measurement invariance (MI) is required for validly comparing latent constructs measured by multiple ordinal self-report items. Non-invariances may occur when disregarding (group differences in) an acquiescence response style (ARS; an agreeing tendency regardless of item content). If non-invariance results solely from neglecting ARS, one should…

Descriptors: Error of Measurement, Structural Equation Models, Construct Validity, Measurement Techniques

A Note on Improving Variational Estimation for Multidimensional Item Response Theory

Peer reviewed

Direct link

Chenchen Ma; Jing Ouyang; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Survey instruments and assessments are frequently used in many domains of social science. When the constructs that these assessments try to measure become multifaceted, multidimensional item response theory (MIRT) provides a unified framework and convenient statistical tool for item analysis, calibration, and scoring. However, the computational…

Descriptors: Algorithms, Item Response Theory, Scoring, Accuracy

Variational Estimation for Multidimensional Generalized Partial Credit Model

Peer reviewed

Direct link

Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…

Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics

Chun Wang	3
Gongjun Xu	3
Aiman Mohammad Freihat	1
Azaan Vhora	1
Benjamin Goecke	1
Boris Forthmann	1
Bronson Hui	1
Chenchen Ma	1
Chengyu Cui	1
Chia-Lin Tsai	1
E. Damiano D'Urso	1
Jeroen K. Vermunt	1
Jesper Tijmstra	1
Jing Ouyang	1
Kim De Roover	1
Kylie Rice	1
Omar Saleh Bani Yassin	1
Roger E. Beaty	1
Ryan L. Davies	1
Samantha Estrada	1
Stefanie Wind	1
Weicong Lyu	1
Xinchang Zhou	1
Yan Xia	1
Zhiyi Wu	1
More ▼