Showing 1 to 15 of 44 results
Peer reviewed
Marcoulides, Katerina M. – Measurement: Interdisciplinary Research and Perspectives, 2023
Integrative data analysis has recently been shown to be an effective tool for researchers interested in synthesizing datasets from multiple studies in order to draw statistical or substantive conclusions. The actual process of integrating the different datasets depends on the availability of some common measures or items reflecting the same…
Descriptors: Data Analysis, Synthesis, Test Items, Simulation
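As a rough illustration of the pooling step such an analysis rests on (hypothetical item names, not taken from the article), two studies sharing anchor items can be stacked into one matrix, with study-specific items left missing:

```python
import pandas as pd

# Hypothetical example: two studies share the anchor items "anx1" and "anx2";
# each also administered its own unique item.
study_a = pd.DataFrame({"anx1": [1, 3, 2], "anx2": [2, 4, 3], "anx3_a": [1, 2, 2]})
study_b = pd.DataFrame({"anx1": [4, 2], "anx2": [3, 1], "anx4_b": [0, 1]})

# Concatenation aligns the common items by name and leaves study-specific
# items missing (NaN) -- the pooled matrix that integrative analyses model.
pooled = pd.concat([study_a.assign(study="A"), study_b.assign(study="B")],
                   ignore_index=True)
print(pooled)
```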
Peer reviewed
Yan Xia; Xinchang Zhou – Educational and Psychological Measurement, 2025
Parallel analysis has been considered one of the most accurate methods for determining the number of factors in factor analysis. One major advantage of parallel analysis over traditional factor retention methods (e.g., Kaiser's rule) is that it addresses the sampling variability of eigenvalues obtained from random data generated under an identity population correlation matrix, representing the…
Descriptors: Factor Analysis, Statistical Analysis, Evaluation Methods, Sampling
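A minimal sketch of the comparison the entry describes (not the authors' code): simulate random normal data of the same size, and retain the leading sample eigenvalues that exceed a reference quantile of the simulated eigenvalues.

```python
import numpy as np

def parallel_analysis(data, n_sims=500, quantile=0.95, seed=0):
    """Suggest the number of factors: keep leading eigenvalues of the sample
    correlation matrix that exceed the chosen quantile of eigenvalues from
    random normal data of the same size (whose population correlation matrix
    is the identity, i.e., no common factors)."""
    rng = np.random.default_rng(seed)
    n, p = data.shape
    obs = np.linalg.eigvalsh(np.corrcoef(data, rowvar=False))[::-1]
    sims = np.empty((n_sims, p))
    for s in range(n_sims):
        x = rng.standard_normal((n, p))
        sims[s] = np.linalg.eigvalsh(np.corrcoef(x, rowvar=False))[::-1]
    ref = np.quantile(sims, quantile, axis=0)
    k = 0
    while k < p and obs[k] > ref[k]:  # count consecutive exceedances
        k += 1
    return k
```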
Peer reviewed
Bronson Hui; Zhiyi Wu – Studies in Second Language Acquisition, 2024
A slowdown or a speedup in response times across experimental conditions can be taken as evidence of online deployment of knowledge. However, response-time difference measures are rarely evaluated for their reliability, and there is no standard practice for estimating it. In this article, we used three open data sets to explore an approach to…
Descriptors: Reliability, Reaction Time, Psychometrics, Criticism
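The snippet does not say which estimator the authors explore; a common option for difference scores is permutation-based split-half reliability with a Spearman-Brown correction, sketched here under that assumption:

```python
import numpy as np

def splithalf_rt_difference(rt_a, rt_b, n_splits=1000, seed=0):
    """Split-half reliability of a response-time difference score (condition A
    minus condition B). rt_a and rt_b are lists of per-participant NumPy
    arrays of trial RTs. Returns the mean Spearman-Brown-corrected estimate
    over random splits."""
    rng = np.random.default_rng(seed)
    rs = []
    for _ in range(n_splits):
        d1, d2 = [], []
        for a, b in zip(rt_a, rt_b):
            ia, ib = rng.permutation(len(a)), rng.permutation(len(b))
            d1.append(a[ia[::2]].mean() - b[ib[::2]].mean())
            d2.append(a[ia[1::2]].mean() - b[ib[1::2]].mean())
        r = np.corrcoef(d1, d2)[0, 1]
        rs.append(2 * r / (1 + r))  # Spearman-Brown step-up
    return float(np.mean(rs))
```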
Peer reviewed
Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025
Background/purpose: This study aimed to assess how accurately multiple-choice test item parameters are estimated under item response theory models. Materials/methods: The researchers relied on measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…
Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items
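The accuracy indicator described -- the absolute difference between estimated and generating values -- reduces to a few lines; the numbers below are hypothetical, purely for illustration:

```python
import numpy as np

def recovery_accuracy(true_params, est_params):
    """Mean absolute difference between generating ("actual") and estimated
    item parameters; smaller values indicate more accurate recovery."""
    true_params = np.asarray(true_params)
    est_params = np.asarray(est_params)
    return float(np.mean(np.abs(est_params - true_params)))

# Hypothetical 2PL difficulty parameters from a simulation replication:
true_b = [-1.2, -0.4, 0.0, 0.7, 1.5]
est_b  = [-1.1, -0.5, 0.1, 0.6, 1.7]
print(recovery_accuracy(true_b, est_b))  # 0.12
```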
Peer reviewed
Azaan Vhora; Ryan L. Davies; Kylie Rice – Psychology Learning and Teaching, 2024
Background: Objective Structured Clinical Examinations (OSCEs) are a simulation-based assessment tool used extensively in medical education for evaluating clinical competence. OSCEs are widely regarded as more valid, reliable, and valuable than traditional assessment measures, and are now emerging within professional psychology training…
Descriptors: Psychology, Higher Education, Psychometrics, Objective Tests
Peer reviewed
Chia-Lin Tsai; Stefanie Wind; Samantha Estrada – Measurement: Interdisciplinary Research and Perspectives, 2025
Researchers who work with ordinal rating scales sometimes encounter situations where the scale categories do not function in the intended or expected way. For example, participants' use of scale categories may result in an empirical difficulty ordering for the categories that does not match what was intended. Likewise, the level of distinction…
Descriptors: Rating Scales, Item Response Theory, Psychometrics, Self Efficacy
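One simple diagnostic for the empirical disordering the entry mentions (an illustration, not the authors' procedure) is to check whether the average person score rises with each successive category:

```python
import numpy as np
import pandas as pd

def category_average_measures(item_responses, person_scores):
    """For one ordinal item, the mean person score among respondents choosing
    each category. If these averages do not increase with category order, the
    categories may not function as intended."""
    df = pd.DataFrame({"cat": item_responses, "score": person_scores})
    means = df.groupby("cat")["score"].mean().sort_index()
    return means, bool(means.is_monotonic_increasing)

# Hypothetical data: category 2 attracts lower-scoring respondents than 1.
resp  = [0, 0, 1, 1, 2, 2, 3, 3]
score = [-1.5, -1.0, 0.2, 0.4, -0.3, 0.0, 1.2, 1.5]
print(category_average_measures(resp, score))  # disordered -> False
```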
Peer reviewed
Boris Forthmann; Benjamin Goecke; Roger E. Beaty – Creativity Research Journal, 2025
Human ratings are ubiquitous in creativity research. Yet, the process of rating responses to creativity tasks -- typically several hundred or several thousand responses per rater -- is often time-consuming and expensive. Planned missing data designs, where raters rate only a subset of the total number of responses, have recently been proposed as one…
Descriptors: Creativity, Research, Researchers, Research Methodology
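The design itself is easy to picture: each response is assigned to a random subset of raters so that every response still receives a fixed number of ratings. A minimal sketch with hypothetical sizes:

```python
import numpy as np

def planned_missing_assignment(n_responses, n_raters, ratings_per_response,
                               seed=0):
    """Boolean design matrix: design[i, j] is True if rater j rates response
    i. Every response gets exactly `ratings_per_response` ratings, so each
    rater's workload shrinks roughly in proportion."""
    rng = np.random.default_rng(seed)
    design = np.zeros((n_responses, n_raters), dtype=bool)
    for i in range(n_responses):
        raters = rng.choice(n_raters, size=ratings_per_response, replace=False)
        design[i, raters] = True
    return design

design = planned_missing_assignment(1000, n_raters=5, ratings_per_response=2)
print(design.sum(axis=0))  # approximate per-rater workload
```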
Peer reviewed
Weicong Lyu; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Data harmonization is an emerging approach to strategically combining data from multiple independent studies, enabling researchers to address new questions that no single contributing study can answer. A fundamental psychometric challenge for data harmonization is to create commensurate measures for the constructs of interest across…
Descriptors: Data Analysis, Test Items, Psychometrics, Item Response Theory
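The entry does not say how the commensurate scales are built; one standard device (an assumption here, not necessarily the authors' method) is common-item linking, e.g., the mean/sigma transformation of IRT difficulty estimates:

```python
import numpy as np

def mean_sigma_link(b_common_x, b_common_y):
    """Mean/sigma linking constants placing study X's IRT difficulty scale
    onto study Y's, from the anchor items' difficulty estimates in the two
    separate calibrations: b_y = A * b_x + B."""
    bx, by = np.asarray(b_common_x), np.asarray(b_common_y)
    A = by.std(ddof=1) / bx.std(ddof=1)
    B = by.mean() - A * bx.mean()
    return A, B

# Hypothetical anchor-item difficulties from two independent studies:
A, B = mean_sigma_link([-0.8, 0.1, 0.9], [-0.5, 0.4, 1.3])
print(A, B)  # apply b_y = A * b_x + B to study X's remaining items
```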
Peer reviewed
Wang, Dang; Liu, Hongyun; Hau, Kit-Tai – Education and Information Technologies, 2022
Critical thinking is one of the higher-order skills most highly valued in education, yet it is hard to measure with paper-and-pencil tests. In line with recent recommendations to measure higher-order thinking skills with interactive tasks (rather than a single static set of questions), in this study we developed an interactive and automated game-based…
Descriptors: Game Based Learning, Evaluation Methods, Critical Thinking, Simulation
Peer reviewed
E. Damiano D'Urso; Jesper Tijmstra; Jeroen K. Vermunt; Kim De Roover – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Measurement invariance (MI) is required for validly comparing latent constructs measured by multiple ordinal self-report items. Non-invariance may occur when (group differences in) an acquiescence response style (ARS; a tendency to agree regardless of item content) is disregarded. If non-invariance results solely from neglecting ARS, one should…
Descriptors: Error of Measurement, Structural Equation Models, Construct Validity, Measurement Techniques
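For intuition about ARS (an illustration, not the authors' model): on a balanced scale, averaging raw responses across all items before reverse-coding cancels content and leaves the agreeing tendency:

```python
import numpy as np

def acquiescence_index(responses):
    """Simple ARS proxy for a balanced scale: per-person mean raw agreement
    across ALL items before reverse-coding. With equally many positively and
    negatively worded items, trait content roughly cancels, so a high mean
    signals an agreeing tendency. responses: persons x items, e.g., 1-5."""
    return np.asarray(responses, dtype=float).mean(axis=1)

# Hypothetical balanced 4-item scale (two items reverse-worded):
resp = np.array([[4, 4, 4, 5],   # agrees with everything -> high ARS
                 [4, 2, 2, 4]])  # consistent content responding
print(acquiescence_index(resp))  # [4.25, 3.0]
```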
Peer reviewed
Chenchen Ma; Jing Ouyang; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Survey instruments and assessments are frequently used in many domains of social science. When the constructs that these assessments try to measure become multifaceted, multidimensional item response theory (MIRT) provides a unified framework and convenient statistical tool for item analysis, calibration, and scoring. However, the computational…
Descriptors: Algorithms, Item Response Theory, Scoring, Accuracy
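The computational burden the abstract alludes to is easy to see in the standard marginal likelihood: the quadrature grid grows exponentially with the number of latent dimensions. A two-dimensional M2PL sketch (not the authors' algorithm), assuming a standard-normal latent density with independent dimensions:

```python
import numpy as np
from numpy.polynomial.hermite_e import hermegauss  # probabilists' Hermite

def m2pl_marginal_loglik(responses, a, d, n_quad=21):
    """Marginal log-likelihood of a compensatory two-dimensional 2PL (M2PL)
    via Gauss-Hermite quadrature. responses: persons x items in {0, 1};
    a: items x 2 slopes; d: item intercepts. Note the n_quad**2 grid -- the
    cost that grows exponentially with the number of dimensions."""
    nodes, weights = hermegauss(n_quad)
    weights = weights / np.sqrt(2 * np.pi)             # normalize to N(0, 1)
    t1, t2 = np.meshgrid(nodes, nodes)
    theta = np.column_stack([t1.ravel(), t2.ravel()])  # (Q, 2) grid points
    w = np.outer(weights, weights).ravel()             # (Q,) grid weights
    p = 1.0 / (1.0 + np.exp(-(theta @ a.T + d)))       # (Q, items)
    lik = np.prod(np.where(responses[:, None, :] == 1, p, 1 - p), axis=2)
    return float(np.sum(np.log(lik @ w)))              # sum over persons
```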
Peer reviewed
Cooperman, Allison W.; Weiss, David J.; Wang, Chun – Educational and Psychological Measurement, 2022
Adaptive measurement of change (AMC) is a psychometric method for measuring intra-individual change on one or more latent traits across testing occasions. Three hypothesis tests--a Z test, likelihood ratio test, and score ratio index--have demonstrated desirable statistical properties in this context, including low false positive rates and high…
Descriptors: Error of Measurement, Psychometrics, Hypothesis Testing, Simulation
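Of the three tests, the Z test is the simplest to state: the change in trait estimates divided by its standard error, assuming independent occasion-specific estimates with known standard errors (a sketch, not the article's code):

```python
import numpy as np

def amc_z_test(theta_1, se_1, theta_2, se_2):
    """Z test for intra-individual change across two testing occasions:
    |z| > 1.96 suggests significant change at the .05 level (two-tailed)."""
    return (theta_2 - theta_1) / np.sqrt(se_1**2 + se_2**2)

print(amc_z_test(theta_1=-0.3, se_1=0.32, theta_2=0.6, se_2=0.30))  # ~2.05
```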
Peer reviewed
Cheng, Ying; Shao, Can – Educational and Psychological Measurement, 2022
Computer-based and web-based testing have become increasingly popular in recent years. Their popularity has dramatically expanded the availability of response time data. Compared to conventional item response data, which are often dichotomous or polytomous, response time has the advantage of being continuous and can be collected in an…
Descriptors: Reaction Time, Test Wiseness, Computer Assisted Testing, Simulation
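The snippet cuts off before naming a model; a common choice for continuous response times is a lognormal decomposition in the spirit of van der Linden (2006), sketched below with simple moment estimates (an illustration, not the article's estimator):

```python
import numpy as np

def lognormal_rt_decomposition(rt):
    """Moment-based sketch of a lognormal response-time model:
    log RT_pi ~ beta_i - tau_p + noise. rt: persons x items array of times in
    seconds, no missing values. Returns item time intensities (beta) and
    person speeds (tau; higher = faster)."""
    log_rt = np.log(rt)
    beta = log_rt.mean(axis=0)            # item time intensity
    tau = -(log_rt - beta).mean(axis=1)   # person speed
    return beta, tau

rt = np.array([[12.1, 30.5, 8.2],
               [20.3, 55.0, 14.9]])
beta, tau = lognormal_rt_decomposition(rt)
```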
Xue Zhang; Chun Wang – Grantee Submission, 2022
Item-level fit analysis not only serves as a complementary check to global fit analysis; it is also essential in scale development because the fit results guide item revision and/or deletion (Liu & Maydeu-Olivares, 2014). During data collection, missing responses are likely to occur for various reasons. Chi-square-based item fit…
Descriptors: Goodness of Fit, Item Response Theory, Scores, Test Length
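For orientation, a complete-data version of the kind of chi-square item fit statistic the entry studies (grouping examinees by estimated ability, in the spirit of Bock's statistic); the article's question is how such statistics behave when responses are missing:

```python
import numpy as np

def chi_square_item_fit(theta, responses, irf, n_groups=10):
    """Compare observed and model-implied proportions correct within ability
    groups for one item. theta: ability estimates; responses: 0/1 answers;
    irf: callable giving P(correct | theta). Degrees of freedom are typically
    n_groups minus the number of estimated item parameters."""
    order = np.argsort(theta)
    chi2 = 0.0
    for g in np.array_split(order, n_groups):
        obs = responses[g].mean()        # observed proportion correct
        exp = irf(theta[g]).mean()       # model-implied proportion correct
        chi2 += len(g) * (obs - exp) ** 2 / (exp * (1 - exp))
    return chi2
```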
Peer reviewed
Curran, Patrick J.; Georgeson, A. R.; Bauer, Daniel J.; Hussong, Andrea M. – International Journal of Behavioral Development, 2021
Conducting valid and reliable empirical research in the prevention sciences is an inherently difficult and challenging task. Chief among these challenges is the need to obtain numerical scores of underlying theoretical constructs for use in subsequent analysis. This challenge is further exacerbated by the increasingly common need to consider multiple…
Descriptors: Psychometrics, Scoring, Prevention, Scores
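One routine way to turn item responses into the numerical construct scores the entry refers to (a sketch under an assumed one-factor model, not the authors' scoring approach) is regression-method factor scoring:

```python
import numpy as np

def regression_factor_scores(data, loadings, uniquenesses):
    """Regression-method scores for a one-factor model: standardize the items,
    then weight them by inv(Sigma) @ lambda, where Sigma is the model-implied
    correlation matrix. data: persons x items."""
    z = (data - data.mean(axis=0)) / data.std(axis=0, ddof=1)
    lam = np.asarray(loadings)
    sigma = np.outer(lam, lam) + np.diag(uniquenesses)  # model-implied corr
    return z @ np.linalg.solve(sigma, lam)

# Hypothetical three-item scale with assumed loadings/uniquenesses:
rng = np.random.default_rng(1)
items = rng.normal(size=(100, 3))
scores = regression_factor_scores(items, loadings=[0.8, 0.7, 0.6],
                                  uniquenesses=[0.36, 0.51, 0.64])
```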