ERIC - Search Results

Showing 1 to 15 of 25 results

Detecting Differential Item Functioning: Item Response Theory Methods versus the Mantel-Haenszel Procedure

Peer reviewed

Diaz, Emily; Brooks, Gordon; Johanson, George – International Journal of Assessment Tools in Education, 2021

This Monte Carlo study assessed Type I error in differential item functioning analyses using Lord's chi-square (LC), Likelihood Ratio Test (LRT), and Mantel-Haenszel (MH) procedure. Two research interests were investigated: item response theory (IRT) model specification in LC and the LRT and continuity correction in the MH procedure. This study…

Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Comparative Analysis

A Comparison of Frequentist and Bayesian Approaches: The Power to Detect Model Misspecifications in Confirmatory Factor Analytic Models

Peer reviewed

Önen, Emine – Universal Journal of Educational Research, 2019

This simulation study was conducted to compare the performances of Frequentist and Bayesian approaches in the context of power to detect model misspecification in terms of omitted cross-loading in CFA models with respect to the several variables (number of omitted cross-loading, magnitude of main loading, number of factors, number of indicators…

Descriptors: Factor Analysis, Bayesian Statistics, Comparative Analysis, Statistical Analysis

The Impact of Prior Specifications on Model Comparison in Bayesian Structural Equation Modeling

Peer reviewed

Huang, Jiajing; Liang, Xinya; Yang, Yanyun – AERA Online Paper Repository, 2017

In Bayesian structural equation modeling (BSEM), prior settings may affect model fit, parameter estimation, and model comparison. This simulation study was to investigate how the priors impact evaluation of relative fit across competing models. The design factors for data generation included sample sizes, factor structures, data distributions, and…

Descriptors: Bayesian Statistics, Structural Equation Models, Goodness of Fit, Sample Size

Convergence, Admissibility, and Fit of Alternative Confirmatory Factor Analysis Models for MTMM Data

Peer reviewed

Lance, Charles E.; Fan, Yi – Educational and Psychological Measurement, 2016

We compared six different analytic models for multitrait-multimethod (MTMM) data in terms of convergence, admissibility, and model fit to 258 samples of previously reported data. Two well-known models, the correlated trait-correlated method (CTCM) and the correlated trait-correlated uniqueness (CTCU) models, were fit for reference purposes in…

Descriptors: Multitrait Multimethod Techniques, Factor Analysis, Models, Goodness of Fit

Hierarchical Linear Modeling with Maximum Likelihood, Restricted Maximum Likelihood, and Fully Bayesian Estimation

Peer reviewed

Boedeker, Peter – Practical Assessment, Research & Evaluation, 2017

Hierarchical linear modeling (HLM) is a useful tool when analyzing data collected from groups. There are many decisions to be made when constructing and estimating a model in HLM including which estimation technique to use. Three of the estimation techniques available when analyzing data with HLM are maximum likelihood, restricted maximum…

Descriptors: Hierarchical Linear Modeling, Maximum Likelihood Statistics, Bayesian Statistics, Computation

Are the Nonparametric Person-Fit Statistics More Powerful than Their Parametric Counterparts? Revisiting the Simulations in Karabatsos (2003)

Peer reviewed

Sinharay, Sandip – Applied Measurement in Education, 2017

Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…

Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis

The Impact of Model Parameterization and Estimation Methods on Tests of Measurement Invariance with Ordered Polytomous Data

Peer reviewed

Koziol, Natalie A.; Bovaird, James A. – Educational and Psychological Measurement, 2018

Evaluations of measurement invariance provide essential construct validity evidence--a prerequisite for seeking meaning in psychological and educational research and ensuring fair testing procedures in high-stakes settings. However, the quality of such evidence is partly dependent on the validity of the resulting statistical conclusions. Type I or…

Descriptors: Computation, Tests, Error of Measurement, Comparative Analysis

Person Fit Analysis in Computerized Adaptive Testing Using Tests for a Change Point

Peer reviewed

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2016

Meijer and van Krimpen-Stoop noted that the number of person-fit statistics (PFSs) that have been designed for computerized adaptive tests (CATs) is relatively modest. This article partially addresses that concern by suggesting three new PFSs for CATs. The statistics are based on tests for a change point and can be used to detect an abrupt change…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Goodness of Fit

Bayesian Estimation of Multi-Unidimensional Graded Response IRT Models

Kuo, Tzu-Chun – ProQuest LLC, 2015

Item response theory (IRT) has gained an increasing popularity in large-scale educational and psychological testing situations because of its theoretical advantages over classical test theory. Unidimensional graded response models (GRMs) are useful when polytomous response items are designed to measure a unified latent trait. They are limited in…

Descriptors: Item Response Theory, Bayesian Statistics, Computation, Models

Design-Comparable Effect Sizes in Multiple Baseline Designs: A General Modeling Framework

Peer reviewed

Pustejovsky, James E.; Hedges, Larry V.; Shadish, William R. – Journal of Educational and Behavioral Statistics, 2014

In single-case research, the multiple baseline design is a widely used approach for evaluating the effects of interventions on individuals. Multiple baseline designs involve repeated measurement of outcomes over time and the controlled introduction of a treatment at different times for different individuals. This article outlines a general…

Descriptors: Hierarchical Linear Modeling, Effect Size, Maximum Likelihood Statistics, Computation

Toward Increasing Fairness in Score Scale Calibrations Employed in International Large-Scale Assessments

Peer reviewed

Oliveri, Maria Elena; von Davier, Matthias – International Journal of Testing, 2014

In this article, we investigate the creation of comparable score scales across countries in international assessments. We examine potential improvements to current score scale calibration procedures used in international large-scale assessments. Our approach seeks to improve fairness in scoring international large-scale assessments, which often…

Descriptors: Test Bias, Scores, International Programs, Educational Assessment

Explore the Usefulness of Person-Fit Analysis on Large-Scale Assessment

Peer reviewed

Cui, Ying; Mousavi, Amin – International Journal of Testing, 2015

The current study applied the person-fit statistic, l[subscript z], to data from a Canadian provincial achievement test to explore the usefulness of conducting person-fit analysis on large-scale assessments. Item parameter estimates were compared before and after the misfitting student responses, as identified by l[subscript z], were removed. The…

Descriptors: Measurement, Achievement Tests, Comparative Analysis, Test Items

Exploring a Three-Level Model of Calibration Accuracy

Peer reviewed

Schraw, Gregory; Kuch, Fred; Gutierrez, Antonio P.; Richmond, Aaron S. – Journal of Educational Psychology, 2014

We compared 5 different statistics (i.e., G index, gamma, "d'", sensitivity, specificity) used in the social sciences and medical diagnosis literatures to assess calibration accuracy in order to examine the relationship among them and to explore whether one statistic provided a best fitting general measure of accuracy. College…

Descriptors: Statistics, Statistical Analysis, Correlation, Accuracy

An Investigation of the Sample Performance of Two Nonnormality Corrections for RMSEA

Peer reviewed

Brosseau-Liard, Patricia E.; Savalei, Victoria; Li, Libo – Multivariate Behavioral Research, 2012

The root mean square error of approximation (RMSEA) is a popular fit index in structural equation modeling (SEM). Typically, RMSEA is computed using the normal theory maximum likelihood (ML) fit function. Under nonnormality, the uncorrected sample estimate of the ML RMSEA tends to be inflated. Two robust corrections to the sample ML RMSEA have…

Descriptors: Structural Equation Models, Goodness of Fit, Maximum Likelihood Statistics, Robustness (Statistics)

Next Steps in Bayesian Structural Equation Models: Comments on, Variations of, and Extensions to Muthen and Asparouhov (2012)

Peer reviewed

Rindskopf, David – Psychological Methods, 2012

Muthen and Asparouhov (2012) made a strong case for the advantages of Bayesian methodology in factor analysis and structural equation models. I show additional extensions and adaptations of their methods and show how non-Bayesians can take advantage of many (though not all) of these advantages by using interval restrictions on parameters. By…

Descriptors: Structural Equation Models, Bayesian Statistics, Factor Analysis, Computation
