ERIC - Search Results

Showing 1 to 15 of 40 results

Evaluating Model Fit in Bayesian Confirmatory Factor Analysis with Large Samples: Simulation Study Introducing the BRMSEA

Peer reviewed

Hoofs, Huub; van de Schoot, Rens; Jansen, Nicole W. H.; Kant, IJmert – Educational and Psychological Measurement, 2018

Bayesian confirmatory factor analysis (CFA) offers an alternative to frequentist CFA based on, for example, maximum likelihood estimation for the assessment of reliability and validity of educational and psychological measures. For increasing sample sizes, however, the applicability of current fit statistics evaluating model fit within Bayesian…

Descriptors: Goodness of Fit, Bayesian Statistics, Factor Analysis, Sample Size

Measurement Invariance in International Surveys: Categorical Indicators and Fit Measure Performance

Peer reviewed

Rutkowski, Leslie; Svetina, Dubravka – Applied Measurement in Education, 2017

In spite of the challenges inherent in making dozens of comparisons across heterogeneous populations, a relatively recent interest in scale-score equivalence for non-achievement measures in an international context has emerged. Until recently, operational procedures for establishing measurement invariance using multiple-groups analyses were…

Descriptors: International Assessment, Goodness of Fit, Statistical Analysis, Teacher Surveys

The Role of Measurement Quality on Practical Guidelines for Assessing Measurement and Structural Invariance

Peer reviewed

Kang, Yoonjeong; McNeish, Daniel M.; Hancock, Gregory R. – Educational and Psychological Measurement, 2016

Although differences in goodness-of-fit indices (ΔGOFs) have been advocated for assessing measurement invariance, studies that advanced recommended differential cutoffs for adjudicating invariance actually utilized a very limited range of values representing the quality of indicator variables (i.e., magnitude of loadings). Because quality of…

Descriptors: Measurement, Goodness of Fit, Guidelines, Models

Extension of Caution Indices to Mixed-Format Tests

Peer reviewed

Sinharay, Sandip – Grantee Submission, 2018

Tatsuoka (1984) suggested several extended caution indices and their standardized versions that have been used as person-fit statistics by researchers such as Drasgow, Levine, and McLaughlin (1987), Glas and Meijer (2003), and Molenaar and Hoijtink (1990). However, these indices are only defined for tests with dichotomous items. This paper extends…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Error Patterns

Statistical Methodology for the Analysis of Repeated Duration Data in Behavioral Studies

Peer reviewed

Letué, Frédérique; Martinez, Marie-José; Samson, Adeline; Vilain, Anne; Vilain, Coriandre – Journal of Speech, Language, and Hearing Research, 2018

Purpose: Repeated duration data are frequently used in behavioral studies. Classical linear or log-linear mixed models are often inadequate to analyze such data, because they usually consist of nonnegative and skew-distributed variables. Therefore, we recommend use of a statistical methodology specific to duration data. Method: We propose a…

Descriptors: Behavioral Science Research, Research Methodology, Statistical Analysis, Repetition

Do Adaptive Representations of the Item-Position Effect in APM Improve Model Fit? A Simulation Study

Peer reviewed

Zeller, Florian; Krampen, Dorothea; Reiß, Siegbert; Schweizer, Karl – Educational and Psychological Measurement, 2017

The item-position effect describes how an item's position within a test, that is, the number of previous completed items, affects the response to this item. Previously, this effect was represented by constraints reflecting simple courses, for example, a linear increase. Due to the inflexibility of these representations our aim was to examine…

Descriptors: Goodness of Fit, Simulation, Factor Analysis, Intelligence Tests

Investigation of Rater Effects Using Social Network Analysis and Exponential Random Graph Models

Peer reviewed

Lamprianou, Iasonas – Educational and Psychological Measurement, 2018

It is common practice for assessment programs to organize qualifying sessions during which the raters (often known as "markers" or "judges") demonstrate their consistency before operational rating commences. Because of the high-stakes nature of many rating activities, the research community tends to continuously explore new…

Descriptors: Social Networks, Network Analysis, Comparative Analysis, Innovation

Person Fit Analysis in Computerized Adaptive Testing Using Tests for a Change Point

Peer reviewed

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2016

Meijer and van Krimpen-Stoop noted that the number of person-fit statistics (PFSs) that have been designed for computerized adaptive tests (CATs) is relatively modest. This article partially addresses that concern by suggesting three new PFSs for CATs. The statistics are based on tests for a change point and can be used to detect an abrupt change…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Goodness of Fit

Effect Size Measures for Differential Item Functioning in a Multidimensional IRT Model

Peer reviewed

Suh, Youngsuk – Journal of Educational Measurement, 2016

This study adapted an effect size measure used for studying differential item functioning (DIF) in unidimensional tests and extended the measure to multidimensional tests. Two effect size measures were considered in a multidimensional item response theory model: signed weighted P-difference and unsigned weighted P-difference. The performance of…

Descriptors: Effect Size, Goodness of Fit, Statistical Analysis, Statistical Significance

Item Response Data Analysis Using Stata Item Response Theory Package

Peer reviewed

Yang, Ji Seung; Zheng, Xiaying – Journal of Educational and Behavioral Statistics, 2018

The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from Programme of International Student Assessment, we review the IRT package from…

Descriptors: Item Response Theory, Item Analysis, Computer Software, Statistical Analysis

Unrestricted Mixture Models for Class Identification in Growth Mixture Modeling

Peer reviewed

Liu, Min; Hancock, Gregory R. – Educational and Psychological Measurement, 2014

Growth mixture modeling has gained much attention in applied and methodological social science research recently, but the selection of the number of latent classes for such models remains a challenging issue, especially when the assumption of proper model specification is violated. The current simulation study compared the performance of a linear…

Descriptors: Models, Classification, Simulation, Comparative Analysis

A Quasi-Parametric Method for Fitting Flexible Item Response Functions

Peer reviewed

Liang, Longjuan; Browne, Michael W. – Journal of Educational and Behavioral Statistics, 2015

If standard two-parameter item response functions are employed in the analysis of a test with some newly constructed items, it can be expected that, for some items, the item response function (IRF) will not fit the data well. This lack of fit can also occur when standard IRFs are fitted to personality or psychopathology items. When investigating…

Descriptors: Item Response Theory, Statistical Analysis, Goodness of Fit, Bayesian Statistics

Modeling Information Accumulation in Psychological Tests Using Item Response Times

Peer reviewed

Ranger, Jochen; Kuhn, Jörg-Tobias – Journal of Educational and Behavioral Statistics, 2015

In this article, a latent trait model is proposed for the response times in psychological tests. The latent trait model is based on the linear transformation model and subsumes popular models from survival analysis, like the proportional hazards model and the proportional odds model. Core of the model is the assumption that an unspecified monotone…

Descriptors: Psychological Testing, Reaction Time, Statistical Analysis, Models

Posterior Predictive Model Checking in Bayesian Networks

Crawford, Aaron – ProQuest LLC, 2014

This simulation study compared the utility of various discrepancy measures within a posterior predictive model checking (PPMC) framework for detecting different types of data-model misfit in multidimensional Bayesian network (BN) models. The investigated conditions were motivated by an applied research program utilizing an operational complex…

Descriptors: Bayesian Statistics, Networks, Models, Goodness of Fit

Evaluating the Wald Test for Item-Level Comparison of Saturated and Reduced Models in Cognitive Diagnosis

Peer reviewed

de la Torre, Jimmy; Lee, Young-Sun – Journal of Educational Measurement, 2013

This article used the Wald test to evaluate the item-level fit of a saturated cognitive diagnosis model (CDM) relative to the fits of the reduced models it subsumes. A simulation study was carried out to examine the Type I error and power of the Wald test in the context of the G-DINA model. Results show that when the sample size is small and a…

Descriptors: Statistical Analysis, Test Items, Goodness of Fit, Error of Measurement
