NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)3
Since 2006 (last 20 years)15
Audience
Location
New Mexico2
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 18 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020
Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…
Descriptors: Test Items, Goodness of Fit, Probability, Accuracy
Peer reviewed Peer reviewed
Direct linkDirect link
Willse, John T. – Measurement and Evaluation in Counseling and Development, 2017
This article provides a brief introduction to the Rasch model. Motivation for using Rasch analyses is provided. Important Rasch model concepts and key aspects of result interpretation are introduced, with major points reinforced using a simulation demonstration. Concrete guidelines are provided regarding sample size and the evaluation of items.
Descriptors: Item Response Theory, Test Results, Test Interpretation, Simulation
Peer reviewed Peer reviewed
Direct linkDirect link
Tellinghuisen, Joel – Journal of Chemical Education, 2015
The method of least-squares (LS) has a built-in procedure for estimating the standard errors (SEs) of the adjustable parameters in the fit model: They are the square roots of the diagonal elements of the covariance matrix. This means that one can use least-squares to obtain numerical values of propagated errors by defining the target quantities as…
Descriptors: Least Squares Statistics, Error of Measurement, Error Patterns, Chemistry
Peer reviewed Peer reviewed
Direct linkDirect link
Xi, Nuo; Browne, Michael W. – Journal of Educational and Behavioral Statistics, 2014
A promising "underlying bivariate normal" approach was proposed by Jöreskog and Moustaki for use in the factor analysis of ordinal data. This was a limited information approach that involved the maximization of a composite likelihood function. Its advantage over full-information maximum likelihood was that very much less computation was…
Descriptors: Factor Analysis, Maximum Likelihood Statistics, Data, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Thissen, David – Journal of Educational and Behavioral Statistics, 2016
David Thissen, a professor in the Department of Psychology and Neuroscience, Quantitative Program at the University of North Carolina, has consulted and served on technical advisory committees for assessment programs that use item response theory (IRT) over the past couple decades. He has come to the conclusion that there are usually two purposes…
Descriptors: Item Response Theory, Test Construction, Testing Problems, Student Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Westfall, Peter H.; Henning, Kevin S. S.; Howell, Roy D. – Structural Equation Modeling: A Multidisciplinary Journal, 2012
This article shows how interfactor correlation is affected by error correlations. Theoretical and practical justifications for error correlations are given, and a new equivalence class of models is presented to explain the relationship between interfactor correlation and error correlations. The class allows simple, parsimonious modeling of error…
Descriptors: Psychometrics, Correlation, Error of Measurement, Structural Equation Models
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Taehun; Cai, Li – Journal of Educational and Behavioral Statistics, 2012
Model-based multiple imputation has become an indispensable method in the educational and behavioral sciences. Mean and covariance structure models are often fitted to multiply imputed data sets. However, the presence of multiple random imputations complicates model fit testing, which is an important aspect of mean and covariance structure…
Descriptors: Statistical Inference, Structural Equation Models, Goodness of Fit, Statistical Analysis
Haberman, Shelby J.; Dorans, Neil J. – Educational Testing Service, 2011
For testing programs that administer multiple forms within a year and across years, score equating is used to ensure that scores can be used interchangeably. In an ideal world, samples sizes are large and representative of populations that hardly change over time, and very reliable alternate test forms are built with nearly identical psychometric…
Descriptors: Scores, Reliability, Equated Scores, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Chen, Fang; Chalhoub-Deville, Micheline – Language Testing, 2014
Newer statistical procedures are typically introduced to help address the limitations of those already in practice or to deal with emerging research needs. Quantile regression (QR) is introduced in this paper as a relatively new methodology, which is intended to overcome some of the limitations of least squares mean regression (LMR). QR is more…
Descriptors: Regression (Statistics), Language Tests, Language Proficiency, Mathematics Achievement
Peer reviewed Peer reviewed
Direct linkDirect link
Bai, Yun; Poon, Wai-Yin – Structural Equation Modeling: A Multidisciplinary Journal, 2009
Two-level data sets are frequently encountered in social and behavioral science research. They arise when observations are drawn from a known hierarchical structure, such as when individuals are randomly drawn from groups that are randomly drawn from a target population. Although 2-level data analysis in the context of structural equation modeling…
Descriptors: Structural Equation Models, Data Analysis, Simulation, Goodness of Fit
Peer reviewed Peer reviewed
Direct linkDirect link
Little, Todd D.; Bovaird, James A.; Widaman, Keith F. – Structural Equation Modeling: A Multidisciplinary Journal, 2006
The goals of this article are twofold: (a) briefly highlight the merits of residual centering for representing interaction and powered terms in standard regression contexts (e.g., Lance, 1988), and (b) extend the residual centering procedure to represent latent variable interactions. The proposed method for representing latent variable…
Descriptors: Interaction, Structural Equation Models, Evaluation Methods, Regression (Statistics)
Peer reviewed Peer reviewed
Kingma, Johannes; Reuvekamp, Johan – Educational and Psychological Measurement, 1987
This paper describes a PASCAL program that computes both different types of transitions and learning statistics suitable for learning experiments in which a two-stage Markov model is used. The frequency counts of the different transitions are used for estimating the parameters of the two-stage Markov model. (Author/LMO)
Descriptors: Computer Software Reviews, Error of Measurement, Goodness of Fit, Input Output
Alonzo, Julie; Liu, Kimy; Tindal, Gerald – Behavioral Research and Teaching, 2007
In this technical report, the authors describe the development and piloting of reading comprehension measures as part of a comprehensive progress monitoring literacy assessment system developed in 2006 for use with students in Kindergarten through fifth grade. They begin with a brief overview of the two conceptual frameworks underlying the…
Descriptors: Reading Comprehension, Emergent Literacy, Test Construction, Literacy Education
Peer reviewed Peer reviewed
Direct linkDirect link
Marsh, Herbert W.; Hau, Kit-Tai; Wen, Zhonglin – Structural Equation Modeling, 2004
Goodness-of-fit (GOF) indexes provide "rules of thumb"?recommended cutoff values for assessing fit in structural equation modeling. Hu and Bentler (1999) proposed a more rigorous approach to evaluating decision rules based on GOF indexes and, on this basis, proposed new and more stringent cutoff values for many indexes. This article discusses…
Descriptors: Statistical Significance, Structural Equation Models, Evaluation Methods, Evaluation Research
Baghi, Heibatollah – 1990
The Maryland Functional Testing Program (MFTP) uses the Rasch model as the statistical framework for the analysis of test items and scores. This paper is designed to assist the reader in developing an understanding of the fit statistics in the Rasch model. Background materials on application of the Rasch model in statistical analysis of the MFTP…
Descriptors: Computer Assisted Testing, Computer Software, Equated Scores, Error of Measurement
Previous Page | Next Page »
Pages: 1  |  2