ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	15

Descriptor

Error of Measurement	18
Goodness of Fit	18
Computation	8
Item Response Theory	7
Test Items	7
Test Construction	6
Computer Software	5
Structural Equation Models	5
Psychometrics	4
Student Evaluation	4
Cutting Scores	3
Elementary School Students	3
Mathematics Achievement	3
Measures (Individuals)	3
Simulation	3
Statistical Inference	3
Test Validity	3
Academic Standards	2
Computer Assisted Testing	2
Correlation	2
Data Analysis	2
Difficulty Level	2
Emergent Literacy	2
English	2
Equated Scores	2
More ▼

Source

Journal of Educational and…	3
Structural Equation Modeling:…	3
Behavioral Research and…	2
New Mexico Public Education…	2
Educational Testing Service	1
Educational and Psychological…	1
Journal of Chemical Education	1
Journal of Educational…	1
Language Testing	1
Measurement and Evaluation in…	1
Structural Equation Modeling	1
More ▼

Publication Type

Reports - Descriptive	18
Journal Articles	12
Numerical/Quantitative Data	4
Information Analyses	1
Tests/Questionnaires	1

Education Level

Elementary Education	3
Elementary Secondary Education	2
Grade 1	2
Grade 2	2
Grade 3	2
Grade 4	2
Grade 5	2
Kindergarten	2
Early Childhood Education	1
Intermediate Grades	1
Primary Education	1
Secondary Education	1
More ▼

Audience

Location

New Mexico

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	2
Early Childhood Longitudinal…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

Sensitivity of the RMSD for Detecting Item-Level Misfit in Low-Performing Countries

Peer reviewed

Direct link

Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020

Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…

Descriptors: Test Items, Goodness of Fit, Probability, Accuracy

Polytomous Rasch Models in Counseling Assessment

Peer reviewed

Direct link

Willse, John T. – Measurement and Evaluation in Counseling and Development, 2017

This article provides a brief introduction to the Rasch model. Motivation for using Rasch analyses is provided. Important Rasch model concepts and key aspects of result interpretation are introduced, with major points reinforced using a simulation demonstration. Concrete guidelines are provided regarding sample size and the evaluation of items.

Descriptors: Item Response Theory, Test Results, Test Interpretation, Simulation

Using Least Squares for Error Propagation

Peer reviewed

Direct link

Tellinghuisen, Joel – Journal of Chemical Education, 2015

The method of least-squares (LS) has a built-in procedure for estimating the standard errors (SEs) of the adjustable parameters in the fit model: They are the square roots of the diagonal elements of the covariance matrix. This means that one can use least-squares to obtain numerical values of propagated errors by defining the target quantities as…

Descriptors: Least Squares Statistics, Error of Measurement, Error Patterns, Chemistry

Contributions to the Underlying Bivariate Normal Method for Factor Analyzing Ordinal Data

Peer reviewed

Direct link

Xi, Nuo; Browne, Michael W. – Journal of Educational and Behavioral Statistics, 2014

A promising "underlying bivariate normal" approach was proposed by Jöreskog and Moustaki for use in the factor analysis of ordinal data. This was a limited information approach that involved the maximization of a composite likelihood function. Its advantage over full-information maximum likelihood was that very much less computation was…

Descriptors: Factor Analysis, Maximum Likelihood Statistics, Data, Computation

Bad Questions: An Essay Involving Item Response Theory

Peer reviewed

Direct link

Thissen, David – Journal of Educational and Behavioral Statistics, 2016

David Thissen, a professor in the Department of Psychology and Neuroscience, Quantitative Program at the University of North Carolina, has consulted and served on technical advisory committees for assessment programs that use item response theory (IRT) over the past couple decades. He has come to the conclusion that there are usually two purposes…

Descriptors: Item Response Theory, Test Construction, Testing Problems, Student Evaluation

The Effect of Error Correlation on Interfactor Correlation in Psychometric Measurement

Peer reviewed

Direct link

Westfall, Peter H.; Henning, Kevin S. S.; Howell, Roy D. – Structural Equation Modeling: A Multidisciplinary Journal, 2012

This article shows how interfactor correlation is affected by error correlations. Theoretical and practical justifications for error correlations are given, and a new equivalence class of models is presented to explain the relationship between interfactor correlation and error correlations. The class allows simple, parsimonious modeling of error…

Descriptors: Psychometrics, Correlation, Error of Measurement, Structural Equation Models

Alternative Multiple Imputation Inference for Mean and Covariance Structure Modeling

Peer reviewed

Direct link

Lee, Taehun; Cai, Li – Journal of Educational and Behavioral Statistics, 2012

Model-based multiple imputation has become an indispensable method in the educational and behavioral sciences. Mean and covariance structure models are often fitted to multiply imputed data sets. However, the presence of multiple random imputations complicates model fit testing, which is an important aspect of mean and covariance structure…

Descriptors: Statistical Inference, Structural Equation Models, Goodness of Fit, Statistical Analysis

Sources of Score Scale Inconsistency. Research Report. ETS RR-11-10

Download full text

Haberman, Shelby J.; Dorans, Neil J. – Educational Testing Service, 2011

For testing programs that administer multiple forms within a year and across years, score equating is used to ensure that scores can be used interchangeably. In an ideal world, samples sizes are large and representative of populations that hardly change over time, and very reliable alternate test forms are built with nearly identical psychometric…

Descriptors: Scores, Reliability, Equated Scores, Test Construction

Principles of Quantile Regression and an Application

Peer reviewed

Direct link

Chen, Fang; Chalhoub-Deville, Micheline – Language Testing, 2014

Newer statistical procedures are typically introduced to help address the limitations of those already in practice or to deal with emerging research needs. Quantile regression (QR) is introduced in this paper as a relatively new methodology, which is intended to overcome some of the limitations of least squares mean regression (LMR). QR is more…

Descriptors: Regression (Statistics), Language Tests, Language Proficiency, Mathematics Achievement

Using Mx to Analyze Cross-Level Effects in Two-Level Structural Equation Models

Peer reviewed

Direct link

Bai, Yun; Poon, Wai-Yin – Structural Equation Modeling: A Multidisciplinary Journal, 2009

Two-level data sets are frequently encountered in social and behavioral science research. They arise when observations are drawn from a known hierarchical structure, such as when individuals are randomly drawn from groups that are randomly drawn from a target population. Although 2-level data analysis in the context of structural equation modeling…

Descriptors: Structural Equation Models, Data Analysis, Simulation, Goodness of Fit

On the Merits of Orthogonalizing Powered and Product Terms: Implications for Modeling Interactions among Latent Variables

Peer reviewed

Direct link

Little, Todd D.; Bovaird, James A.; Widaman, Keith F. – Structural Equation Modeling: A Multidisciplinary Journal, 2006

The goals of this article are twofold: (a) briefly highlight the merits of residual centering for representing interaction and powered terms in standard regression contexts (e.g., Lance, 1988), and (b) extend the residual centering procedure to represent latent variable interactions. The proposed method for representing latent variable…

Descriptors: Interaction, Structural Equation Models, Evaluation Methods, Regression (Statistics)

Markov Count: A Program for Computing the Learning Statistics of Two- Stage Markov Learning Experiments.

Peer reviewed

Kingma, Johannes; Reuvekamp, Johan – Educational and Psychological Measurement, 1987

This paper describes a PASCAL program that computes both different types of transitions and learning statistics suitable for learning experiments in which a two-stage Markov model is used. The frequency counts of the different transitions are used for estimating the parameters of the two-stage Markov model. (Author/LMO)

Descriptors: Computer Software Reviews, Error of Measurement, Goodness of Fit, Input Output

Examining the Technical Adequacy of Reading Comprehension Measures in a Progress Monitoring Assessment System. Technical Report # 41

Download full text

Alonzo, Julie; Liu, Kimy; Tindal, Gerald – Behavioral Research and Teaching, 2007

In this technical report, the authors describe the development and piloting of reading comprehension measures as part of a comprehensive progress monitoring literacy assessment system developed in 2006 for use with students in Kindergarten through fifth grade. They begin with a brief overview of the two conceptual frameworks underlying the…

Descriptors: Reading Comprehension, Emergent Literacy, Test Construction, Literacy Education

In Search of Golden Rules: Comment on Hypothesis-Testing Approaches to Setting Cutoff Values for Fit Indexes and Dangers in Overgeneralizing Hu and Bentler's (1999) Findings

Peer reviewed

Direct link

Marsh, Herbert W.; Hau, Kit-Tai; Wen, Zhonglin – Structural Equation Modeling, 2004

Goodness-of-fit (GOF) indexes provide "rules of thumb"?recommended cutoff values for assessing fit in structural equation modeling. Hu and Bentler (1999) proposed a more rigorous approach to evaluating decision rules based on GOF indexes and, on this basis, proposed new and more stringent cutoff values for many indexes. This article discusses…

Descriptors: Statistical Significance, Structural Equation Models, Evaluation Methods, Evaluation Research

The Use of Rasch Model Fit Statistics in Selecting Items for the Maryland Functional Testing Program.

Download full text

Baghi, Heibatollah – 1990

The Maryland Functional Testing Program (MFTP) uses the Rasch model as the statistical framework for the analysis of test items and scores. This paper is designed to assist the reader in developing an understanding of the fit statistics in the Rasch model. Background materials on application of the Rasch model in statistical analysis of the MFTP…

Descriptors: Computer Assisted Testing, Computer Software, Equated Scores, Error of Measurement

Previous Page | Next Page »

Pages: 1 | 2

Alonzo, Julie	2
Tindal, Gerald	2
Baghi, Heibatollah	1
Bai, Yun	1
Bolsinova, Maria	1
Bovaird, James A.	1
Browne, Michael W.	1
Cai, Li	1
Chalhoub-Deville, Micheline	1
Chen, Fang	1
Dorans, Neil J.	1
Griph, Gerald W.	1
Haberman, Shelby J.	1
Hau, Kit-Tai	1
Henning, Kevin S. S.	1
Howell, Roy D.	1
Kingma, Johannes	1
Lee, Taehun	1
Liaw, Yuan-Ling	1
Little, Todd D.	1
Liu, Kimy	1
Marsh, Herbert W.	1
Poon, Wai-Yin	1
Reuvekamp, Johan	1
Rutkowski, David	1
More ▼