ERIC - Search Results

Publication Date

In 2025	0
Since 2024	3
Since 2021 (last 5 years)	9
Since 2016 (last 10 years)	20
Since 2006 (last 20 years)	33

Descriptor

Error of Measurement	40
Sample Size	40
Scores	40
Item Response Theory	11
Test Items	9
Accuracy	8
Computation	8
Correlation	8
Evaluation Methods	7
Regression (Statistics)	7
Simulation	7
Statistical Analysis	7
Statistical Bias	7
Comparative Analysis	6
Models	6
Statistical Distributions	6
Test Bias	6
Achievement Tests	5
Factor Analysis	5
Longitudinal Studies	5
Multivariate Analysis	5
Psychometrics	5
Sampling	5
Structural Equation Models	5
Classification	4
More ▼

Publication Type

Reports - Research	27
Journal Articles	22
Speeches/Meeting Papers	8
Reports - Evaluative	6
Dissertations/Theses -…	4
Guides - Non-Classroom	2
Numerical/Quantitative Data	2
Books	1
Opinion Papers	1
Reports - Descriptive	1

Education Level

Elementary Education	3
Elementary Secondary Education	3
Grade 8	2
Middle Schools	2
Secondary Education	2
Adult Education	1
Grade 5	1
High Schools	1
Higher Education	1
Intermediate Grades	1
Junior High Schools	1
Kindergarten	1
Postsecondary Education	1
More ▼

Audience

Researchers

Location

Canada	1
Colorado	1
Indonesia	1
Israel	1
Kansas	1
Netherlands	1
South Dakota	1
Wyoming	1

Laws, Policies, & Programs

Assessments and Surveys

Early Childhood Longitudinal…	2
Trends in International…	2
Advanced Placement…	1
National Assessment of…	1
National Merit Scholarship…	1
Preliminary Scholastic…	1
Program for International…	1
Progress in International…	1
Student Teacher Relationship…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 40 results Save | Export

Sample Size Calculation and Optimal Design for Multivariate Regression-Based Norming

Peer reviewed

Direct link

Francesco Innocenti; Math J. J. M. Candel; Frans E. S. Tan; Gerard J. P. van Breukelen – Journal of Educational and Behavioral Statistics, 2024

Normative studies are needed to obtain norms for comparing individuals with the reference population on relevant clinical or educational measures. Norms can be obtained in an efficient way by regressing the test score on relevant predictors, such as age and sex. When several measures are normed with the same sample, a multivariate regression-based…

Descriptors: Sample Size, Multivariate Analysis, Error of Measurement, Regression (Statistics)

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

Detecting Careless Responding in Multidimensional Forced-Choice Questionnaires

Peer reviewed

Direct link

Rebekka Kupffer; Susanne Frick; Eunike Wetzel – Educational and Psychological Measurement, 2024

The multidimensional forced-choice (MFC) format is an alternative to rating scales in which participants rank items according to how well the items describe them. Currently, little is known about how to detect careless responding in MFC data. The aim of this study was to adapt a number of indices used for rating scales to the MFC format and…

Descriptors: Measurement Techniques, Alternative Assessment, Rating Scales, Questionnaires

Comparing Factor Score Approaches to SEM in Multigroup Models with Small Samples

Peer reviewed

Direct link

Emma Somer; Carl Falk; Milica Miocevic – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Factor Score Regression (FSR) is increasingly employed as an alternative to structural equation modeling (SEM) in small samples. Despite its popularity in psychology, the performance of FSR in multigroup models with small samples remains relatively unknown. The goal of this study was to examine the performance of FSR, namely Croon's correction and…

Descriptors: Scores, Structural Equation Models, Comparative Analysis, Sample Size

Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model

Download full text

Custer, Michael; Kim, Jongpil – Online Submission, 2023

This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Batelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…

Descriptors: Sample Size, Item Response Theory, Test Items, Computation

Classification Consistency and Accuracy with Atypical Score Distributions

Peer reviewed

Direct link

Kim, Stella Y.; Lee, Won-Chan – Journal of Educational Measurement, 2020

The current study aims to evaluate the performance of three non-IRT procedures (i.e., normal approximation, Livingston-Lewis, and compound multinomial) for estimating classification indices when the observed score distribution shows atypical patterns: (a) bimodality, (b) structural (i.e., systematic) bumpiness, or (c) structural zeros (i.e., no…

Descriptors: Classification, Accuracy, Scores, Cutting Scores

Using Pooled Heteroskedastic Ordered Probit Models to Improve Small-Sample Estimates of Latent Test Score Distributions

Peer reviewed
PDF on ERIC

Download full text

Direct link

Shear, Benjamin R.; Reardon, Sean F. – Journal of Educational and Behavioral Statistics, 2021

This article describes an extension to the use of heteroskedastic ordered probit (HETOP) models to estimate latent distributional parameters from grouped, ordered-categorical data by pooling across multiple waves of data. We illustrate the method with aggregate proficiency data reporting the number of students in schools or districts scoring in…

Descriptors: Statistical Analysis, Computation, Regression (Statistics), Sample Size

A Small Sample Correction for Factor Score Regression

Peer reviewed

Direct link

Bogaert, Jasper; Loh, Wen Wei; Rosseel, Yves – Educational and Psychological Measurement, 2023

Factor score regression (FSR) is widely used as a convenient alternative to traditional structural equation modeling (SEM) for assessing structural relations between latent variables. But when latent variables are simply replaced by factor scores, biases in the structural parameter estimates often have to be corrected, due to the measurement error…

Descriptors: Factor Analysis, Regression (Statistics), Structural Equation Models, Error of Measurement

A Comparison between the Piecewise and Parallel-Process Piecewise Latent Growth Models

Peer reviewed

Direct link

Nazari, Sanaz; Leite, Walter L.; Huggins-Manley, A. Corinne – Journal of Experimental Education, 2023

The piecewise latent growth models (PWLGMs) can be used to study changes in the growth trajectory of an outcome due to an event or condition, such as exposure to an intervention. When there are multiple outcomes of interest, a researcher may choose to fit a series of PWLGMs or a single parallel-process PWLGM. A comparison of these models is…

Descriptors: Growth Models, Statistical Analysis, Intervention, Comparative Analysis

Impact of Item Parameter Drift on Rasch Scale Stability in Small Samples over Multiple Administrations

Peer reviewed

Direct link

Kopp, Jason P.; Jones, Andrew T. – Applied Measurement in Education, 2020

Traditional psychometric guidelines suggest that at least several hundred respondents are needed to obtain accurate parameter estimates under the Rasch model. However, recent research indicates that Rasch equating results in accurate parameter estimates with sample sizes as small as 25. Item parameter drift under the Rasch model has been…

Descriptors: Item Response Theory, Psychometrics, Sample Size, Sampling

A Log-Linear Modeling Approach for Differential Item Functioning Detection in Polytomously Scored Items

Peer reviewed

Direct link

Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020

A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…

Descriptors: Simulation, Sample Size, Item Analysis, Scores

The Effect of Parceling on the Measurement Invariance of US Students' Trends in International Mathematics and Science Study (TIMSS) 2015 Math Attitude Scores

Direct link

Kritika Thapa – ProQuest LLC, 2023

Measurement invariance is crucial for making valid comparisons across different groups (Kline, 2016; Vandenberg, 2002). To address the challenges associated with invariance testing such as large sample size requirements, the complexity of the model, etc., applied researchers have incorporated parcels. Parcels have been shown to alleviate skewness,…

Descriptors: Elementary Secondary Education, Achievement Tests, Foreign Countries, International Assessment

Using Pooled Heteroskedastic Ordered Probit Models to Improve Small-Sample Estimates of Latent Test Score Distributions. CEPA Working Paper No. 19-05

Download full text

Shear, Benjamin R.; Reardon, Sean F. – Stanford Center for Education Policy Analysis, 2019

This paper describes a method for pooling grouped, ordered-categorical data across multiple waves to improve small-sample heteroskedastic ordered probit (HETOP) estimates of latent distributional parameters. We illustrate the method with aggregate proficiency data reporting the number of students in schools or districts scoring in each of a small…

Descriptors: Computation, Scores, Statistical Distributions, Sample Size

Validation Methods for Aggregate-Level Test Scale Linking: A Case Study Mapping School District Test Score Distributions to a Common Scale. CEPA Working Paper No. 16-09

Download full text

Reardon, Sean F.; Ho, Andrew D.; Kalogrides, Demetra – Stanford Center for Education Policy Analysis, 2019

Linking score scales across different tests is considered speculative and fraught, even at the aggregate level (Feuer et al., 1999; Thissen, 2007). We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that…

Descriptors: Test Validity, Evaluation Methods, School Districts, Scores

Extreme Response Style: Which Model Is Best?

Direct link

Leventhal, Brian – ProQuest LLC, 2017

More robust and rigorous psychometric models, such as multidimensional Item Response Theory models, have been advocated for survey applications. However, item responses may be influenced by construct-irrelevant variance factors such as preferences for extreme response options. Through empirical and simulation methods, this study evaluates the use…

Descriptors: Psychometrics, Item Response Theory, Simulation, Models

Previous Page | Next Page »

Pages: 1 | 2 | 3

Educational and Psychological…	4
ProQuest LLC	4
Journal of Educational and…	3
International Journal of…	2
Journal of Educational…	2
Online Submission	2
Stanford Center for Education…	2
AERA Online Paper Repository	1
Applied Measurement in…	1
Applied Psychological…	1
Comparative Education Review	1
Council of Chief State School…	1
Developmental Psychology	1
ETS Research Report Series	1
Journal of Experimental…	1
National Center for Education…	1
Practical Assessment,…	1
Psicologica: International…	1
Regional Educational…	1
Social Indicators Research	1
Springer	1
Structural Equation Modeling	1
Structural Equation Modeling:…	1
More ▼

Reardon, Sean F.	3
Custer, Michael	2
Davison, Mark L.	2
Dunbar, Stephen B.	2
Lee, Won-Chan	2
Shear, Benjamin R.	2
Blaker, Lisa	1
Bogaert, Jasper	1
Brown, Jane D.	1
Bruno D. Zumbo	1
Carl Falk	1
Chang, Yu-Wen	1
Chon, Kyong Hee	1
Chris G. Richardson	1
Cope, Ronald T.	1
Culbertson, Michael J.	1
Davenport, Ernest C., Jr.	1
DeMars, Christine E.	1
Donnellan, M. Brent	1
Doorey, Nancy A.	1
Dunbar, Stephen	1
Emma Somer	1
Eunike Wetzel	1
Francesco Innocenti	1
Frans E. S. Tan	1
More ▼