ERIC - Search Results

Publication Date

In 2025	0
Since 2024	3
Since 2021 (last 5 years)	15
Since 2016 (last 10 years)	40
Since 2006 (last 20 years)	72

Descriptor

Error of Measurement	85
Computation	40
Statistical Analysis	29
Item Response Theory	21
Sample Size	18
Simulation	17
Models	15
Scores	14
Statistical Inference	14
Regression (Statistics)	13
Bayesian Statistics	11
Correlation	11
Monte Carlo Methods	11
Effect Size	10
Maximum Likelihood Statistics	10
Probability	10
Statistical Bias	10
Comparative Analysis	9
Data Analysis	9
Hierarchical Linear Modeling	9
Test Items	9
Equated Scores	7
Longitudinal Studies	7
Sampling	7
Statistical Distributions	7
More ▼

Source

Journal of Educational and…

Publication Type

Journal Articles	85
Reports - Research	43
Reports - Evaluative	24
Reports - Descriptive	17
Book/Product Reviews	1
Information Analyses	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	5
Elementary Education	4
Higher Education	4
Grade 4	3
Grade 8	3
Intermediate Grades	3
Postsecondary Education	3
Grade 3	2
Grade 6	2
Middle Schools	2
Secondary Education	2
Early Childhood Education	1
Grade 5	1
Grade 7	1
High Schools	1
Junior High Schools	1
Primary Education	1
More ▼

Audience

Location

Italy	2
Netherlands	1
New York	1
Pennsylvania	1
United Kingdom (Scotland)	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	7
Program for International…	4
National Longitudinal Study…	2
Trends in International…	2
Behavioral Risk Factor…	1
Early Childhood Longitudinal…	1
Iowa Tests of Basic Skills	1
Measures of Academic Progress	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 85 results Save | Export

Using Regularization to Identify Measurement Bias across Multiple Background Characteristics: A Penalized Expectation-Maximization Algorithm

Peer reviewed

Direct link

William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024

Testing for differential item functioning (DIF) has undergone rapid statistical developments recently. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity, etc.), and regularization has shown promising results for identifying DIF among…

Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement

Analyzing Cross-Sectionally Clustered Data Using Generalized Estimating Equations

Peer reviewed

Direct link

Huang, Francis L. – Journal of Educational and Behavioral Statistics, 2022

The presence of clustered data is common in the sociobehavioral sciences. One approach that specifically deals with clustered data but has seen little use in education is the generalized estimating equations (GEEs) approach. We provide a background on GEEs, discuss why it is appropriate for the analysis of clustered data, and provide worked…

Descriptors: Multivariate Analysis, Computation, Correlation, Error of Measurement

Sample Size Calculation and Optimal Design for Multivariate Regression-Based Norming

Peer reviewed

Direct link

Francesco Innocenti; Math J. J. M. Candel; Frans E. S. Tan; Gerard J. P. van Breukelen – Journal of Educational and Behavioral Statistics, 2024

Normative studies are needed to obtain norms for comparing individuals with the reference population on relevant clinical or educational measures. Norms can be obtained in an efficient way by regressing the test score on relevant predictors, such as age and sex. When several measures are normed with the same sample, a multivariate regression-based…

Descriptors: Sample Size, Multivariate Analysis, Error of Measurement, Regression (Statistics)

Alternatives to Weighted Item Fit Statistics for Establishing Measurement Invariance in Many Groups

Peer reviewed

Direct link

Sean Joo; Montserrat Valdivia; Dubravka Svetina Valdivia; Leslie Rutkowski – Journal of Educational and Behavioral Statistics, 2024

Evaluating scale comparability in international large-scale assessments depends on measurement invariance (MI). The root mean square deviation (RMSD) is a standard method for establishing MI in several programs, such as the Programme for International Student Assessment and the Programme for the International Assessment of Adult Competencies.…

Descriptors: International Assessment, Monte Carlo Methods, Statistical Studies, Error of Measurement

Pooling Interactions into Error Terms in Multisite Experiments

Peer reviewed
PDF on ERIC

Download full text

Direct link

Wendy Chan; Larry Vernon Hedges – Journal of Educational and Behavioral Statistics, 2022

Multisite field experiments using the (generalized) randomized block design that assign treatments to individuals within sites are common in education and the social sciences. Under this design, there are two possible estimands of interest and they differ based on whether sites or blocks have fixed or random effects. When the average treatment…

Descriptors: Research Design, Educational Research, Statistical Analysis, Statistical Inference

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Direct link

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

An Explicit Form with Continuous Attribute Profile of the Partial Mastery DINA Model

Peer reviewed

Direct link

Shu, Tian; Luo, Guanzhong; Luo, Zhaosheng; Yu, Xiaofeng; Guo, Xiaojun; Li, Yujun – Journal of Educational and Behavioral Statistics, 2023

Cognitive diagnosis models (CDMs) are the statistical framework for cognitive diagnostic assessment in education and psychology. They generally assume that subjects' latent attributes are dichotomous--mastery or nonmastery, which seems quite deterministic. As an alternative to dichotomous attribute mastery, attention is drawn to the use of a…

Descriptors: Cognitive Measurement, Models, Diagnostic Tests, Accuracy

Handling Missing Data in Cross-Classified Multilevel Analyses: An Evaluation of Different Multiple Imputation Approaches

Peer reviewed

Direct link

Grund, Simon; Lüdtke, Oliver; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2023

Multiple imputation (MI) is a popular method for handling missing data. In education research, it can be challenging to use MI because the data often have a clustered structure that need to be accommodated during MI. Although much research has considered applications of MI in hierarchical data, little is known about its use in cross-classified…

Descriptors: Educational Research, Data Analysis, Error of Measurement, Computation

Assessing Inter-Rater Reliability with Heterogeneous Variance Components Models: Flexible Approach Accounting for Contextual Variables

Peer reviewed

Direct link

Martinková, Patrícia; Bartoš, František; Brabec, Marek – Journal of Educational and Behavioral Statistics, 2023

Inter-rater reliability (IRR), which is a prerequisite of high-quality ratings and assessments, may be affected by contextual variables, such as the rater's or ratee's gender, major, or experience. Identification of such heterogeneity sources in IRR is important for the implementation of policies with the potential to decrease measurement error…

Descriptors: Interrater Reliability, Bayesian Statistics, Statistical Inference, Hierarchical Linear Modeling

Detecting Compromised Items Using Information from Secure Items

Peer reviewed

Direct link

Wang, Xi; Liu, Yang – Journal of Educational and Behavioral Statistics, 2020

In continuous testing programs, some items are repeatedly used across test administrations, and statistical methods are often used to evaluate whether items become compromised due to examinees' preknowledge. In this study, we proposed a residual method to detect compromised items when a test can be partitioned into two subsets of items: secure…

Descriptors: Test Items, Information Security, Error of Measurement, Cheating

Adaptive Pairwise Comparison for Educational Measurement

Peer reviewed

Direct link

Crompvoets, Elise A. V.; Béguin, Anton A.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2020

Pairwise comparison is becoming increasingly popular as a holistic measurement method in education. Unfortunately, many comparisons are required for reliable measurement. To reduce the number of required comparisons, we developed an adaptive selection algorithm (ASA) that selects the most informative comparisons while taking the uncertainty of the…

Descriptors: Comparative Analysis, Statistical Analysis, Mathematics, Measurement

Statistical Power for Estimating Treatment Effects Using Difference-in-Differences and Comparative Interrupted Time Series Estimators with Variation in Treatment Timing

Peer reviewed

Direct link

Schochet, Peter Z. – Journal of Educational and Behavioral Statistics, 2022

This article develops new closed-form variance expressions for power analyses for commonly used difference-in-differences (DID) and comparative interrupted time series (CITS) panel data estimators. The main contribution is to incorporate variation in treatment timing into the analysis. The power formulas also account for other key design features…

Descriptors: Comparative Analysis, Statistical Analysis, Sample Size, Measurement Techniques

Modeling Item-Level Heterogeneous Treatment Effects with the Explanatory Item Response Model: Leveraging Large-Scale Online Assessments to Pinpoint the Impact of Educational Interventions

Peer reviewed

Direct link

Gilbert, Joshua B.; Kim, James S.; Miratrix, Luke W. – Journal of Educational and Behavioral Statistics, 2023

Analyses that reveal how treatment effects vary allow researchers, practitioners, and policymakers to better understand the efficacy of educational interventions. In practice, however, standard statistical methods for addressing heterogeneous treatment effects (HTE) fail to address the HTE that may exist "within" outcome measures. In…

Descriptors: Test Items, Item Response Theory, Computer Assisted Testing, Program Effectiveness

Using Pooled Heteroskedastic Ordered Probit Models to Improve Small-Sample Estimates of Latent Test Score Distributions

Peer reviewed
PDF on ERIC

Download full text

Direct link

Shear, Benjamin R.; Reardon, Sean F. – Journal of Educational and Behavioral Statistics, 2021

This article describes an extension to the use of heteroskedastic ordered probit (HETOP) models to estimate latent distributional parameters from grouped, ordered-categorical data by pooling across multiple waves of data. We illustrate the method with aggregate proficiency data reporting the number of students in schools or districts scoring in…

Descriptors: Statistical Analysis, Computation, Regression (Statistics), Sample Size

Estimating Linking Functions for Response Model Parameters

Peer reviewed

Direct link

Barrett, Michelle D.; van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2019

Parameter linking in item response theory is generally necessary to adjust for differences between the true values for the same item and ability parameters due to the use of different identifiability restrictions in different calibrations. The research reported in this article explores a precision-weighted (PW) approach to the problem of…

Descriptors: Item Response Theory, Computation, Error of Measurement, Test Items

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Reardon, Sean F.	3
Becker, Betsy Jane	2
Browne, Michael W.	2
Cai, Li	2
Grabovsky, Irina	2
Grund, Simon	2
Ho, Andrew D.	2
Longford, Nicholas T.	2
Lüdtke, Oliver	2
McCaffrey, Daniel F.	2
Miratrix, Luke W.	2
Oranje, Andreas	2
Robitzsch, Alexander	2
Sinharay, Sandip	2
Thissen, David	2
Wainer, Howard	2
Zwick, Rebecca	2
van der Linden, Wim J.	2
Adams, Raymond J.	1
Ahn, Soyeon	1
Algina, James	1
Aloe, Ariel M.	1
Baram, Tallie Z.	1
Barrett, Michelle D.	1
More ▼