ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	12

Descriptor

Error of Measurement	17
Scores	17
Statistical Distributions	17
Sample Size	6
Computation	5
Achievement Tests	4
Bayesian Statistics	4
Correlation	4
Statistical Analysis	4
Statistical Bias	4
Accuracy	3
Intervals	3
Regression (Statistics)	3
Test Theory	3
Comparative Analysis	2
Computer Simulation	2
Cutting Scores	2
Data Interpretation	2
Generalizability Theory	2
Grade 8	2
High Schools	2
Hypothesis Testing	2
Inferences	2
Item Response Theory	2
Mathematics Tests	2
More ▼

Source

Journal of Educational and…	3
ETS Research Report Series	2
Educational and Psychological…	2
ProQuest LLC	2
Stanford Center for Education…	2
Grantee Submission	1
Journal of Statistics…	1
Springer	1

Publication Type

Reports - Research	10
Journal Articles	8
Reports - Evaluative	4
Dissertations/Theses -…	2
Speeches/Meeting Papers	2
Books	1
Guides - Non-Classroom	1

Education Level

Secondary Education	2
Elementary Secondary Education	1
Grade 4	1
Grade 8	1
High Schools	1
Higher Education	1
Postsecondary Education	1

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Alabama High School…	1
National Longitudinal Survey…	1
National Merit Scholarship…	1
Peabody Individual…	1
Preliminary Scholastic…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

Using Pooled Heteroskedastic Ordered Probit Models to Improve Small-Sample Estimates of Latent Test Score Distributions

Peer reviewed
PDF on ERIC

Download full text

Direct link

Shear, Benjamin R.; Reardon, Sean F. – Journal of Educational and Behavioral Statistics, 2021

This article describes an extension to the use of heteroskedastic ordered probit (HETOP) models to estimate latent distributional parameters from grouped, ordered-categorical data by pooling across multiple waves of data. We illustrate the method with aggregate proficiency data reporting the number of students in schools or districts scoring in…

Descriptors: Statistical Analysis, Computation, Regression (Statistics), Sample Size

Using Pooled Heteroskedastic Ordered Probit Models to Improve Small-Sample Estimates of Latent Test Score Distributions. CEPA Working Paper No. 19-05

Download full text

Shear, Benjamin R.; Reardon, Sean F. – Stanford Center for Education Policy Analysis, 2019

This paper describes a method for pooling grouped, ordered-categorical data across multiple waves to improve small-sample heteroskedastic ordered probit (HETOP) estimates of latent distributional parameters. We illustrate the method with aggregate proficiency data reporting the number of students in schools or districts scoring in each of a small…

Descriptors: Computation, Scores, Statistical Distributions, Sample Size

Robust Bayesian Approaches in Growth Curve Modeling: Using Student's "t" Distributions versus a Semiparametric Method

Peer reviewed
PDF on ERIC

Download full text

Direct link

Tong, Xin; Zhang, Zhiyong – Grantee Submission, 2020

Despite broad applications of growth curve models, few studies have dealt with a practical issue -- nonnormality of data. Previous studies have used Student's "t" distributions to remedy the nonnormal problems. In this study, robust distributional growth curve models are proposed from a semiparametric Bayesian perspective, in which…

Descriptors: Robustness (Statistics), Bayesian Statistics, Models, Error of Measurement

Analysis of Thursday Night NFL Winning Margins

Peer reviewed

Direct link

Vaughan, Timothy S. – Journal of Statistics Education, 2015

This paper introduces a dataset and associated analysis of the scores of National Football League (NFL) games over the 2012, 2013, and first five weeks of the 2014 season. In the face of current media attention to "lopsided" scores in Thursday night games in the early part of the 2014 season, t-test results indicate no statistically…

Descriptors: Team Sports, Success, Scores, Statistics

Effectiveness of Item Response Theory (IRT) Proficiency Estimation Methods under Adaptive Multistage Testing. Research Report. ETS RR-15-11

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015

The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…

Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement

Differential Item Functioning for Accommodated Students with Disabilities: Effect of Differences in Proficiency Distributions

Direct link

Quesen, Sarah – ProQuest LLC, 2016

When studying differential item functioning (DIF) with students with disabilities (SWD) focal groups typically suffer from small sample size, whereas the reference group population is usually large. This makes it possible for a researcher to select a sample from the reference population to be similar to the focal group on the ability scale. Doing…

Descriptors: Test Items, Academic Accommodations (Disabilities), Testing Accommodations, Disabilities

Linking U.S. School District Test Score Distributions to a Common Scale. CEPA Working Paper No. 16-09

Download full text

Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Stanford Center for Education Policy Analysis, 2017

There is no comprehensive database of U.S. district-level test scores that is comparable across states. We describe and evaluate a method for constructing such a database. First, we estimate linear, reliability-adjusted linking transformations from state test score scales to the scale of the National Assessment of Educational Progress (NAEP). We…

Descriptors: School Districts, Scores, Statistical Distributions, Database Design

Effect of Observation Mode on Measures of Secondary Mathematics Teaching

Peer reviewed

Direct link

Casabianca, Jodi M.; McCaffrey, Daniel F.; Gitomer, Drew H.; Bell, Courtney A.; Hamre, Bridget K.; Pianta, Robert C. – Educational and Psychological Measurement, 2013

Classroom observation of teachers is a significant part of educational measurement; measurements of teacher practice are being used in teacher evaluation systems across the country. This research investigated whether observations made live in the classroom and from video recording of the same lessons yielded similar inferences about teaching.…

Descriptors: Secondary School Mathematics, Mathematics Instruction, Classroom Observation Techniques, Algebra

Comparing Trend and Gap Statistics across Tests: Distributional Change Using Ordinal Methods and Bayesian Inference

Direct link

Denbleyker, John Nickolas – ProQuest LLC, 2012

The shortcomings of the proportion above cut (PAC) statistic used so prominently in the educational landscape renders it a very problematic measure for making correct inferences with student test data. The limitations of PAC-based statistics are more pronounced with cross-test comparisons due to their dependency on cut-score locations. A better…

Descriptors: Achievement Gap, Bayesian Statistics, Inferences, Trend Analysis

Improved Reliability Estimates for Small Samples Using Empirical Bayes Techniques. Research Report. ETS RR-09-46

Peer reviewed
PDF on ERIC

Download full text

Oh, Hyeonjoo J.; Guo, Hongwen; Walker, Michael E. – ETS Research Report Series, 2009

Issues of equity and fairness across subgroups of the population (e.g., gender or ethnicity) must be seriously considered in any standardized testing program. For this reason, many testing programs require some means for assessing test characteristics, such as reliability, for subgroups of the population. However, often only small sample sizes are…

Descriptors: Standardized Tests, Test Reliability, Sample Size, Bayesian Statistics

Statistics and Data Interpretation for Social Work

Direct link

Rosenthal, James A. – Springer, 2011

Written by a social worker for social work students, this is a nuts and bolts guide to statistics that presents complex calculations and concepts in clear, easy-to-understand language. It includes numerous examples, data sets, and issues that students will encounter in social work practice. The first section introduces basic concepts and terms to…

Descriptors: Statistics, Data Interpretation, Social Work, Social Science Research

Standard Error of Linear Equating for the Counterbalanced Design.

Peer reviewed

Zeng, Lingjia; Cope, Ronald T. – Journal of Educational and Behavioral Statistics, 1995

Large-sample standard errors of linear equating for the counterbalanced design are derived using the general delta method. Computer simulations found that standard errors derived without the normality assumption were more accurate than those derived with the normality assumption in a large sample with moderately skewed score distributions. (SLD)

Descriptors: Computer Simulation, Error of Measurement, Research Design, Sample Size

Interval Estimation for True Raw and Scale Scores under the Binomial Error Model

Peer reviewed

Direct link

Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2006

Assuming errors of measurement are distributed binomially, this article reviews various procedures for constructing an interval for an individual's true number-correct score; presents two general interval estimation procedures for an individual's true scale score (i.e., normal approximation and endpoints conversion methods); compares various…

Descriptors: Probability, Intervals, Guidelines, Computer Simulation

The Use of Confidence Intervals When Interpreting Test Scores. EREAPA Publication Series No. 93-4.

Download full text

Wheeler, Patricia H. – 1993

A person's obtained score on a test provides an estimate of the individual's "true" score on that test. The obtained score is considered to have two parts, the true component and the error component. Classical test theory assumes that obtained scores for an individual over multiple administrations of the same test will lie symmetrically…

Descriptors: Cutting Scores, Error of Measurement, Scores, Statistical Distributions

Determinants of the Quota Selection Inequality Phenomenon: Clarification of the Basis for Gillett's (1991) Findings.

Peer reviewed

You, Soon-Hyung; Stone-Romero, Eugene F. – Educational and Psychological Measurement, 1996

To clarify the findings of R. Gillett (1991) about the inequality of the means of test scores of minority and majority examinees, the standard errors of the quota-selected sample means and the sampling distribution of these means were studied through Monte Carlo simulation. Results explain that the quota selection inequality results from…

Descriptors: Error of Measurement, Minority Groups, Monte Carlo Methods, Sampling

Previous Page | Next Page »

Pages: 1 | 2

Reardon, Sean F.	3
Shear, Benjamin R.	2
Bell, Courtney A.	1
Borrello, Gloria M.	1
Brennan, Robert L.	1
Casabianca, Jodi M.	1
Cope, Ronald T.	1
Denbleyker, John Nickolas	1
Gitomer, Drew H.	1
Guo, Hongwen	1
Hamre, Bridget K.	1
Ho, Andrew D.	1
Kalogrides, Demetra	1
Kim, Sooyeon	1
Kolen, Michael J.	1
Lee, Won-Chan	1
Lockwood, Robert E.	1
McCaffrey, Daniel F.	1
Moses, Tim	1
Oh, Hyeonjoo J.	1
Pianta, Robert C.	1
Quesen, Sarah	1
Rosenthal, James A.	1
Stone-Romero, Eugene F.	1
More ▼