ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	9

Descriptor

Evaluation Methods	13
Sampling	13
Statistical Inference	13
Computation	5
Intervals	4
Research Methodology	4
Comparative Analysis	3
Correlation	3
Statistical Analysis	3
Statistical Significance	3
Classification	2
Evaluative Thinking	2
Foreign Countries	2
Item Response Theory	2
Measurement	2
Models	2
Psychology	2
Randomized Controlled Trials	2
Research Design	2
Simulation	2
Statistical Bias	2
Statistics	2
Alzheimers Disease	1
Animals	1
Aptitude Tests	1
More ▼

Source

Grantee Submission	3
Applied Psychological…	2
Topics in Early Childhood…	2
Career and Technical…	1
College Board	1
ProQuest LLC	1
Psychological Review	1
Statistics Education Research…	1

Publication Type

Journal Articles	7
Reports - Research	7
Opinion Papers	2
Reports - Evaluative	2
Dissertations/Theses -…	1
Numerical/Quantitative Data	1
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Higher Education	2
Secondary Education	2
Adult Education	1
Elementary Secondary Education	1
High Schools	1
Postsecondary Education	1

Audience

Researchers

Location

Australia	1
Canada	1
United Kingdom (Wales)	1

Laws, Policies, & Programs

Head Start

Assessments and Surveys

National Merit Scholarship…	1
Preliminary Scholastic…	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

A Unified Approach to Estimating the Intraclass Correlation Coefficient and Its Bias: An Exploratory Study

Direct link

Kelvin Terrell Pompey – ProQuest LLC, 2021

Many methods are used to measure interrater reliability for studies where each target receives ratings by a different set of judges. The purpose of this study is to explore the use of hierarchical modeling for estimating interrater reliability using the intraclass correlation coefficient. This study provides a description of how the ICC can be…

Descriptors: Interrater Reliability, Evaluation Methods, Test Reliability, Correlation

General Forms of Finite Population Central Limit Theorems with Applications to Causal Inference

Peer reviewed
PDF on ERIC

Download full text

Direct link

Xinran Li; Peng Ding – Grantee Submission, 2018

Frequentists' inference often delivers point estimators associated with confidence intervals or sets for parameters of interest. Constructing the confidence intervals or sets requires understanding the sampling distributions of the point estimators, which, in many but not all cases, are related to asymptotic Normal distributions ensured by central…

Descriptors: Correlation, Intervals, Sampling, Evaluation Methods

Randomization Inference for Treatment Effect Variation

Peer reviewed
PDF on ERIC

Download full text

Direct link

Ding Peng; Avi Feller; Luke Miratrix – Grantee Submission, 2016

Applied researchers are increasingly interested in whether and how treatment effects vary in randomized evaluations, especially variation not explained by observed covariates. We propose a model-free approach for testing for the presence of such unexplained variation. To use this randomization-based approach, we must address the fact that the…

Descriptors: Randomized Controlled Trials, Statistical Inference, Evaluation Methods, Testing

Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling

Peer reviewed
PDF on ERIC

Download full text

Direct link

Gongjun Xu; Tony Sit; Lan Wang; Chiung-Yu Huang – Grantee Submission, 2017

Biased sampling occurs frequently in economics, epidemiology, and medical studies either by design or due to data collecting mechanism. Failing to take into account the sampling bias usually leads to incorrect inference. We propose a unified estimation procedure and a computationally fast resampling method to make statistical inference for…

Descriptors: Sampling, Statistical Inference, Computation, Generalization

Evaluating Equity at the Local Level Using Bootstrap Tests. Research Report 2016-4

Download full text

Kim, YoungKoung; DeCarlo, Lawrence T. – College Board, 2016

Because of concerns about test security, different test forms are typically used across different testing occasions. As a result, equating is necessary in order to get scores from the different test forms that can be used interchangeably. In order to assure the quality of equating, multiple equating methods are often examined. Various equity…

Descriptors: Equated Scores, Evaluation Methods, Sampling, Statistical Inference

Coefficient Alpha Bootstrap Confidence Interval under Nonnormality

Peer reviewed

Direct link

Padilla, Miguel A.; Divers, Jasmin; Newton, Matthew – Applied Psychological Measurement, 2012

Three different bootstrap methods for estimating confidence intervals (CIs) for coefficient alpha were investigated. In addition, the bootstrap methods were compared with the most promising coefficient alpha CI estimation methods reported in the literature. The CI methods were assessed through a Monte Carlo simulation utilizing conditions…

Descriptors: Intervals, Monte Carlo Methods, Computation, Sampling

Standard Errors and Confidence Intervals from Bootstrapping for Ramsay-Curve Item Response Theory Model Item Parameters

Peer reviewed

Direct link

Gu, Fei; Skorupski, William P.; Hoyle, Larry; Kingston, Neal M. – Applied Psychological Measurement, 2011

Ramsay-curve item response theory (RC-IRT) is a nonparametric procedure that estimates the latent trait using splines, and no distributional assumption about the latent trait is required. For item parameters of the two-parameter logistic (2-PL), three-parameter logistic (3-PL), and polytomous IRT models, RC-IRT can provide more accurate estimates…

Descriptors: Intervals, Item Response Theory, Models, Evaluation Methods

Bayesian Analogy with Relational Transformations

Peer reviewed

Direct link

Lu, Hongjing; Chen, Dawn; Holyoak, Keith J. – Psychological Review, 2012

How can humans acquire relational representations that enable analogical inference and other forms of high-level reasoning? Using comparative relations as a model domain, we explore the possibility that bottom-up learning mechanisms applied to objects coded as feature vectors can yield representations of relations sufficient to solve analogy…

Descriptors: Inferences, Thinking Skills, Comparative Analysis, Models

Reporting Confidence Intervals and Effect Sizes: Collecting the Evidence

Peer reviewed

Direct link

Zientek, Linda Reichwein; Ozel, Z. Ebrar Yetkiner; Ozel, Serkan; Allen, Jeff – Career and Technical Education Research, 2012

Confidence intervals (CIs) and effect sizes are essential to encourage meta-analytic thinking and to accumulate research findings. CIs provide a range of plausible values for population parameters with a degree of confidence that the parameter is in that particular interval. CIs also give information about how precise the estimates are. Comparison…

Descriptors: Vocational Education, Effect Size, Intervals, Self Esteem

Significance Testing; Necessary but Insufficient.

Peer reviewed

Suen, Hoi K. – Topics in Early Childhood Special Education, 1992

This commentary on EC 603 695 argues that significance testing is a necessary but insufficient condition for positivistic research, that judgment-based assessment and single-subject research are not substitutes for significance testing, and that sampling fluctuation should be considered as one of numerous epistemological concerns in any…

Descriptors: Evaluation Methods, Evaluative Thinking, Research Design, Research Methodology

Large-Group Fantasies versus Single-Subject Science.

Peer reviewed

Da Prato, Robert A. – Topics in Early Childhood Special Education, 1992

This paper argues that judgment-based assessment of data from multiply replicated single-subject or small-N studies should replace normative-based (p=less than 0.05) assessment of large-N research in the clinical sciences, and asserts that inferential statistics should be abandoned as a method of evaluating clinical research data. (Author/JDD)

Descriptors: Evaluation Methods, Evaluative Thinking, Norms, Research Design

Student Description of Variation while Working with Weather Data

Peer reviewed

Direct link

Reading, Chris – Statistics Education Research Journal, 2004

Variation is a key concept in the study of statistics and its understanding is a crucial aspect of most statistically related tasks. This study aimed to extend and apply a hierarchy for describing students' understanding of variation that was developed in a sampling context to the context of a natural event in which variation occurs. Students aged…

Descriptors: Weather, Classification, Secondary School Students, Student Evaluation

Applying Generalizability Theory To Evaluate Treatment Effect in Single-Subject Research.

Download full text

Lefebvre, Daniel J.; Suen, Hoi K. – 1990

An empirical investigation of methodological issues associated with evaluating treatment effect in single-subject research (SSR) designs is presented. This investigation: (1) conducted a generalizability (G) study to identify the sources of systematic and random measurement error (SRME); (2) used an analytic approach based on G theory to integrate…

Descriptors: Classroom Observation Techniques, Disabilities, Educational Research, Error of Measurement

Suen, Hoi K.	2
Allen, Jeff	1
Avi Feller	1
Chen, Dawn	1
Chiung-Yu Huang	1
Da Prato, Robert A.	1
DeCarlo, Lawrence T.	1
Ding Peng	1
Divers, Jasmin	1
Gongjun Xu	1
Gu, Fei	1
Holyoak, Keith J.	1
Hoyle, Larry	1
Kelvin Terrell Pompey	1
Kim, YoungKoung	1
Kingston, Neal M.	1
Lan Wang	1
Lefebvre, Daniel J.	1
Lu, Hongjing	1
Luke Miratrix	1
Newton, Matthew	1
Ozel, Serkan	1
Ozel, Z. Ebrar Yetkiner	1
Padilla, Miguel A.	1
Peng Ding	1
More ▼