ERIC - Search Results

Showing 1 to 15 of 92 results

Redefining Item Response Models for Small Samples

Peer reviewed

Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025

Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable represents the incidental parameter problem since the number of parameters increases when including data of new persons. Therefore, IRT models require a specific estimation method…

Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics

Propensity Score Methods for Causal Inference and Generalization

Peer reviewed

Wendy Chan – Asia Pacific Education Review, 2024

As evidence from evaluation and experimental studies continues to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…

Descriptors: Probability, Scores, Causal Models, Statistical Inference

Dynamic Fit Index Cutoffs for Hierarchical and Second-Order Factor Models

Peer reviewed

Daniel McNeish; Patrick D. Manapat – Structural Equation Modeling: A Multidisciplinary Journal, 2024

A recent review found that 11% of published factor models are hierarchical models with second-order factors. However, dedicated recommendations for evaluating hierarchical model fit have yet to emerge. Traditional benchmarks like RMSEA <0.06 or CFI >0.95 are often consulted, but they were never intended to generalize to hierarchical models.…

Descriptors: Factor Analysis, Goodness of Fit, Hierarchical Linear Modeling, Benchmarking

Informative Hypothesis for Group Means Comparison

Peer reviewed

Tan, Teck Kiang – Practical Assessment, Research & Evaluation, 2023

Researchers often have hypotheses concerning the state of affairs in the population from which they sampled their data to compare group means. The classical frequentist approach provides one way of carrying out hypothesis testing using ANOVA to state the null hypothesis that there is no difference in the means and proceed with multiple comparisons…

Descriptors: Comparative Analysis, Hypothesis Testing, Statistical Analysis, Guidelines

Selecting Relevant Moderators with Bayesian Regularized Meta-Regression

Peer reviewed

Van Lissa, Caspar J.; van Erp, Sara; Clapper, Eli-Boaz – Research Synthesis Methods, 2023

When meta-analyzing heterogeneous bodies of literature, meta-regression can be used to account for potentially relevant between-studies differences. A key challenge is that the number of candidate moderators is often high relative to the number of studies. This introduces risks of overfitting, spurious results, and model non-convergence. To…

Descriptors: Bayesian Statistics, Regression (Statistics), Maximum Likelihood Statistics, Meta Analysis

Redefining Populations of Inference for Generalizations from Small Studies

Peer reviewed

Wendy Chan; Jimin Oh; Katherine Wilson – Society for Research on Educational Effectiveness, 2022

Background: Over the past decade, research on the development and assessment of tools to improve the generalizability of experimental findings has grown extensively (Tipton & Olsen, 2018). However, many experimental studies in education are based on small samples, which may include 30-70 schools while inference populations to which…

Descriptors: Educational Research, Research Problems, Sample Size, Research Methodology

A Within-Study Approach to Evaluating the Role of Moderators of Impact in Limiting Generalizations from "Large to Small"

Peer reviewed

Jaciw, Andrew P.; Unlu, Fatih; Nguyen, Thanh – American Journal of Evaluation, 2022

There is a burgeoning body of evidence on the average impacts of educational programs. Yet, for many local decision makers, because impacts can vary across sites, the question of whether a certain program will work in their particular district or school remains. This article addresses the question of the generalizability of large-scale average…

Descriptors: Program Effectiveness, Generalization, Outcome Measures, Institutional Characteristics

Developing Competency Frameworks Using Natural Language Processing: An Exploratory Study

Peer reviewed

Garman, Andrew N.; Erwin, Taylor S.; Garman, Tyler R.; Kim, Dae Hyun – Journal of Competency-Based Education, 2021

Background: Competency models provide useful frameworks for organizing learning and assessment programs, but their construction is both time intensive and subject to perceptual biases. Some aspects of model development may be particularly well-suited to automation, specifically natural language processing (NLP), which could also help make them…

Descriptors: Natural Language Processing, Automation, Guidelines, Leadership Effectiveness

How Many Raters Can Be Enough: G Theory Applied to Assessment and Measurement of L2 Speech Perception

Peer reviewed

Kevin Hirschi; Okim Kang – Language Teaching Research Quarterly, 2023

This paper extends the use of Generalizability Theory to the measurement of extemporaneous L2 speech through the lens of speech perception. Using six datasets of previous studies, it reports on "G studies"--a method of breaking down measurement variance--and "D studies"--a predictive study of the impact on reliability when…

Descriptors: Evaluators, Generalization, Evaluation Methods, Speech Communication

Characterizing Students' Conceptual Difficulties with Mathematical Induction Using Visual Proofs

Peer reviewed

Relaford-Doyle, Josephine; Núñez, Rafael – International Journal of Research in Undergraduate Mathematics Education, 2021

This paper describes a study that used a novel method to investigate conceptual difficulties with mathematical induction among two groups of undergraduate students: students who had received university-level instruction in formal mathematical induction, and students who had not been exposed to formal mathematical induction at the university level.…

Descriptors: Concept Formation, Mathematical Concepts, Difficulty Level, Undergraduate Students

Combining Machine Learning and Qualitative Methods to Elaborate Students' Ideas about the Generality of Their Model-Based Explanations

Peer reviewed

Rosenberg, Joshua M.; Krist, Christina – Journal of Science Education and Technology, 2021

Assessing students' participation in science practices presents several challenges, especially when aiming to differentiate meaningful (vs. rote) forms of participation. In this study, we sought to use machine learning (ML) for a novel purpose in science assessment: developing a construct map for students' "consideration of generality,"…

Descriptors: Artificial Intelligence, Educational Technology, Technology Uses in Education, Models

Functional Communication Training: A Comprehensive Approach to Success for Educators

Peer reviewed

McClure, Erica B.; Burt, Jonathan L. – Beyond Behavior, 2023

Functional communication training (FCT) is a strategy to address problem behavior for students with various disabilities that is supported by a broad evidence base. Despite this support, multiple factors continue to dissuade educators from utilizing FCT in their classrooms. This article outlines the process of developing and implementing FCT plans…

Descriptors: Behavior Problems, Students with Disabilities, Intervention, Evidence Based Practice

An Intersectional Approach to DIF: Comparing Outcomes across Methods

Peer reviewed

Russell, Michael; Szendey, Olivia; Li, Zhushan – Educational Assessment, 2022

Recent research provides evidence that an intersectional approach to defining reference and focal groups results in a higher percentage of comparisons flagged for potential DIF. The study presented here examined the generalizability of this pattern across methods for examining DIF. While the level of DIF detection differed among the four methods…

Descriptors: Comparative Analysis, Item Analysis, Test Items, Test Construction

A Log-Linear Modeling Approach for Differential Item Functioning Detection in Polytomously Scored Items

Peer reviewed

Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020

A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…

Descriptors: Simulation, Sample Size, Item Analysis, Scores

Building an Initial Validity Argument for Binary and Analytic Rating Scales for an EFL Classroom Writing Assessment: Evidence from Many-Facets Rasch Measurement

Peer reviewed

Khamboonruang, Apichat – rEFLections, 2022

Although much research has compared the functioning between analytic and holistic rating scales, little research has compared the functioning of binary rating scales with other types of rating scales. This quantitative study set out to preliminarily and comparatively validate binary and analytic rating scales intended for use in formative…

Descriptors: Writing Evaluation, Evaluation Methods, Second Language Learning, Second Language Instruction
