Kylie L. Anglin – Annenberg Institute for School Reform at Brown University, 2025
Since 2018, institutions of higher education have been aware of the "enrollment cliff," which refers to expected declines in future enrollment. This paper attempts to describe how prepared institutions in Ohio are for this future by looking at trends leading up to the anticipated decline. Using IPEDS data from 2012-2022, we analyze trends…
Descriptors: Validity, Artificial Intelligence, Models, Best Practices
Kylie Anglin – AERA Open, 2024
Given the rapid adoption of machine learning methods by education researchers, and the growing acknowledgment of their inherent risks, there is an urgent need for tailored methodological guidance on how to improve and evaluate the validity of inferences drawn from these methods. Drawing on an integrative literature review and extending a…
Descriptors: Validity, Artificial Intelligence, Models, Best Practices
Parkkinen, Veli-Pekka; Baumgartner, Michael – Sociological Methods & Research, 2023
In recent years, proponents of configurational comparative methods (CCMs) have advanced various dimensions of robustness as instrumental to model selection. But these robustness considerations have not led to computable robustness measures, and they have typically been applied to the analysis of real-life data with unknown underlying causal…
Descriptors: Robustness (Statistics), Comparative Analysis, Causal Models, Models
Manapat, Patrick D.; Edwards, Michael C. – Educational and Psychological Measurement, 2022
When fitting unidimensional item response theory (IRT) models, the population distribution of the latent trait (θ) is often assumed to be normally distributed. However, some psychological theories would suggest a nonnormal θ. For example, some clinical traits (e.g., alcoholism, depression) are believed to follow a positively skewed…
Descriptors: Robustness (Statistics), Computational Linguistics, Item Response Theory, Psychological Patterns
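The skewed-trait scenario described in the Manapat and Edwards (2022) abstract can be made concrete with a small simulation. The sketch below assumes a two-parameter logistic (2PL) model, invented item parameters, and a lognormal trait standardized to mean 0 and variance 1; these are illustrative choices, not the authors' simulation design. Data generated this way is exactly the kind of input for which normal-θ fitting routines may or may not be robust.

    import numpy as np

    rng = np.random.default_rng(0)

    # Positively skewed latent trait: lognormal, standardized to mean 0, SD 1
    # (an assumption for illustration; the paper's design may differ).
    raw = rng.lognormal(mean=0.0, sigma=1.0, size=5000)
    theta = (raw - raw.mean()) / raw.std()

    # Hypothetical 2PL item parameters: discriminations a_j and difficulties b_j.
    a = np.array([0.8, 1.0, 1.2, 1.5, 2.0])
    b = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])

    # P(correct) under the 2PL: logistic(a_j * (theta_i - b_j)).
    p = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b)))
    responses = rng.binomial(1, p)  # persons x items matrix of 0/1 responses

    print("trait skewness (clearly positive):", (theta ** 3).mean())
    print("observed proportions correct:", responses.mean(axis=0))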
Jang, Yoonsun; Cohen, Allan S. – Educational and Psychological Measurement, 2020
A nonconverged Markov chain can potentially lead to invalid inferences about model parameters. The purpose of this study was to assess the effect of a nonconverged Markov chain on the estimation of parameters for mixture item response theory models using a Markov chain Monte Carlo algorithm. A simulation study was conducted to investigate the…
Descriptors: Markov Processes, Item Response Theory, Accuracy, Inferences
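Nonconvergence of the kind Jang and Cohen (2020) study is usually screened with between- and within-chain diagnostics. Below is a minimal sketch of the classic Gelman-Rubin potential scale reduction factor (R-hat) for a single parameter; the chains here are synthetic draws rather than output from a mixture IRT model, and the simple formula omits the split-chain and rank-normalization refinements used in modern software.

    import numpy as np

    def gelman_rubin(chains):
        """Basic potential scale reduction factor (R-hat) for one scalar parameter.

        chains: array of shape (m, n) -- m chains, n post-burn-in draws each.
        Values near 1.0 suggest the chains have mixed; values well above 1.0
        signal nonconvergence.
        """
        chains = np.asarray(chains, dtype=float)
        m, n = chains.shape
        chain_means = chains.mean(axis=1)
        B = n * chain_means.var(ddof=1)        # between-chain variance
        W = chains.var(axis=1, ddof=1).mean()  # mean within-chain variance
        var_plus = (n - 1) / n * W + B / n     # pooled variance estimate
        return np.sqrt(var_plus / W)

    rng = np.random.default_rng(1)
    mixed = rng.normal(0.0, 1.0, size=(4, 2000))           # well-mixed chains
    stuck = mixed + np.array([[0.0], [0.0], [0.0], [3.0]])  # one chain stuck elsewhere
    print("R-hat, mixed chains:", round(gelman_rubin(mixed), 3))
    print("R-hat, nonconverged chains:", round(gelman_rubin(stuck), 3))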
Marmolejo-Ramos, Fernando; Cousineau, Denis – Educational and Psychological Measurement, 2017
The number of articles expressing dissatisfaction with the null hypothesis significance testing (NHST) framework has increased steadily over the years. Alternatives to NHST have been proposed, and the Bayesian approach seems to have gained the most visibility. In this last part of the special issue, a few alternative…
Descriptors: Hypothesis Testing, Bayesian Statistics, Evaluation Methods, Statistical Inference
Finch, Holmes – Practical Assessment, Research & Evaluation, 2022
Researchers in many disciplines work with ranking data. This data type is unique in that it is often deterministic in nature (the ranks of items 1 through k-1 determine the rank of item k), and the difference in a pair of rank scores separated by k units is equivalent regardless of the actual values of the two ranks in…
Descriptors: Data Analysis, Statistical Inference, Models, College Faculty
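The two properties the Finch (2022) abstract appeals to, namely that the last rank is fully determined once the other k-1 ranks are known and that a gap of a fixed number of rank positions is the same size anywhere on the scale, can be seen in a few lines. This is only an illustrative sketch with made-up rankings, not the modeling approach the article develops.

    items = ["A", "B", "C", "D"]

    # Suppose three of the four ranks are known; the fourth is forced.
    known = {"A": 2, "B": 4, "C": 1}
    remaining_rank = (set(range(1, len(items) + 1)) - set(known.values())).pop()
    print("rank of D is determined:", remaining_rank)  # -> 3

    # Ranks are equally spaced: a 2-position gap is the same "distance"
    # whether it lies at the top or the bottom of the ranking.
    print(abs(1 - 3) == abs(2 - 4))  # True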
Grice, James W.; Yepez, Maria; Wilson, Nicole L.; Shoda, Yuichi – Educational and Psychological Measurement, 2017
An alternative to null hypothesis significance testing is presented and discussed. This approach, referred to as observation-oriented modeling, is centered on model building in an effort to explicate the structures and processes believed to generate a set of observations. In terms of analysis, this novel approach complements traditional methods…
Descriptors: Hypothesis Testing, Models, Observation, Statistical Inference
Phillips, Lawrence – ProQuest LLC, 2015
Choosing specific implementational details is one of the most important aspects of creating and evaluating a model. In order to properly model cognitive processes, choices for these details must be made based on empirical research. Unfortunately, modelers are often forced to make decisions in the absence of relevant data. My work investigates the…
Descriptors: Role, Inferences, Prediction, Models
Stamey, James D.; Beavers, Daniel P.; Sherr, Michael E. – Sociological Methods & Research, 2017
Survey data are often subject to various types of errors such as misclassification. In this article, we consider a model where interest is simultaneously in two correlated response variables and one is potentially subject to misclassification. A motivating example of a recent study of the impact of a sexual education course for adolescents is…
Descriptors: Bayesian Statistics, Classification, Models, Correlation
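A standard way to see why the misclassification that Stamey, Beavers, and Sherr (2017) model matters is the Rogan-Gladen style correction for a misclassified binary outcome. The sketch below uses assumed sensitivity and specificity values and a plain frequentist correction rather than the article's joint Bayesian model for two correlated responses; it simply shows how a naïve proportion is biased and how knowledge of the error rates recovers the true rate.

    import numpy as np

    rng = np.random.default_rng(2)

    true_rate = 0.30       # true prevalence of the behavior being surveyed
    sensitivity = 0.70     # assumed P(report yes | truly yes)
    specificity = 0.95     # assumed P(report no  | truly no)

    truth = rng.binomial(1, true_rate, size=10_000)
    report_prob = np.where(truth == 1, sensitivity, 1.0 - specificity)
    observed = rng.binomial(1, report_prob)

    p_obs = observed.mean()
    # Rogan-Gladen correction: invert p_obs = sens*p + (1 - spec)*(1 - p).
    p_corrected = (p_obs + specificity - 1.0) / (sensitivity + specificity - 1.0)

    print("naive estimate:    ", round(p_obs, 3))        # biased downward
    print("corrected estimate:", round(p_corrected, 3))  # close to 0.30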
Lu, Hongjing; Chen, Dawn; Holyoak, Keith J. – Psychological Review, 2012
How can humans acquire relational representations that enable analogical inference and other forms of high-level reasoning? Using comparative relations as a model domain, we explore the possibility that bottom-up learning mechanisms applied to objects coded as feature vectors can yield representations of relations sufficient to solve analogy…
Descriptors: Inferences, Thinking Skills, Comparative Analysis, Models
Martínez, José Felipe; Schweig, Jonathan; Goldschmidt, Pete – Educational Evaluation and Policy Analysis, 2016
A key question facing teacher evaluation systems is how to combine multiple measures of complex constructs into composite indicators of performance. We use data from the Measures of Effective Teaching (MET) study to investigate the measurement properties of composite indicators obtained under various conjunctive, disjunctive (or complementary),…
Descriptors: Teacher Evaluation, Outcome Measures, Evaluation Methods, Educational Policy
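The combination rules named in the Martínez, Schweig, and Goldschmidt (2016) abstract differ in a way that is easy to show on toy data. The sketch below scores three hypothetical teachers on three standardized measures and combines them with a compensatory (weighted-average), conjunctive (weakest-measure), and disjunctive (strongest-measure) rule; the measure names and weights are invented, not the MET study's.

    import numpy as np

    # Rows: hypothetical teachers; columns: observation, student survey, value-added
    # (all assumed to be on a common standardized scale, for illustration only).
    scores = np.array([
        [ 0.8,  0.7,  0.9],   # uniformly strong
        [ 1.5, -1.2,  0.4],   # strong on one measure, weak on another
        [-0.2, -0.1,  0.0],   # uniformly middling
    ])
    weights = np.array([0.4, 0.3, 0.3])

    compensatory = scores @ weights      # weighted average: strengths offset weaknesses
    conjunctive  = scores.min(axis=1)    # judged by the weakest measure
    disjunctive  = scores.max(axis=1)    # judged by the strongest measure

    for name, composite in [("compensatory", compensatory),
                            ("conjunctive", conjunctive),
                            ("disjunctive", disjunctive)]:
        print(f"{name:13s} ranking (best first):", np.argsort(-composite))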
Callister Everson, Kimberlee; Feinauer, Erika; Sudweeks, Richard R. – Harvard Educational Review, 2013
In this article, the authors provide a methodological critique of the current standard of value-added modeling forwarded in educational policy contexts as a means of measuring teacher effectiveness. Conventional value-added estimates of teacher quality are attempts to determine to what degree a teacher would theoretically contribute, on average,…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Evaluation Methods, Accountability
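For readers unfamiliar with the estimates critiqued by Everson, Feinauer, and Sudweeks (2013), a bare-bones version of a conventional value-added calculation is sketched below: regress current scores on prior scores, then average each teacher's residuals. Real value-added specifications add many covariates and shrinkage; this toy version with simulated data only shows the basic logic the critique targets.

    import numpy as np

    rng = np.random.default_rng(3)

    n_teachers, n_students = 5, 40
    teacher = np.repeat(np.arange(n_teachers), n_students)
    true_effect = rng.normal(0.0, 0.2, size=n_teachers)   # unknown in practice

    prior = rng.normal(0.0, 1.0, size=teacher.size)
    current = 0.7 * prior + true_effect[teacher] + rng.normal(0.0, 0.5, size=teacher.size)

    # Step 1: predict current scores from prior scores (simple least squares).
    X = np.column_stack([np.ones_like(prior), prior])
    beta, *_ = np.linalg.lstsq(X, current, rcond=None)
    residuals = current - X @ beta

    # Step 2: a teacher's "value added" is the mean residual of their students.
    value_added = np.array([residuals[teacher == t].mean() for t in range(n_teachers)])
    print("estimated value-added:", np.round(value_added, 2))
    print("true teacher effects: ", np.round(true_effect - true_effect.mean(), 2))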
Ercikan, Kadriye; Oliveri, María Elena – Applied Measurement in Education, 2016
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
Descriptors: Evaluation Methods, Test Construction, Design, Scaling
Gelman, Andrew; Hill, Jennifer; Yajima, Masanao – Journal of Research on Educational Effectiveness, 2012
Applied researchers often find themselves making statistical inferences in settings that would seem to require multiple comparisons adjustments. We challenge the Type I error paradigm that underlies these corrections. Moreover, we posit that the problem of multiple comparisons can disappear entirely when viewed from a hierarchical Bayesian…
Descriptors: Intervals, Comparative Analysis, Inferences, Error Patterns
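The hierarchical perspective that Gelman, Hill, and Yajima (2012) argue dissolves the multiple-comparisons problem rests on partial pooling: group estimates are shrunk toward the overall mean in proportion to their noise, so extreme estimates, the ones that trigger spurious "significant" comparisons, are pulled in automatically. The sketch below applies a simple empirical-Bayes shrinkage formula to made-up group means and standard errors; it illustrates the idea, not the article's analysis.

    import numpy as np

    # Hypothetical group means (e.g., site-level effect estimates) and their
    # standard errors -- invented numbers for illustration.
    y  = np.array([ 0.28, -0.05,  0.10,  0.62, -0.30,  0.03])
    se = np.array([ 0.15,  0.10,  0.12,  0.30,  0.25,  0.08])

    grand = np.average(y, weights=1.0 / se**2)

    # Moment-style estimate of the between-group variance tau^2 (floored at 0).
    tau2 = max(y.var(ddof=1) - np.mean(se**2), 0.0)

    # Partial pooling: noisier groups are shrunk harder toward the grand mean.
    shrink = tau2 / (tau2 + se**2)
    pooled = grand + shrink * (y - grand)

    print("raw estimates:   ", np.round(y, 2))
    print("pooled estimates:", np.round(pooled, 2))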