ERIC - Search Results

Showing 1 to 15 of 198 results

Propensity Score Methods for Causal Inference and Generalization

Peer reviewed

Wendy Chan – Asia Pacific Education Review, 2024

As evidence from evaluation and experimental studies continues to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…

Descriptors: Probability, Scores, Causal Models, Statistical Inference

A Comparison of Three Popular Methods for Handling Missing Data: Complete-Case Analysis, Inverse Probability Weighting, and Multiple Imputation

Peer reviewed

Roderick J. Little; James R. Carpenter; Katherine J. Lee – Sociological Methods & Research, 2024

Missing data are a pervasive problem in data analysis. Three common methods for addressing the problem are (a) complete-case analysis, where only units that are complete on the variables in an analysis are included; (b) weighting, where the complete cases are weighted by the inverse of an estimate of the probability of being complete; and (c)…

Descriptors: Foreign Countries, Probability, Robustness (Statistics), Responses

A Context-Dependent Bayesian Account for Causal-Based Categorization

Peer reviewed

Marchant, Nicolás; Quillien, Tadeg; Chaigneau, Sergio E. – Cognitive Science, 2023

The causal view of categories assumes that categories are represented by features and their causal relations. To study the effect of causal knowledge on categorization, researchers have used Bayesian causal models. Within that framework, categorization may be viewed as dependent on a likelihood computation (i.e., the likelihood of an exemplar with…

Descriptors: Classification, Bayesian Statistics, Causal Models, Evaluation Methods

Quantitative Techniques with Small Sample Sizes: An Educational Summer Camp Example

Peer reviewed

Trina Johnson Kilty; Kevin T. Kilty; Andrea C. Burrows Borowczak; Mike Borowczak – Problems of Education in the 21st Century, 2024

A computer science camp for pre-collegiate students was operated during the summers of 2022 and 2023. The effect the camp had on attitudes was quantitatively assessed using a survey instrument. However, enrollment at the summer camp was small, which meant the well-known Pearson's chi-squared test for measuring the significance of results was not applied.…

Descriptors: Summer Programs, Camps, Computer Science Education, 21st Century Skills

Enhancing Recall in Automated Record Screening: A Resampling Algorithm

Peer reviewed

Zhipeng Hou; Elizabeth Tipton – Research Synthesis Methods, 2024

Literature screening is the process of identifying all relevant records from a pool of candidate paper records in systematic review, meta-analysis, and other research synthesis tasks. This process is time consuming, expensive, and prone to human error. Screening prioritization methods attempt to help reviewers identify most relevant records while…

Descriptors: Meta Analysis, Research Reports, Identification, Evaluation Methods

Latent Profile Transition Analysis with Random Intercepts (RI-LPTA)

Peer reviewed

Ming-Chi Tseng – Structural Equation Modeling: A Multidisciplinary Journal, 2024

The primary objective of this investigation is the formulation of random intercept latent profile transition analysis (RI-LPTA). Our simulation investigation suggests that the choice between LPTA and RI-LPTA for examination has negligible impact on the estimation of transition probability parameters when the population parameters are generated…

Descriptors: Monte Carlo Methods, Predictor Variables, Research Methodology, Test Bias

The Choice of Response Probability in Bookmark Standard Setting: An Experimental Study

Peer reviewed

Baldwin, Peter; Margolis, Melissa J.; Clauser, Brian E.; Mee, Janet; Winward, Marcia – Educational Measurement: Issues and Practice, 2020

Evidence of the internal consistency of standard-setting judgments is a critical part of the validity argument for tests used to make classification decisions. The bookmark standard-setting procedure is a popular approach to establishing performance standards, but there is relatively little research that reflects on the internal consistency of the…

Descriptors: Standard Setting (Scoring), Probability, Cutting Scores, Evaluation Methods

Estimating a Dose-Response Relationship in Quasi-Experimental Student Success Studies

Peer reviewed

Shao, Lucy; Levine, Richard A.; Guarcello, Maureen A.; Wilke, Morten C.; Stronach, Jeanne; Frazee, James P.; Fan, Juanjuan – International Journal of Artificial Intelligence in Education, 2023

Propensity score matching and weighting methods are applied to balance covariates and reduce selection bias in the analysis of observational study data, and ultimately estimate a treatment effect. We wish to evaluate the impact of a Supplemental Instruction (SI) program on student success in an Introductory Statistics course. In such student…

Descriptors: Statistical Bias, Probability, Scores, Weighted Scores

Optimizing Count Responses in Surveys: A Machine-Learning Approach

Peer reviewed

Fu, Qiang; Guo, Xin; Land, Kenneth C. – Sociological Methods & Research, 2020

Count responses with grouping and right censoring have long been used in surveys to study a variety of behaviors, status, and attitudes. Yet grouping or right-censoring decisions of count responses still rely on arbitrary choices made by researchers. We develop a new method for evaluating grouping and right-censoring decisions of count responses…

Descriptors: Surveys, Artificial Intelligence, Evaluation Methods, Probability

Forced-Choice Ranking Models for Raters' Ranking Data

Peer reviewed

Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022

To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…

Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences

Modeling and Analyzing Inquiry Strategies in Open-Ended Learning Environments

Peer reviewed

Käser, Tanja; Schwartz, Daniel L. – International Journal of Artificial Intelligence in Education, 2020

Modeling and predicting student learning in computer-based environments often relies solely on sequences of accuracy data. Previous research suggests that it matters not only what we learn, but also how we learn. The detection and analysis of learning behavior becomes especially important when dealing with open-ended exploration environments,…

Descriptors: Inquiry, Learning Strategies, Outcomes of Education, Academic Achievement

A Propensity Score Method for Investigating Differential Item Functioning in Performance Assessment

Peer reviewed

Chen, Michelle Y.; Liu, Yan; Zumbo, Bruno D. – Educational and Psychological Measurement, 2020

This study introduces a novel differential item functioning (DIF) method based on propensity score matching that tackles two challenges in analyzing performance assessment data, that is, continuous task scores and lack of a reliable internal variable as a proxy for ability or aptitude. The proposed DIF method consists of two main stages. First,…

Descriptors: Probability, Scores, Evaluation Methods, Test Items

Motivations for Using the Item Response Theory Nominal Response Model to Rank Responses to Multiple-Choice Items

Peer reviewed

Smith, Trevor I.; Bendjilali, Nasrine – Physical Review Physics Education Research, 2022

Several recent studies have employed item response theory (IRT) to rank incorrect responses to commonly used research-based multiple-choice assessments. These studies use Bock's nominal response model (NRM) for applying IRT to categorical (nondichotomous) data, but the response rankings only utilize half of the parameters estimated by the model.…

Descriptors: Item Response Theory, Test Items, Multiple Choice Tests, Science Tests

Using a Naive Bayesian Approach to Identify Academic Risk Based on Multiple Sources: A Conceptual Replication

Peer reviewed

Carly Oddleifson; Stephen Kilgus; David A. Klingbeil; Alexander D. Latham; Jessica S. Kim; Ishan N. Vengurlekar – Grantee Submission, 2025

The purpose of this study was to conduct a conceptual replication of Pendergast et al.'s (2018) study that examined the diagnostic accuracy of a nomogram procedure, also known as a naive Bayesian approach. The specific naive Bayesian approach combined academic and social-emotional and behavioral (SEB) screening data to predict student performance…

Descriptors: Bayesian Statistics, Accuracy, Social Emotional Learning, Diagnostic Tests

Metrics for Discrete Student Models: Chance Levels, Comparisons, and Use Cases

Peer reviewed

Bosch, Nigel; Paquette, Luc – Journal of Learning Analytics, 2018

Metrics including Cohen's kappa, precision, recall, and F1 are common measures of performance for models of discrete student states, such as a student's affect or behaviour. This study examined discrete model metrics for previously published student model examples to identify situations where metrics provided differing perspectives on…

Descriptors: Models, Comparative Analysis, Prediction, Probability
