ERIC - Search Results

Showing 1 to 15 of 48 results

Informative Hypothesis for Group Means Comparison

Peer reviewed

Tan, Teck Kiang – Practical Assessment, Research & Evaluation, 2023

Researchers often have hypotheses concerning the state of affairs in the population from which they sampled their data to compare group means. The classical frequentist approach provides one way of carrying out hypothesis testing using ANOVA to state the null hypothesis that there is no difference in the means and proceed with multiple comparisons…

Descriptors: Comparative Analysis, Hypothesis Testing, Statistical Analysis, Guidelines

Evaluating Methods for Assessing Model Fit in Diagnostic Classification Models

Peer reviewed

W. Jake Thompson – Grantee Submission, 2024

Diagnostic classification models (DCMs) are psychometric models that can be used to estimate the presence or absence of psychological traits, or proficiency on fine-grained skills. Critical to the use of any psychometric model in practice, including DCMs, is an evaluation of model fit. Traditionally, DCMs have been estimated with maximum…

Descriptors: Bayesian Statistics, Classification, Psychometrics, Goodness of Fit

The Probability of a Robust Inference for Internal Validity

Peer reviewed

Li, Tenglong; Frank, Ken – Sociological Methods & Research, 2022

The internal validity of observational study is often subject to debate. In this study, we define the counterfactuals as the unobserved sample and intend to quantify its relationship with the null hypothesis statistical testing (NHST). We propose the probability of a robust inference for internal validity, that is, the PIV, as a robustness index…

Descriptors: Probability, Inferences, Validity, Correlation

Investigating Orthographic versus Auditory Cross-Situational Word Learning with Online and Laboratory-Based Testing

Peer reviewed

Escudero, Paola; Smit, Eline A.; Angwin, Anthony J. – Language Learning, 2023

Research has shown that novel words can be learned through the mechanism of statistical or cross-situational word learning (CSWL). So far, CSWL studies using adult populations have focused on the presentation of spoken words. However, words can also be learned through their written form. This study compared auditory and orthographic presentations…

Descriptors: Word Lists, Vocabulary Development, Comparative Analysis, Auditory Stimuli

Progress Monitoring with Computer Adaptive Assessments: The Impact of Data Collection Schedule on Growth Estimates

Peer reviewed

Nelson, Peter M.; Van Norman, Ethan R.; Klingbeil, Dave A.; Parker, David C. – Psychology in the Schools, 2017

Although extensive research exists on the use of curriculum-based measures for progress monitoring, little is known about using computer adaptive tests (CATs) for progress-monitoring purposes. The purpose of this study was to evaluate the impact of the frequency of data collection on individual and group growth estimates using a CAT. Data were…

Descriptors: Progress Monitoring, Computer Assisted Testing, Data Collection, Scheduling

Debiasing Causal Inferences: Over and beyond Suboptimal Sampling

Peer reviewed

Rodríguez-Ferreiro, Javier; Vadillo, Miguel A.; Barberia, Itxaso – Teaching of Psychology, 2023

Background: We have previously presented two educational interventions aimed to diminish causal illusions and promote critical thinking. In both cases, these interventions reduced causal illusions developed in response to active contingency learning tasks, in which participants were able to decide whether to introduce the potential cause in each…

Descriptors: Sampling, Inferences, Psychology, Undergraduate Students

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Aggregating Polytomous DIF Results over Multiple Test Administrations

Peer reviewed

Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational Measurement, 2018

In typical differential item functioning (DIF) assessments, an item's DIF status is not influenced by its status in previous test administrations. An item that has shown DIF at multiple administrations may be treated the same way as an item that has shown DIF in only the most recent administration. Therefore, much useful information about the…

Descriptors: Test Bias, Testing, Test Items, Bayesian Statistics

Best, Second-Best, and Good-Enough Explanations: How They Matter to Reasoning

Peer reviewed

Douven, Igor; Mirabile, Patricia – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2018

There is a wealth of evidence that people's reasoning is influenced by explanatory considerations. Little is known, however, about the exact form this influence takes, for instance about whether the influence is unsystematic or because of people's following some rule. Three experiments investigate the descriptive adequacy of a precise proposal to…

Descriptors: Probability, Bayesian Statistics, Hypothesis Testing, Thinking Skills

Evaluating Predictive Models of Student Success: Closing the Methodological Gap

Peer reviewed

Gardner, Josh; Brooks, Christopher – Journal of Learning Analytics, 2018

Model evaluation -- the process of making inferences about the performance of predictive models -- is a critical component of predictive modelling research in learning analytics. We survey the state of the practice with respect to model evaluation in learning analytics, which overwhelmingly uses only naïve methods for model evaluation or…

Descriptors: Prediction, Models, Evaluation, Evaluation Methods

Methodological Reform in Quantitative Second Language Research: Effect Sizes, Bayesian Hypothesis Testing, and Bayesian Estimation of Effect Sizes

Norouzian, Reza – ProQuest LLC, 2018

This dissertation consists of three manuscripts. The manuscripts contribute to a budding "methodological reform" currently taking place in quantitative second-language (L2) research. In the first manuscript, the researcher describes an empirical investigation on the application of two well-known effect size estimators, eta-squared (eta…

Descriptors: Bayesian Statistics, Second Language Learning, Language Research, Periodicals

Playing with BEARS: Balancing Effort, Accuracy, and Response Speed in a Semantic Feature Verification Anomia Treatment Game

Peer reviewed

Evans, William S.; Cavanaugh, Robert; Quique, Yina; Boss, Emily; Starns, Jeffrey J.; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2021

Purpose: The purpose of this study was to develop and pilot a novel treatment framework called "BEARS" (Balancing Effort, Accuracy, and Response Speed). People with aphasia (PWA) have been shown to maladaptively balance speed and accuracy during language tasks. BEARS is designed to train PWA to balance speed-accuracy trade-offs and…

Descriptors: Accuracy, Semantics, Aphasia, Reaction Time

What Are the Odds? Modern Relevance and Bayes Factor Solutions for MacAlister's Problem from the 1881 "Educational Times"

Peer reviewed

Jamil, Tahira; Marsman, Maarten; Ly, Alexander; Morey, Richard D.; Wagenmakers, Eric-Jan – Educational and Psychological Measurement, 2017

In 1881, Donald MacAlister posed a problem in the "Educational Times" that remains relevant today. The problem centers on the statistical evidence for the effectiveness of a treatment based on a comparison between two proportions. A brief historical sketch is followed by a discussion of two default Bayesian solutions, one based on a…

Descriptors: Bayesian Statistics, Evidence, Comparative Analysis, Problem Solving

A Comparison of IRT Proficiency Estimation Methods under Adaptive Multistage Testing

Peer reviewed

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook – Journal of Educational Measurement, 2015

This inquiry is an investigation of item response theory (IRT) proficiency estimators' accuracy under multistage testing (MST). We chose a two-stage MST design that includes four modules (one at Stage 1, three at Stage 2) and three difficulty paths (low, middle, high). We assembled various two-stage MST panels (i.e., forms) by manipulating two…

Descriptors: Comparative Analysis, Item Response Theory, Computation, Accuracy

Item Selection and Ability Estimation Procedures for a Mixed-Format Adaptive Test

Peer reviewed

Ho, Tsung-Han; Dodd, Barbara G. – Applied Measurement in Education, 2012

In this study we compared five item selection procedures using three ability estimation methods in the context of a mixed-format adaptive test based on the generalized partial credit model. The item selection procedures used were maximum posterior weighted information, maximum expected information, maximum posterior weighted Kullback-Leibler…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection
