Showing 1 to 15 of 48 results
Peer reviewed
Tan, Teck Kiang – Practical Assessment, Research & Evaluation, 2023
Researchers often have hypotheses concerning the state of affairs in the population from which they sampled their data to compare group means. The classical frequentist approach provides one way of carrying out hypothesis testing using ANOVA to state the null hypothesis that there is no difference in the means and proceed with multiple comparisons…
Descriptors: Comparative Analysis, Hypothesis Testing, Statistical Analysis, Guidelines
Peer reviewed
W. Jake Thompson – Grantee Submission, 2024
Diagnostic classification models (DCMs) are psychometric models that can be used to estimate the presence or absence of psychological traits, or proficiency on fine-grained skills. Critical to the use of any psychometric model in practice, including DCMs, is an evaluation of model fit. Traditionally, DCMs have been estimated with maximum…
Descriptors: Bayesian Statistics, Classification, Psychometrics, Goodness of Fit
Peer reviewed
Li, Tenglong; Frank, Ken – Sociological Methods & Research, 2022
The internal validity of observational studies is often subject to debate. In this study, we define the counterfactuals as the unobserved sample and intend to quantify its relationship with null hypothesis statistical testing (NHST). We propose the probability of a robust inference for internal validity, that is, the PIV, as a robustness index…
Descriptors: Probability, Inferences, Validity, Correlation
Peer reviewed
Escudero, Paola; Smit, Eline A.; Angwin, Anthony J. – Language Learning, 2023
Research has shown that novel words can be learned through the mechanism of statistical or cross-situational word learning (CSWL). So far, CSWL studies using adult populations have focused on the presentation of spoken words. However, words can also be learned through their written form. This study compared auditory and orthographic presentations…
Descriptors: Word Lists, Vocabulary Development, Comparative Analysis, Auditory Stimuli
Peer reviewed
Nelson, Peter M.; Van Norman, Ethan R.; Klingbeil, Dave A.; Parker, David C. – Psychology in the Schools, 2017
Although extensive research exists on the use of curriculum-based measures for progress monitoring, little is known about using computer adaptive tests (CATs) for progress-monitoring purposes. The purpose of this study was to evaluate the impact of the frequency of data collection on individual and group growth estimates using a CAT. Data were…
Descriptors: Progress Monitoring, Computer Assisted Testing, Data Collection, Scheduling
Peer reviewed
Rodríguez-Ferreiro, Javier; Vadillo, Miguel A.; Barberia, Itxaso – Teaching of Psychology, 2023
Background: We have previously presented two educational interventions aimed to diminish causal illusions and promote critical thinking. In both cases, these interventions reduced causal illusions developed in response to active contingency learning tasks, in which participants were able to decide whether to introduce the potential cause in each…
Descriptors: Sampling, Inferences, Psychology, Undergraduate Students
Peer reviewed
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
Peer reviewed
Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational Measurement, 2018
In typical differential item functioning (DIF) assessments, an item's DIF status is not influenced by its status in previous test administrations. An item that has shown DIF at multiple administrations may be treated the same way as an item that has shown DIF in only the most recent administration. Therefore, much useful information about the…
Descriptors: Test Bias, Testing, Test Items, Bayesian Statistics
Peer reviewed
Douven, Igor; Mirabile, Patricia – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2018
There is a wealth of evidence that people's reasoning is influenced by explanatory considerations. Little is known, however, about the exact form this influence takes, for instance about whether the influence is unsystematic or because of people's following some rule. Three experiments investigate the descriptive adequacy of a precise proposal to…
Descriptors: Probability, Bayesian Statistics, Hypothesis Testing, Thinking Skills
Peer reviewed
Gardner, Josh; Brooks, Christopher – Journal of Learning Analytics, 2018
Model evaluation -- the process of making inferences about the performance of predictive models -- is a critical component of predictive modelling research in learning analytics. We survey the state of the practice with respect to model evaluation in learning analytics, which overwhelmingly uses only naïve methods for model evaluation or…
Descriptors: Prediction, Models, Evaluation, Evaluation Methods
Norouzian, Reza – ProQuest LLC, 2018
This dissertation consists of three manuscripts. The manuscripts contribute to a budding "methodological reform" currently taking place in quantitative second-language (L2) research. In the first manuscript, the researcher describes an empirical investigation on the application of two well-known effect size estimators, eta-squared (eta…
Descriptors: Bayesian Statistics, Second Language Learning, Language Research, Periodicals
Peer reviewed
Evans, William S.; Cavanaugh, Robert; Quique, Yina; Boss, Emily; Starns, Jeffrey J.; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2021
Purpose: The purpose of this study was to develop and pilot a novel treatment framework called "BEARS" (Balancing Effort, Accuracy, and Response Speed). People with aphasia (PWA) have been shown to maladaptively balance speed and accuracy during language tasks. BEARS is designed to train PWA to balance speed-accuracy trade-offs and…
Descriptors: Accuracy, Semantics, Aphasia, Reaction Time
Peer reviewed
Jamil, Tahira; Marsman, Maarten; Ly, Alexander; Morey, Richard D.; Wagenmakers, Eric-Jan – Educational and Psychological Measurement, 2017
In 1881, Donald MacAlister posed a problem in the "Educational Times" that remains relevant today. The problem centers on the statistical evidence for the effectiveness of a treatment based on a comparison between two proportions. A brief historical sketch is followed by a discussion of two default Bayesian solutions, one based on a…
Descriptors: Bayesian Statistics, Evidence, Comparative Analysis, Problem Solving
Peer reviewed
Kim, Sooyeon; Moses, Tim; Yoo, Hanwook – Journal of Educational Measurement, 2015
This inquiry is an investigation of item response theory (IRT) proficiency estimators' accuracy under multistage testing (MST). We chose a two-stage MST design that includes four modules (one at Stage 1, three at Stage 2) and three difficulty paths (low, middle, high). We assembled various two-stage MST panels (i.e., forms) by manipulating two…
Descriptors: Comparative Analysis, Item Response Theory, Computation, Accuracy
Peer reviewed
Ho, Tsung-Han; Dodd, Barbara G. – Applied Measurement in Education, 2012
In this study we compared five item selection procedures using three ability estimation methods in the context of a mixed-format adaptive test based on the generalized partial credit model. The item selection procedures used were maximum posterior weighted information, maximum expected information, maximum posterior weighted Kullback-Leibler…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection