Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 5 |
Descriptor
| Bayesian Statistics | 11 |
| Comparative Analysis | 11 |
| Computer Simulation | 11 |
| Adaptive Testing | 4 |
| Computer Assisted Testing | 4 |
| Estimation (Mathematics) | 4 |
| Computer Software | 3 |
| Models | 3 |
| Efficiency | 2 |
| Evaluation Methods | 2 |
| Item Response Theory | 2 |
| More ▼ | |
Source
Author
| Andrade, Alejandro | 1 |
| Chen, Po-Hsi | 1 |
| Danish, Joshua A. | 1 |
| De Ayala, R. J. | 1 |
| Gifford, Janice A. | 1 |
| Hsu, Tse-Chi | 1 |
| Iseli, Markus R. | 1 |
| Kirisci, Levent | 1 |
| Koenig, Alan D. | 1 |
| Lee, John J. | 1 |
| Maltese, Adam V. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 6 |
| Reports - Evaluative | 5 |
| Reports - Research | 5 |
| Speeches/Meeting Papers | 3 |
| Collected Works - Proceedings | 1 |
Education Level
| Grade 3 | 1 |
| Higher Education | 1 |
| Postsecondary Education | 1 |
| Two Year Colleges | 1 |
Audience
Location
| California | 1 |
| Cameroon | 1 |
| Japan (Tokyo) | 1 |
| Nigeria | 1 |
| Turkey | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making
Norouzian, Reza; de Miranda, Michael; Plonsky, Luke – Language Learning, 2018
Frequentist methods have long dominated data analysis in quantitative second language (L2) research. Recently, however, several empirical fields have begun to embrace alternatives known as Bayesian methods. Using an open-source approach, we provide an applied, nontechnical rationale for Bayesian methods in L2 research. First, we compare the…
Descriptors: Second Language Learning, Language Research, Bayesian Statistics, Comparative Analysis
Andrade, Alejandro; Danish, Joshua A.; Maltese, Adam V. – Journal of Learning Analytics, 2017
Interactive learning environments with body-centric technologies lie at the intersection of the design of embodied learning activities and multimodal learning analytics. Sensing technologies can generate large amounts of fine-grained data automatically captured from student movements. Researchers can use these fine-grained data to create a…
Descriptors: Measurement, Interaction, Models, Educational Environment
Iseli, Markus R.; Koenig, Alan D.; Lee, John J.; Wainess, Richard – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2010
Assessment of complex task performance is crucial to evaluating personnel in critical job functions such as Navy damage control operations aboard ships. Games and simulations can be instrumental in this process, as they can present a broad range of complex scenarios without involving harm to people or property. However, "automatic"…
Descriptors: Performance Tests, Performance Based Assessment, Decision Making Skills, Military Training
Simonson, Michael, Ed. – Association for Educational Communications and Technology, 2015
For the thirty-eighth time, the Research and Theory Division of the Association for Educational Communications and Technology (AECT) is sponsoring the publication of these Proceedings. Papers published in this volume were presented at the annual AECT Convention in Indianapolis, Indiana. The Proceedings of AECT's Convention are published in two…
Descriptors: Information Technology, Educational Technology, Student Attitudes, Online Courses
Wang, Wen-Chung; Chen, Po-Hsi – Applied Psychological Measurement, 2004
Multidimensional adaptive testing (MAT) procedures are proposed for the measurement of several latent traits by a single examination. Bayesian latent trait estimation and adaptive item selection are derived. Simulations were conducted to compare the measurement efficiency of MAT with those of unidimensional adaptive testing and random…
Descriptors: Item Analysis, Adaptive Testing, Computer Assisted Testing, Computer Simulation
Penfield, Randall D. – Applied Measurement in Education, 2006
This study applied the maximum expected information (MEI) and the maximum posterior-weighted information (MPI) approaches of computer adaptive testing item selection to the case of a test using polytomous items following the partial credit model. The MEI and MPI approaches are described. A simulation study compared the efficiency of ability…
Descriptors: Bayesian Statistics, Adaptive Testing, Computer Assisted Testing, Test Items
Peer reviewedDe Ayala, R. J.; And Others – Journal of Educational Measurement, 1990
F. M. Lord's flexilevel, computerized adaptive testing (CAT) procedure was compared to an item-response theory-based CAT procedure that uses Bayesian ability estimation with various standard errors of estimates used for terminating the test. Ability estimates of flexilevel CATs were as accurate as were those of Bayesian CATs. (TJH)
Descriptors: Ability Identification, Adaptive Testing, Bayesian Statistics, Comparative Analysis
Peer reviewedGifford, Janice A.; Swaminathan, Hariharan – Applied Psychological Measurement, 1990
The effects of priors and amount of bias in the Bayesian approach to the estimation problem in item response models are examined using simulation studies. Different specifications of prior information have only modest effects on Bayesian estimates, which are less biased than joint maximum likelihood estimates for small samples. (TJH)
Descriptors: Bayesian Statistics, Comparative Analysis, Computer Simulation, Estimation (Mathematics)
Rule, David L. – 1993
Several regression methods were examined within the framework of weighted structural regression (WSR), comparing their regression weight stability and score estimation accuracy in the presence of outlier contamination. The methods compared are: (1) ordinary least squares; (2) WSR ridge regression; (3) minimum risk regression; (4) minimum risk 2;…
Descriptors: Analysis of Covariance, Bayesian Statistics, Comparative Analysis, Computer Simulation
Kirisci, Levent; Hsu, Tse-Chi – 1992
A predictive adaptive testing (PAT) strategy was developed based on statistical predictive analysis, and its feasibility was studied by comparing PAT performance to those of the Flexilevel, Bayesian modal, and expected a posteriori (EAP) strategies in a simulated environment. The proposed adaptive test is based on the idea of using item difficulty…
Descriptors: Adaptive Testing, Bayesian Statistics, Comparative Analysis, Computer Assisted Testing

Direct link
