The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making
Abdullah, Mahmoud M. S.; Abdel-Gawad, Rehab A. El-sayed; Ibrahim, Ibrahim Badry Marwany – Online Submission, 2023
This study investigated the effectiveness of using a social learning programme facilitated by Facebook to develop some creative writing skills and motivation to learn English among secondary-one school students. Seventy students in secondary-one grade in Al-Shahid Hussein A. Abdul-Raouf Mixed Secondary School in Al-Maabda in the second semester of…
Descriptors: Creative Writing, Teaching Methods, English (Second Language), Second Language Learning
Green, Jeffrey J.; Stone, Courtenay Clifford; Zegeye, Abera – Journal of Education for Business, 2014
Colleges and universities are being asked by numerous sources to provide assurance of learning assessments of their students and programs. Colleges of business have responded by using a plethora of assessment tools, including the Major Field Test in Business. In this article, the authors show that the use of the Major Field Test in Business for…
Descriptors: Business Administration Education, Student Evaluation, Accreditation (Institutions), Comparative Analysis
Amin, Bunga Dara; Mahmud, Alimuddin; Muris – Journal of Education and Practice, 2016
This research aims to produce a hypermedia-based learning instrument that is valid, interesting, practical, and effective, and to determine its influence on the problem-based skills of students in the Mathematics and Science Faculty, Makassar State University. The research is of the research and development (R&D) type. The development procedure…
Descriptors: Test Construction, Science Tests, Physics, Hypermedia

Rogers, H. Jane; Swaminathan, Hariharan – Applied Psychological Measurement, 1993
Performance of the logistic regression (LR) procedure was compared to that of the Mantel-Haenszel (MH) procedure in the detection of uniform and nonuniform differential item functioning, in a simulation examining distributional properties of the LR and MH test statistics and the relative power of the two procedures. (SLD)
Descriptors: Comparative Analysis, Computer Simulation, Item Bias, Mathematical Models
Liu, Xiufeng – 1992
The difference between compensatory and non-compensatory item response theory (IRT) models in terms of the dimensionality of test data generated by them, and its effect on model-data fit, were examined. The STRESS (proportion of variance not accounted for by the multidimensional scaling model) and RSQ (proportion of variance accounted for by…
Descriptors: Chi Square, Comparative Analysis, Computer Simulation, Foreign Countries
Tang, K. Linda; And Others – 1993
This study compared the performance of the LOGIST and BILOG computer programs on item response theory (IRT) based scaling and equating for the Test of English as a Foreign Language (TOEFL) using real and simulated data and two calibration structures. Applications of IRT for the TOEFL program are based on the three-parameter logistic (3PL) model.…
Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Estimation (Mathematics)

Alsawalmeh, Yousef M.; Feldt, Leonard S. – Psychometrika, 1994
A modification of a test of the equality of nonindependent alpha reliability coefficients is proposed. It avoids the limitation that the product of the number of test parts times the number of subjects be quite large. Monte Carlo studies indicate that this test can be used in comparing interrater reliabilities. (SLD)
Descriptors: Comparative Analysis, Computer Simulation, Equations (Mathematics), Interrater Reliability

McKinley, Robert L. – Journal of Educational Measurement, 1988
Six procedures for combining sets of item response theory (IRT) item parameter estimates from different samples were evaluated using real and simulated response data. Results support use of covariance matrix-weighted averaging and a procedure using sample-size-weighted averaging of estimated item characteristic curves at the center of the ability…
Descriptors: College Entrance Examinations, Comparative Analysis, Computer Simulation, Estimation (Mathematics)
Chapman, Dane M.; And Others – 1993
Three critical procedural skills in emergency medicine were evaluated using three assessment modalities--written, computer, and animal model. The effects of computer practice and previous procedure experience on skill competence were also examined in an experimental sequential assessment design. Subjects were six medical students, six residents,…
Descriptors: Animals, Comparative Analysis, Competence, Computer Assisted Testing
Kirisci, Levent; Hsu, Tse-Chi – 1992
A predictive adaptive testing (PAT) strategy was developed based on statistical predictive analysis, and its feasibility was studied by comparing PAT performance to that of the Flexilevel, Bayesian modal, and expected a posteriori (EAP) strategies in a simulated environment. The proposed adaptive test is based on the idea of using item difficulty…
Descriptors: Adaptive Testing, Bayesian Statistics, Comparative Analysis, Computer Assisted Testing
Novotna, Jarmila, Ed.; Moraova, Hana, Ed.; Kratka, Magdalena, Ed.; Stehlikova, Nad'a, Ed. – International Group for the Psychology of Mathematics Education, 2006
This volume of the 30th annual proceedings of the International Group for the Psychology of Mathematics Education conference presents: plenary panel papers; research forum papers; short oral communication papers; and poster presentation papers from the meeting. Information relating to discussion groups and working sessions is also provided.…
Descriptors: Program Effectiveness, Foreign Countries, Secondary School Students, Mathematics Instruction