ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	5

Descriptor

Comparative Analysis	12
Computer Simulation	12
Test Construction	12
Foreign Countries	4
Teaching Methods	4
Equations (Mathematics)	3
Estimation (Mathematics)	3
Evaluation Methods	3
Mathematical Models	3
Scores	3
Bayesian Statistics	2
Competence	2
Computation	2
Computer Assisted Testing	2
Computer Software	2
Decision Making	2
Interrater Reliability	2
Item Response Theory	2
Models	2
Monte Carlo Methods	2
Problem Solving	2
Sample Size	2
Secondary School Students	2
Student Attitudes	2
Test Items	2
More ▼

Source

Applied Psychological…	1
International Educational…	1
International Group for the…	1
Journal of Education and…	1
Journal of Education for…	1
Journal of Educational…	1
Online Submission	1
Psychometrika	1

Publication Type

Reports - Evaluative	7
Journal Articles	6
Reports - Research	4
Speeches/Meeting Papers	4
Collected Works - Proceedings	1

Education Level

Higher Education	3
Postsecondary Education	2
Early Childhood Education	1
Elementary Secondary Education	1
Secondary Education	1

Audience

Location

Arkansas	1
Austria	1
Belgium	1
Botswana	1
Brazil	1
China (Shanghai)	1
Cyprus	1
Czech Republic	1
Egypt	1
Indonesia	1
Ireland	1
Malawi	1
Peru	1
Singapore	1
South Korea	1
Spain	1
Taiwan	1
United Kingdom (London)	1
United States	1
Virginia	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Major Field Achievement Test…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues

Peer reviewed
PDF on ERIC

Download full text

Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022

How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…

Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making

A Social Learning Program Facilitated by Facebook for Developing Some Creative Writing Skills and Motivation to Learn English among Secondary-One Students

Download full text

Abdullah, Mahmoud M. S.; Abdel-Gawad, Rehab A. El-sayed; Ibrahim, Ibrahim Badry Marwany – Online Submission, 2023

This study investigated the effectiveness of using a social learning programme facilitated by Facebook to develop some creative writing skills and motivation to learn English among secondary-one school students. Seventy students in secondary-one grade in Al-Shahid Hussein A. Abdul-Raouf Mixed Secondary School in Al-Maabda in the second semester of…

Descriptors: Creative Writing, Teaching Methods, English (Second Language), Second Language Learning

The Major Field Test in Business: A Solution to the Problem of Assurance of Learning Assessment?

Peer reviewed

Direct link

Green, Jeffrey J.; Stone, Courtenay Clifford; Zegeye, Abera – Journal of Education for Business, 2014

Colleges and universities are being asked by numerous sources to provide assurance of learning assessments of their students and programs. Colleges of business have responded by using a plethora of assessment tools, including the Major Field Test in Business. In this article, the authors show that the use of the Major Field Test in Business for…

Descriptors: Business Administration Education, Student Evaluation, Accreditation (Institutions), Comparative Analysis

The Development of Physics Learning Instrument Based on Hypermedia and Its Influence on the Student Problem Solving Skill

Peer reviewed
PDF on ERIC

Download full text

Amin, Bunga Dara; Mahmud, Alimuddin; Muris – Journal of Education and Practice, 2016

This research aims to produce a learning instrument based on hypermedia which is valid, interesting, practical, and effective as well as to know its influence on the problem based skill of students Mathematical and Science Faculty, Makassar State University. This research is a research and development at (R&D) type. The development procedure…

Descriptors: Test Construction, Science Tests, Physics, Hypermedia

A Comparison of the Logistic Regression and Mantel-Haenszel Procedures for Detecting Differential Item Functioning.

Peer reviewed

Rogers, H. Jane; Swaminathan, Hariharan – Applied Psychological Measurement, 1993

Performance of the logistic regression (LR) procedure was compared to that of the Mantel Haenszel (MH) procedure in the detection of uniform and nonuniform differential item functioning on a simulation examining distributional properties of the LR and MH test statistics and the relative power of the two procedures. (SLD)

Descriptors: Comparative Analysis, Computer Simulation, Item Bias, Mathematical Models

The Dimensionality of Test Data Generated by Compensatory and Non-Compensatory Two-Dimensional IRT Models and Its Effect on Model-Data-Fit.

Download full text

Liu, Xiufeng – 1992

The difference between compensatory and non-compensatory item response theory (IRT) models in terms of the dimensionality of test data generated by them, and its effect on the model-data-fit were examined. The STRESS (proportion of variance not accounted for by the multidimensional scaling model) and RSQ (proportion of variance accounted for by…

Descriptors: Chi Square, Comparative Analysis, Computer Simulation, Foreign Countries

The Effect of Small Calibration Sample Sizes on TOEFL IRT-Based Equating.

Download full text

Tang, K. Linda; And Others – 1993

This study compared the performance of the LOGIST and BILOG computer programs on item response theory (IRT) based scaling and equating for the Test of English as a Foreign Language (TOEFL) using real and simulated data and two calibration structures. Applications of IRT for the TOEFL program are based on the three-parameter logistic (3PL) model.…

Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Estimation (Mathematics)

A Modification of Feldt's Test of the Equality of Two Dependent Alpha Coefficients.

Peer reviewed

Alsawalmeh, Yousef M.; Feldt, Leonard S. – Psychometrika, 1994

A modification of a test of the equality of nonindependent alpha reliability coefficients is proposed. It avoids the limitation that the product of the number of test parts times the number of subjects be quite large. Monte Carlo studies indicate that this test can be used in comparing interrater reliabilities. (SLD)

Descriptors: Comparative Analysis, Computer Simulation, Equations (Mathematics), Interrater Reliability

A Comparison of Six Methods for Combining Multiple IRT Item Parameter Estimates.

Peer reviewed

McKinley, Robert L. – Journal of Educational Measurement, 1988

Six procedures for combining sets of item response theory (IRT) item parameter estimates from different samples were evaluated using real and simulated response data. Results support use of covariance matrix-weighted averaging and a procedure using sample-size-weighted averaging of estimated item characteristic curves at the center of the ability…

Descriptors: College Entrance Examinations, Comparative Analysis, Computer Simulation, Estimation (Mathematics)

Critical Emergency Medicine Procedural Skills: A Comparative Study of Methods for Teaching and Assessment.

Download full text

Chapman, Dane M.; And Others – 1993

Three critical procedural skills in emergency medicine were evaluated using three assessment modalities--written, computer, and animal model. The effects of computer practice and previous procedure experience on skill competence were also examined in an experimental sequential assessment design. Subjects were six medical students, six residents,…

Descriptors: Animals, Comparative Analysis, Competence, Computer Assisted Testing

Estimation of Ability Level by Using Only Observable Quantities in Adaptive Testing.

Download full text

Kirisci, Levent; Hsu, Tse-Chi – 1992

A predictive adaptive testing (PAT) strategy was developed based on statistical predictive analysis, and its feasibility was studied by comparing PAT performance to those of the Flexilevel, Bayesian modal, and expected a posteriori (EAP) strategies in a simulated environment. The proposed adaptive test is based on the idea of using item difficulty…

Descriptors: Adaptive Testing, Bayesian Statistics, Comparative Analysis, Computer Assisted Testing

Proceedings of the Conference of the International Group for the Psychology of Mathematics Education (30th, Prague, Czech Republic, July 16-21, 2006). Volume 1

Download full text

Novotna, Jarmila, Ed.; Moraova, Hana, Ed.; Kratka, Magdalena, Ed.; Stehlikova, Nad'a, Ed. – International Group for the Psychology of Mathematics Education, 2006

This volume of the 30th annual proceedings of the International Group for the Psychology of Mathematics Education conference presents: plenary panel papers; research forum papers; short oral communication papers; and poster presentation papers from the meeting. Information relating to discussion groups and working sessions is also provided.…

Descriptors: Program Effectiveness, Foreign Countries, Secondary School Students, Mathematics Instruction

Abdel-Gawad, Rehab A. El-sayed	1
Abdullah, Mahmoud M. S.	1
Alsawalmeh, Yousef M.	1
Amin, Bunga Dara	1
Chapman, Dane M.	1
Feldt, Leonard S.	1
Green, Jeffrey J.	1
Hsu, Tse-Chi	1
Ibrahim, Ibrahim Badry Marwany	1
Kirisci, Levent	1
Kratka, Magdalena, Ed.	1
Liu, Xiufeng	1
Mahmud, Alimuddin	1
McKinley, Robert L.	1
Moraova, Hana, Ed.	1
Muris	1
Novotna, Jarmila, Ed.	1
Piech, Chris	1
Rogers, H. Jane	1
Stehlikova, Nad'a, Ed.	1
Stone, Courtenay Clifford	1
Swaminathan, Hariharan	1
Tack, Anaïs	1
Tang, K. Linda	1
More ▼