ERIC - Search Results

Showing all 11 results

Claude, ChatGPT, Copilot, and Gemini Performance versus Students in Different Topics of Neuroscience

Peer reviewed

Volodymyr Mavrych; Ahmed Yaqinuddin; Olena Bolgova – Advances in Physiology Education, 2025

Despite extensive studies on large language models and their capability to respond to questions from various licensed exams, there has been limited focus on employing chatbots for specific subjects within the medical curriculum, specifically medical neuroscience. This research compared the performances of Claude 3.5 Sonnet (Anthropic), GPT-3.5 and…

Descriptors: Artificial Intelligence, Computer Software, Neurosciences, Medical Education

Developing a Systems Thinking Skills Assessment for Upper Primary Students in Thailand

Peer reviewed
PDF on ERIC

Thayaamol Upapong; Apantee Poonputta – Educational Process: International Journal, 2025

Background/purpose: The purposes of this research are to develop a reliable and valid assessment tool for measuring systems thinking skills in upper primary students in Thailand and to establish a normative criterion for evaluating their systems thinking abilities based on educational standards. Materials/methods: The study followed a three-phase…

Descriptors: Thinking Skills, Elementary School Students, Measures (Individuals), Foreign Countries

Development of Ecology Achievement Test for Secondary School Students

Peer reviewed
PDF on ERIC

Kevser Arslan; Asli Görgülü Ari – Shanlax International Journal of Education, 2024

This study aimed to develop a valid and reliable multiple-choice achievement test for the subject area of ecology. The study was conducted within the framework of exploratory sequential design based on mixed research methods, and the study group consisted of a total of 250 middle school students studying at the sixth and seventh grade level. In…

Descriptors: Ecology, Science Tests, Test Construction, Multiple Choice Tests

Changes in the Speed-Ability Relation through Different Treatments of Rapid Guessing

Peer reviewed

Deribo, Tobias; Goldhammer, Frank; Kroehne, Ulf – Educational and Psychological Measurement, 2023

As researchers in the social sciences, we are often interested in studying not directly observable constructs through assessments and questionnaires. But even in a well-designed and well-implemented study, rapid-guessing behavior may occur. Under rapid-guessing behavior, a task is skimmed shortly but not read and engaged with in-depth. Hence, a…

Descriptors: Reaction Time, Guessing (Tests), Behavior Patterns, Bias

Towards Design-Loop Adaptivity: Identifying Items for Revision

Peer reviewed
PDF on ERIC

Pelánek, Radek; Effenberger, Tomáš; Kukucka, Adam – Journal of Educational Data Mining, 2022

We study the automatic identification of educational items worthy of content authors' attention. Based on the results of such analysis, content authors can revise and improve the content of learning environments. We provide an overview of item properties relevant to this task, including difficulty and complexity measures, item discrimination, and…

Descriptors: Item Analysis, Identification, Difficulty Level, Case Studies

Developing Achievement Test: A Research for Assessment of 5th Grade Biology Subject

Peer reviewed
PDF on ERIC

Sener, Nilay; Tas, Erol – Journal of Education and Learning, 2017

The purpose of this study is to prepare a multiple-choice achievement test with high reliability and validity for the "Let's Solve the Puzzle of Our Body" unit. For this purpose, a multiple choice achievement test consisting of 46 items was applied to 178 fifth grade students in total. As a result of the test and material analysis…

Descriptors: Achievement Tests, Grade 5, Science Instruction, Biology

Correction of Item-Test Correlations and Attempts at Improving Reproducibility in Item-Analysis: An Experimental Approach.

Peer reviewed

Melzer, Charles W.; And Others – Educational and Psychological Measurement, 1981

The magnitude of statistical bias for the phi-coefficient was investigated, using computer simulated examinations in which all the students had equal knowledge. Several modifications of phi were tested, but when applied to real examinations, none succeeded in improving its reproducibility when items are re-used on equivalent student groups.…

Descriptors: Correlation, Item Analysis, Mathematical Models, Multiple Choice Tests

Subjective Judgment of Multiple-Choice Item Characteristics.

Peer reviewed

Green, Kathy E. – Educational and Psychological Measurement, 1983

This study was concerned with the reliability and validity of subjective judgments about five characteristics of multiple-choice test items from an introductory college-level astronomy test: (1) item difficulty, (2) language complexity, (3) content importance or relevance, (4) response set convergence, and (5) process complexity. (Author)

Descriptors: Achievement Tests, Astronomy, Difficulty Level, Evaluative Thinking

Validation of a Simplified Method for Determining Passing Scores for Criterion-Referenced, Multiple-Choice Tests.

Meredith, John B., Jr. – 1978

The complexity of defining accurate passing scores with a minimum classification error when evaluating criterion-referenced, multiple-choice tests has been a major problem for classroom teachers. Therefore, a practical procedure in which the instructor determines the plausibility of each item option for the minimally acceptable examinee is…

Descriptors: Criterion Referenced Tests, Cutting Scores, Difficulty Level, Item Analysis

Item Reliabilities for a Family of Answer-Until-Correct (AUC) Scoring Rules.

PDF pending restoration

Kane, Michael T.; Moloney, James M. – 1976

The Answer-Until-Correct (AUC) procedure has been proposed in order to increase the reliability of multiple-choice items. A model for examinees' behavior when they must respond to each item until they answer it correctly is presented. An expression for the reliability of AUC items, as a function of the characteristics of the item and the scoring…

Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Multiple Choice Tests

A Bayes' Subjective Probability Approach to the Nedelsky Procedure.

Miao, Chang Yu – 1987

Nedelsky (1954) has suggested a procedure for determining the minimum passing score on a multiple-choice test. In this procedure expert judges estimate the probable score of a minimally competent examinee. The technique does not refer to the students' performance data. The purposes of this paper are: (1) to introduce a modification to the Nedelsky…

Descriptors: Academic Standards, Analysis of Variance, Bayesian Statistics, Cutting Scores
