Showing 1 to 15 of 29 results
Peer reviewed
Kunal Sareen – Innovations in Education and Teaching International, 2024
This study examines the proficiency of ChatGPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement Test"…
Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software
Alicia A. Stoltenberg – ProQuest LLC, 2024
Multiple-select multiple-choice items, or multiple-choice items with more than one correct answer, are used to quickly assess content on standardized assessments. Because there are multiple keys to these item types, there are also multiple ways to score student responses to these items. The purpose of this study was to investigate how changing the…
Descriptors: Scoring, Evaluation Methods, Multiple Choice Tests, Standardized Tests
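The Stoltenberg (2024) abstract above turns on the fact that a multiple-select item can be scored in more than one way. As a rough illustration only (not the scoring rules examined in the study), the Python sketch below contrasts all-or-nothing scoring with per-option partial credit; the item, key, and response are invented.

# Illustrative sketch: two common ways to score a multiple-select
# multiple-choice item, given the keyed options and the options selected.

def score_all_or_nothing(key: set, response: set) -> float:
    """Full credit only when the selection matches the key exactly."""
    return 1.0 if response == key else 0.0

def score_partial_per_option(key: set, response: set, n_options: int) -> float:
    """One point share per option judged correctly (selected if keyed,
    left unselected if not), scaled to the 0-1 range."""
    correct_judgements = sum(
        (opt in key) == (opt in response) for opt in range(n_options)
    )
    return correct_judgements / n_options

key = {0, 2}        # options A and C are keyed
response = {0, 3}   # examinee marked A and D
print(score_all_or_nothing(key, response))          # 0.0
print(score_partial_per_option(key, response, 4))   # 0.5 (A and B judged correctly)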
Peer reviewed
Güntay Tasçi – Science Insights Education Frontiers, 2024
The present study has aimed to develop and validate a protein concept inventory (PCI) consisting of 25 multiple-choice (MC) questions to assess students' understanding of protein, which is a fundamental concept across different biology disciplines. The development process of the PCI involved a literature review to identify protein-related content,…
Descriptors: Science Instruction, Science Tests, Multiple Choice Tests, Biology
Peer reviewed
Congning Ni; Bhashithe Abeysinghe; Juanita Hicks – International Electronic Journal of Elementary Education, 2025
The National Assessment of Educational Progress (NAEP), often referred to as The Nation's Report Card, offers a window into the state of the U.S. K-12 education system. Since 2017, NAEP has transitioned to digital assessments, opening new research opportunities that were previously impossible. Process data tracks students' interactions with the…
Descriptors: Reaction Time, Multiple Choice Tests, Behavior Change, National Competency Tests
Peer reviewed
Coniam, David; Lee, Tony; Milanovic, Michael; Pike, Nigel; Zhao, Wen – Language Education & Assessment, 2022
The calibration of test materials generally involves the interaction between empirical analysis and expert judgement. This paper explores the extent to which scale familiarity might affect expert judgement as a component of test validation in the calibration process. It forms part of a larger study that investigates the alignment of the…
Descriptors: Specialists, Language Tests, Test Validity, College Faculty
Peer reviewed
Frey, Bruce B.; Ellis, James D.; Bulgreen, Janis A.; Hare, Jana Craig; Ault, Marilyn – Electronic Journal of Science Education, 2015
"Scientific argumentation," defined as the ability to develop and analyze scientific claims, support claims with evidence from investigations of the natural world, and explain and evaluate the reasoning that connects the evidence to the claim, is a critical component of current science standards and is consistent with "Common Core…
Descriptors: Test Construction, Science Tests, Persuasive Discourse, Science Process Skills
Peer reviewed
Tsopanoglou, Antonios; Ypsilandis, George S.; Mouti, Anna – Language Learning in Higher Education, 2014
Multiple-choice (MC) tests are frequently used to measure language competence because they are quick, economical and straightforward to score. While degrees of correctness have been investigated for partially correct responses in combined-response MC tests, degrees of incorrectness in distractors and the role they play in determining the…
Descriptors: Scoring, Pilot Projects, Multiple Choice Tests, Language Tests
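To make the idea of degrees of incorrectness in distractors concrete, here is a hypothetical Python sketch of polychotomous scoring in which each option carries a weight reflecting how wrong it is. The weights are invented for illustration and are not drawn from the Tsopanoglou, Ypsilandis and Mouti study.

# Hypothetical illustration: weighting options by how wrong they are, so
# choosing a "nearly right" distractor loses less credit than choosing a
# clearly wrong one. The weights below are invented for the example.

item_weights = {
    "A": 1.0,   # key
    "B": 0.5,   # plausible, partially correct distractor
    "C": 0.2,   # weak distractor
    "D": 0.0,   # clearly wrong
}

def weighted_score(choice: str, weights: dict) -> float:
    return weights.get(choice, 0.0)

print(weighted_score("B", item_weights))  # 0.5 rather than a flat 0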
Peer reviewed
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010
In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…
Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias
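As background for the Kim, Walker and McHale entry, the sketch below shows the simplest sense in which scores from two test forms can be placed on a common scale: linear equating by matching means and standard deviations. This only illustrates the general idea; the study itself compares more elaborate nonequivalent-groups equating designs involving MC and CR items, and the scores below are made up.

# Minimal sketch of linear equating: map a score on form X onto the scale
# of form Y by matching the two score distributions' means and standard
# deviations. Illustrative only; not the designs compared in the study.

import statistics

def linear_equate(x_score, x_scores, y_scores):
    mx, sx = statistics.mean(x_scores), statistics.pstdev(x_scores)
    my, sy = statistics.mean(y_scores), statistics.pstdev(y_scores)
    return my + (sy / sx) * (x_score - mx)

form_x = [42, 55, 61, 48, 70, 53]   # made-up scores on form X
form_y = [40, 52, 58, 45, 66, 50]   # made-up scores on form Y
print(round(linear_equate(60, form_x, form_y), 1))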
Peer reviewed
Chang, Shao-Hua; Lin, Pei-Chun; Lin, Zih-Chuan – Educational Technology & Society, 2007
This study investigates differences in the partial scoring performance of examinees in elimination testing and conventional dichotomous scoring of multiple-choice tests implemented on a computer-based system. Elimination testing that uses the same set of multiple-choice items rewards examinees with partial knowledge over those who are simply…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Scoring, Item Analysis
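One way to see how elimination testing rewards partial knowledge is with a small scoring sketch. The rule below (a point per distractor correctly crossed out, a penalty for crossing out the key) is a common illustrative scheme, not necessarily the exact rule implemented in the Chang, Lin and Lin system.

# Illustrative elimination-scoring rule: the examinee crosses out every
# option believed wrong, earns a point per distractor eliminated, and is
# penalised for eliminating the key. Partial knowledge earns partial credit.

def elimination_score(key: str, eliminated: set, options: set) -> int:
    distractors = options - {key}
    points = len(eliminated & distractors)   # distractors correctly ruled out
    if key in eliminated:
        points -= len(distractors)           # penalty for ruling out the key
    return points

options = {"A", "B", "C", "D"}
print(elimination_score("A", {"C", "D"}, options))       # 2: partial knowledge
print(elimination_score("A", {"B", "C", "D"}, options))  # 3: full knowledge
print(elimination_score("A", {"A", "B"}, options))       # -2: key eliminated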
Peer reviewed
Hogan, Thomas P.; Murphy, Gavin – Applied Measurement in Education, 2007
We determined the recommendations for preparing and scoring constructed-response (CR) test items in 25 sources (textbooks and chapters) on educational and psychological measurement. The project was similar to Haladyna's (2004) analysis for multiple-choice items. We identified 12 recommendations for preparing CR items given by multiple sources,…
Descriptors: Test Items, Scoring, Test Construction, Educational Indicators
Kehoe, Jerard – 1995
This digest presents a list of recommendations for writing multiple-choice test items and for improving them through item analysis. Item statistics are typically provided by a measurement or test-scoring service where tests are machine-scored, or by testing software packages. Test makers can capitalize on the fact that "bad" items can be differentiated from…
Descriptors: Item Analysis, Item Banks, Measurement Techniques, Multiple Choice Tests
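The item statistics the Kehoe digest refers to usually include a discrimination index for each item. As an assumed, minimal example of how a "bad" item might be flagged, the sketch below computes a point-biserial correlation between an item's 0/1 scores and examinees' total scores; the data are invented.

# Sketch of one statistic commonly used to flag weak multiple-choice items:
# the point-biserial correlation between item score (0/1) and total score.
# Items with low or negative discrimination are candidates for revision.

import statistics

def point_biserial(item_scores, total_scores):
    mean_total = statistics.mean(total_scores)
    sd_total = statistics.pstdev(total_scores)
    correct = [t for i, t in zip(item_scores, total_scores) if i == 1]
    p = len(correct) / len(item_scores)   # proportion answering correctly
    q = 1 - p
    mean_correct = statistics.mean(correct)
    return (mean_correct - mean_total) / sd_total * (p / q) ** 0.5

item = [1, 0, 1, 1, 0, 1, 0, 1]                   # invented item scores
total = [34, 21, 30, 28, 19, 33, 25, 31]          # invented total scores
print(round(point_biserial(item, total), 2))      # high value: item discriminates well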
Peer reviewed
Wilcox, Rand R. – Psychometrika, 1983
A procedure for determining the reliability of an examinee knowing k out of n possible multiple choice items given his or her performance on those items is presented. Also, a scoring procedure for determining which items an examinee knows is presented. (Author/JKS)
Descriptors: Item Analysis, Latent Trait Theory, Measurement Techniques, Multiple Choice Tests
Peer reviewed
Feldman, David H.; Markwalder, Winston – Educational and Psychological Measurement, 1971
Descriptors: Cognitive Development, Cognitive Measurement, Developmental Psychology, Item Analysis
Bradshaw, Charles W., Jr. – 1968
A method for determining invariant item parameters is presented, along with a scheme for obtaining test scores which are interpretable in terms of a common metric. The method assumes a unidimensional latent trait and uses a three parameter normal ogive model. The assumptions of the model are explored, and the methods for calculating the proposed…
Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Mathematical Models
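For readers unfamiliar with the model the Bradshaw abstract names, the three-parameter normal ogive gives the probability of a correct response as P(theta) = c + (1 - c) * Phi(a * (theta - b)), with discrimination a, difficulty b, pseudo-guessing (lower asymptote) c, and standard normal CDF Phi. The sketch below simply evaluates this standard form; the parameter values are arbitrary and the code is not taken from the paper.

# Three-parameter normal ogive item response function, standard form.

from math import erf, sqrt

def normal_cdf(z: float) -> float:
    """Standard normal cumulative distribution function."""
    return 0.5 * (1 + erf(z / sqrt(2)))

def p_correct(theta: float, a: float, b: float, c: float) -> float:
    """Probability of a correct response given latent trait theta."""
    return c + (1 - c) * normal_cdf(a * (theta - b))

print(round(p_correct(theta=0.0, a=1.2, b=-0.5, c=0.2), 3))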
Peer reviewed
Fenna, Doug S. – European Journal of Engineering Education, 2004
Multiple-choice testing (MCT) has several advantages which are becoming more relevant in the current financial climate. In particular, such tests can be machine marked. As an objective testing method it is particularly relevant to engineering and other factual courses, but MCTs are not widely used in engineering because students can benefit from…
Descriptors: Guessing (Tests), Testing, Multiple Choice Tests, Engineering Education
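The guessing issue the Fenna abstract raises is often discussed alongside classical formula scoring, in which wrong answers are penalised so that blind guessing has an expected value of zero. The sketch below shows that standard correction; it is offered as context, not as the remedy this particular paper proposes.

# Classical correction for guessing ("formula scoring"): with k options per
# item, corrected = right - wrong / (k - 1), so random guesses are expected
# to cancel out on average.

def formula_score(right: int, wrong: int, k: int) -> float:
    return right - wrong / (k - 1)

# 40-item test, 4 options each: 25 right, 12 wrong, 3 omitted
print(formula_score(25, 12, 4))   # 21.0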