ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	28

Descriptor

Difficulty Level	89
Test Construction	89
Test Items	69
Item Response Theory	28
Test Format	21
Item Analysis	19
Multiple Choice Tests	17
Computer Assisted Testing	15
Test Validity	15
Test Reliability	14
Foreign Countries	11
Item Banks	11
Models	11
Psychometrics	11
Achievement Tests	10
Higher Education	10
Scoring	10
Reading Comprehension	9
Language Tests	8
Equated Scores	7
Mathematical Models	7
Reading Tests	7
Scaling	7
Statistical Analysis	7
Ability	6
More ▼

Publication Type

Reports - Evaluative	89
Journal Articles	39
Speeches/Meeting Papers	22
Numerical/Quantitative Data	7
Information Analyses	4
Reports - Research	3
Tests/Questionnaires	2
Opinion Papers	1

Education Level

Higher Education	5
Secondary Education	5
Elementary Education	4
Postsecondary Education	4
Elementary Secondary Education	3
Grade 5	3
Grade 6	3
Grade 7	3
Grade 8	3
Middle Schools	3
Early Childhood Education	2
Grade 1	2
Grade 2	2
Grade 3	2
Grade 4	2
Junior High Schools	2
High Schools	1
Preschool Education	1
Primary Education	1
More ▼

Audience

Researchers	2
Practitioners	1
Teachers	1

Location

Netherlands	3
Australia	2
Canada	2
Alabama	1
California	1
Cyprus	1
Florida	1
Germany	1
Hawaii	1
Louisiana	1
Missouri	1
New York	1
North Dakota	1
Oregon	1
South Carolina	1
Tennessee	1
Texas	1
United States	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	3
Program for International…	3
Test of English as a Foreign…	3
ACT Assessment	2
Advanced Placement…	2
SAT (College Admission Test)	2
Alabama High School…	1
Bender Visual Motor Gestalt…	1
Goodenough Harris Drawing Test	1
Hidden Figures Test	1
Metropolitan Achievement Tests	1
National Assessment of…	1
Praxis Series	1
Sentence Completion Test	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 89 results Save | Export

Improvised Progressive Model Based on Automatic Calibration of Difficulty Level: A Practical Solution of Competitive-Based Examination

Peer reviewed

Direct link

Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024

Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…

Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction

Idea-Sharing Crafting Item Difficulty in TOEFL iBT Listening Tests

Peer reviewed
PDF on ERIC

Download full text

Alan Shaw – PASAA: Journal of Language Teaching and Learning in Thailand, 2023

Although the TOEFL iBT Listening test is sometimes used for other purposes, it was designed primarily for use as a college entrance examination. Item difficulty in TOEFL iBT Listening tests is the product of interactions between two sets of complex relationships: 1) relationships among numerous item characteristics themselves, and 2) relationships…

Descriptors: English (Second Language), Second Language Instruction, Listening Skills, Language Tests

Item Statistics Derived from Three-Option Versions of Multiple-Choice Questions Are Usually as Robust as Four-or Five-Option Versions: Implications for Exam Design

Peer reviewed

Direct link

Loudon, Catherine; Macias-Muñoz, Aide – Advances in Physiology Education, 2018

Different versions of multiple-choice exams were administered to an undergraduate class in human physiology as part of normal testing in the classroom. The goal was to evaluate whether the number of options (possible answers) per question influenced the effectiveness of this assessment. Three exams (each with three versions) were given to each of…

Descriptors: Multiple Choice Tests, Test Construction, Test Items, Science Tests

Investigation of 2018 ACT Score Declines Final Report

Download full text

Keng, Leslie; Boyer, Michelle – National Center for the Improvement of Educational Assessment, 2020

ACT requested assistance from the National Center for the Improvement of Educational Assessment (Center for Assessment) to investigate declines of scores for states administering the ACT to its 11th grade students in 2018. This request emerged from conversations among state leaders, the Center for Assessment, and ACT in trying to understand the…

Descriptors: College Entrance Examinations, Scores, Test Score Decline, Educational Trends

Large-Scale Assessments, Personalized Learning, and Creativity: Paradoxes and Possibilities

Peer reviewed
PDF on ERIC

Download full text

Direct link

Beghetto, Ronald A. – ECNU Review of Education, 2019

Purpose: This article, based on an invited talk, aims to explore the relationship among large-scale assessments, creativity and personalized learning. Design/Approach/Methods: Starting with the working definition of large-scale assessments, creativity, and personalized learning, this article identified the paradox of combining these three…

Descriptors: Measurement, Creativity, Problem Solving, Artificial Intelligence

Constructing Multiple-Choice Items to Measure Higher-Order Thinking

Peer reviewed
PDF on ERIC

Download full text

Scully, Darina – Practical Assessment, Research & Evaluation, 2017

Across education, certification and licensure, there are repeated calls for the development of assessments that target "higher-order thinking," as opposed to mere recall of facts. A common assumption is that this necessitates the use of constructed response or essay-style test questions; however, empirical evidence suggests that this may…

Descriptors: Test Construction, Test Items, Multiple Choice Tests, Thinking Skills

The Limits of Measurement: Misplaced Precision, Phronesis, and Other Aristotelian Cautions for the Makers of PISA, APPR, Etc.

Peer reviewed

Direct link

Meyer, Heinz-Dieter – Comparative Education, 2017

Quantitative measures of student performance are increasingly used as proxies of educational quality and teacher ability. Such assessments assume that the quality of educational practices can be unambiguously quantitatively measured and that such measures are sufficiently precise and robust to be aggregated into policy-relevant rankings like…

Descriptors: Student Evaluation, Evaluation Problems, Accuracy, Scholarship

Evidence-Centered Design: Recommendations for Implementation and Practice

Peer reviewed

Direct link

Hendrickson, Amy; Ewing, Maureen; Kaliski, Pamela; Huff, Kristen – Journal of Applied Testing Technology, 2013

Evidence-centered design (ECD) is an orientation towards assessment development. It differs from conventional practice in several ways and consists of multiple activities. Each of these activities results in a set of useful documentation: domain analysis, domain modeling, construction of the assessment framework, and assessment…

Descriptors: Evidence, Test Construction, Educational Assessment, Learning Theories

Assessment of Uncertainty-Infused Scientific Argumentation

Peer reviewed

Direct link

Lee, Hee-Sun; Liu, Ou Lydia; Pallant, Amy; Roohr, Katrina Crotts; Pryputniewicz, Sarah; Buck, Zoë E. – Journal of Research in Science Teaching, 2014

Though addressing sources of uncertainty is an important part of doing science, it has largely been neglected in assessing students' scientific argumentation. In this study, we initially defined a scientific argumentation construct in four structural elements consisting of claim, justification, uncertainty qualifier, and uncertainty…

Descriptors: Persuasive Discourse, Student Evaluation, High School Students, Science Tests

Lessons Learned in Designing and Implementing a Computer-Adaptive Test for English

Peer reviewed
PDF on ERIC

Download full text

Burston, Jack; Neophytou, Maro – The EUROCALL Review, 2014

This paper describes the lessons learned in designing and implementing a computer-adaptive test (CAT) for English. The early identification of students with weak L2 English proficiency is of critical importance in university settings that have compulsory English language course graduation requirements. The most efficient means of diagnosing the L2…

Descriptors: English (Second Language), Second Language Instruction, Second Language Learning, Computer Assisted Instruction

Taking Decisions: Assessment for University Entry

Peer reviewed

Direct link

Plassmann, Sibylle; Zeidler, Beate – Language Learning in Higher Education, 2014

Language testing means taking decisions: about the test taker's results, but also about the test construct and the measures taken in order to ensure quality. This article takes the German test "telc Deutsch C1 Hochschule" as an example to illustrate this decision-making process in an academic context. The test is used for university…

Descriptors: Language Tests, Test Wiseness, Test Construction, Decision Making

Combining the Best of Two Standard Setting Methods: The Ordered Item Booklet Angoff

Peer reviewed

Direct link

Smith, Russell W.; Davis-Becker, Susan L.; O'Leary, Lisa S. – Journal of Applied Testing Technology, 2014

This article describes a hybrid standard setting method that combines characteristics of the Angoff (1971) and Bookmark (Mitzel, Lewis, Patz & Green, 2001) methods. The proposed approach utilizes strengths of each method while addressing weaknesses. An ordered item booklet, with items sorted based on item difficulty, is used in combination…

Descriptors: Standard Setting, Difficulty Level, Test Items, Rating Scales

Peer Review Improves the Quality of MCQ Examinations

Peer reviewed

Direct link

Malau-Aduli, Bunmi S.; Zimitat, Craig – Assessment & Evaluation in Higher Education, 2012

The aim of this study was to assess the effect of the introduction of peer review processes on the quality of multiple-choice examinations in the first three years of an Australian medical course. The impact of the peer review process and overall quality assurance (QA) processes were evaluated by comparing the examination data generated in earlier…

Descriptors: Foreign Countries, Peer Evaluation, Multiple Choice Tests, Test Construction

A Control Systems Concept Inventory Test Design and Assessment

Peer reviewed

Direct link

Bristow, M.; Erkorkmaz, K.; Huissoon, J. P.; Jeon, Soo; Owen, W. S.; Waslander, S. L.; Stubley, G. D. – IEEE Transactions on Education, 2012

Any meaningful initiative to improve the teaching and learning in introductory control systems courses needs a clear test of student conceptual understanding to determine the effectiveness of proposed methods and activities. The authors propose a control systems concept inventory. Development of the inventory was collaborative and iterative. The…

Descriptors: Diagnostic Tests, Concept Formation, Undergraduate Students, Engineering Education

The Development and Technical Adequacy of Seventh-Grade Reading Comprehension Measures in a Progress Monitoring Assessment System. Technical Report #1102

Download full text

Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2011

This technical report describes the process of development and piloting of reading comprehension measures that are appropriate for seventh-grade students as part of an online progress screening and monitoring assessment system, http://easycbm.com. Each measure consists of an original fictional story of approximately 1,600 to 1,900 words with 20…

Descriptors: Reading Comprehension, Reading Tests, Grade 7, Test Construction

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Educational and Psychological…	6
Behavioral Research and…	5
Applied Measurement in…	3
Applied Psychological…	3
Journal of Applied Testing…	2
Journal of Educational…	2
Advances in Physiology…	1
Assessment & Evaluation in…	1
Canadian Journal of Special…	1
Comparative Education	1
ECNU Review of Education	1
Education and Information…	1
Educational Measurement:…	1
IEEE Transactions on Education	1
Instructional Science: An…	1
International Journal of…	1
Journal of Dental Education	1
Journal of Educational and…	1
Journal of Research in…	1
Journal of Research in…	1
Journal of Research on…	1
Language Learning in Higher…	1
Ministerial Council on…	1
Multivariate Behavioral…	1
National Center for the…	1
More ▼

Tindal, Gerald	5
Alonzo, Julie	3
Liu, Kimy	3
Bejar, Isaac I.	2
Gershon, Richard C.	2
Green, Donald Ross	2
Hicks, Marilyn M.	2
Ketterlin-Geller, Leanne R.	2
Wainer, Howard	2
Adams, Richard	1
Aditya Shah	1
Aiken, Lewis R.	1
Ajay Devmane	1
Alan Shaw	1
Allen, Nancy L.	1
Armstrong, Ronald D.	1
Bacon, Tina P.	1
Barron, Ann E.	1
Beghetto, Ronald A.	1
Berger, Martijn P. F.	1
Bickel, Peter	1
Bock, H. Darrell	1
Boeijen, Marijke	1
Boyer, Michelle	1
More ▼