Showing 1 to 15 of 425 results
Peer reviewed
Samah AlKhuzaey; Floriana Grasso; Terry R. Payne; Valentina Tamma – International Journal of Artificial Intelligence in Education, 2024
Designing and constructing pedagogical tests that contain items (i.e. questions) which measure various types of skills for different levels of students equitably is a challenging task. Teachers and item writers alike need to ensure that the quality of assessment materials is consistent, if student evaluations are to be objective and effective.…
Descriptors: Test Items, Test Construction, Difficulty Level, Prediction
Peer reviewed
Kam, Chester Chun Seng – Educational and Psychological Measurement, 2023
When constructing measurement scales, regular and reversed items are often used (e.g., "I am satisfied with my job"/"I am not satisfied with my job"). Some methodologists recommend excluding reversed items because they are more difficult to understand and therefore engender a second, artificial factor distinct from the…
Descriptors: Test Items, Difficulty Level, Test Construction, Construct Validity
Peer reviewed
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown with advances in technology and its flexibility. Online examinations measure students' knowledge and skills, but traditional question papers suffer from inconsistent difficulty levels, arbitrary question allocation, and poor grading. The proposed model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Thompson, Kathryn N. – ProQuest LLC, 2023
It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…
Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores
Peer reviewed
Tino Endres; Lisa Bender; Stoo Sepp; Shirong Zhang; Louise David; Melanie Trypke; Dwayne Lieck; Juliette C. Désiron; Johanna Bohm; Sophia Weissgerber; Juan Cristobal Castro-Alonso; Fred Paas – Educational Psychology Review, 2025
Assessing cognitive demand is crucial for research on self-regulated learning; however, discrepancies in translating essential concepts across languages can hinder the comparison of research findings. Different languages often emphasize various components and interpret certain constructs differently. This paper aims to develop a translingual set…
Descriptors: Cognitive Processes, Difficulty Level, Metacognition, Translation
Peer reviewed
Ruying Li; Gaofeng Li – International Journal of Science and Mathematics Education, 2025
Systems thinking (ST) is an essential competence for future life and biology learning. Appropriate assessment is critical for collecting sufficient information to develop ST in biology education. This research offers an ST framework based on a comprehensive understanding of biological systems, encompassing four skills across three complexity…
Descriptors: Test Construction, Test Validity, Science Tests, Cognitive Tests
Peer reviewed
Alan Shaw – PASAA: Journal of Language Teaching and Learning in Thailand, 2023
Although the TOEFL iBT Listening test is sometimes used for other purposes, it was designed primarily for use as a college entrance examination. Item difficulty in TOEFL iBT Listening tests is the product of interactions between two sets of complex relationships: 1) relationships among numerous item characteristics themselves, and 2) relationships…
Descriptors: English (Second Language), Second Language Instruction, Listening Skills, Language Tests
Peer reviewed
Lyniesha Ward; Fridah Rotich; Jeffrey R. Raker; Regis Komperda; Sachin Nedungadi; Maia Popova – Chemistry Education Research and Practice, 2025
This paper describes the design and evaluation of the Organic chemistry Representational Competence Assessment (ORCA). Grounded in Kozma and Russell's representational competence framework, the ORCA measures the learner's ability to "interpret," "translate," and "use" six commonly used representations of molecular…
Descriptors: Organic Chemistry, Science Tests, Test Construction, Student Evaluation
Peer reviewed
Agus Santoso; Heri Retnawati; Timbul Pardede; Ibnu Rafi; Munaya Nikma Rosyada; Gulzhaina K. Kassymova; Xu Wenxin – Practical Assessment, Research & Evaluation, 2024
The test blueprint is important in test development: it guides the test item writer in creating items according to the desired objectives and specifications or characteristics (so-called a priori item characteristics), such as the level of item difficulty in each category and the distribution of items based on their difficulty level.…
Descriptors: Foreign Countries, Undergraduate Students, Business English, Test Construction
Peer reviewed
Sophie Langhorne; Nora Uglik-Marucha; Charlotte Broadhurst; Elena Lieven; Amelia Pearson; Silia Vitoratou; Kathy Leadbitter – Journal of Autism and Developmental Disorders, 2025
Tools to measure autism knowledge are needed to assess levels of understanding within particular groups of people and to evaluate whether awareness-raising campaigns or interventions lead to improvements in understanding. Several such measures are in circulation, but, to our knowledge, there are no psychometrically-validated questionnaires that…
Descriptors: Foreign Countries, Autism Spectrum Disorders, Questionnaires, Psychometrics
Peer reviewed
Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025
Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality, and the quality of AI-generated MCIs is comparable to that of items written by human experts. However, whether the quality of AI-generated MCIs is equally good across various domain-…
Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks
Peer reviewed
Bieleke, Maik; Goetz, Thomas; Krannich, Maike; Roos, Anna-Lena; Yanagida, Takuya – Journal of Experimental Education, 2023
Tests in educational contexts often start with easy tasks, assuming that this fosters positive experiences--a sense of control, higher valuing of the test, and more positive and less negative emotions. Although intuitive and widespread, this assumption lacks an empirical basis and a theoretical framework. We conducted a field experiment and…
Descriptors: Foreign Countries, Secondary School Students, Mathematics Tests, Test Construction
Peer reviewed
Kevser Arslan; Asli Görgülü Ari – Shanlax International Journal of Education, 2024
This study aimed to develop a valid and reliable multiple-choice achievement test for the subject area of ecology. The study was conducted within the framework of exploratory sequential design based on mixed research methods, and the study group consisted of a total of 250 middle school students studying at the sixth and seventh grade level. In…
Descriptors: Ecology, Science Tests, Test Construction, Multiple Choice Tests
Peer reviewed
Rodriguez, Rebekah M.; Silvia, Paul J.; Kaufman, James C.; Reiter-Palmon, Roni; Puryear, Jeb S. – Creativity Research Journal, 2023
The original 90-item Creative Behavior Inventory (CBI) was a landmark self-report scale in creativity research, and the 28-item brief form developed nearly 20 years ago continues to be a popular measure of everyday creativity. Relatively little is known, however, about the psychometric properties of this widely used scale. In the current research,…
Descriptors: Creativity Tests, Creativity, Creative Thinking, Psychometrics
Peer reviewed
Büsra Kilinç; Mehmet Diyaddin Yasar – Science Insights Education Frontiers, 2024
This study aimed to develop an achievement test based on the learning outcomes of the sound and its properties unit in the sixth-grade science course. In the test development phase, a literature review was first conducted. Then, 30 multiple-choice questions aligned with the learning outcomes in the 2018…
Descriptors: Science Tests, Test Construction, Grade 6, Science Instruction