Publication Date
In 2025 | 8 |
Since 2024 | 25 |
Since 2021 (last 5 years) | 84 |
Since 2016 (last 10 years) | 186 |
Since 2006 (last 20 years) | 299 |
Descriptor
Difficulty Level | 581 |
Test Construction | 581 |
Test Items | 423 |
Test Validity | 152 |
Item Analysis | 148 |
Test Reliability | 148 |
Foreign Countries | 144 |
Multiple Choice Tests | 131 |
Item Response Theory | 109 |
Higher Education | 78 |
Test Format | 74 |
Author
Tindal, Gerald | 17 |
Alonzo, Julie | 13 |
Anderson, Daniel | 8 |
Park, Bitnara Jasmine | 8 |
Huntley, Renee M. | 6 |
Irvin, P. Shawn | 6 |
Liu, Kimy | 6 |
Roid, Gale | 6 |
Saven, Jessica L. | 6 |
Bejar, Isaac I. | 5 |
Reckase, Mark D. | 5 |
Audience
Researchers | 21 |
Teachers | 10 |
Practitioners | 9 |
Policymakers | 5 |
Administrators | 4 |
Location
Indonesia | 17 |
Turkey | 11 |
Australia | 9 |
China | 8 |
Florida | 8 |
Germany | 7 |
Japan | 7 |
Nigeria | 7 |
Canada | 6 |
United Kingdom (England) | 6 |
Mexico | 5 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
No Child Left Behind Act 2001 | 1 |
Onur Dönmez; Yavuz Akbulut; Gözde Zabzun; Berrin Köseoglu – Applied Cognitive Psychology, 2025
This study investigates the effect of survey order in measuring self-reported cognitive load. Understanding how survey order influences responses is crucial, but it has been largely overlooked in the context of cognitive load. Using a 2 × 2 experimental design with 319 high school students, the study manipulated intrinsic cognitive load (ICL)…
Descriptors: Surveys, Test Construction, Measurement, Cognitive Processes
Camilo Vieira; Andrea Vásquez; Federico Meza; Roxana Quintero-Manes; Pedro Godoy – ACM Transactions on Computing Education, 2024
Currently, there is little evidence about how non-English-speaking students learn computer programming. For example, there are few validated assessment instruments to measure the development of programming skills, especially for the Spanish-speaking population. Having valid assessment instruments is essential to identify the difficulties of the…
Descriptors: Programming, Spanish Speaking, Translation, Test Validity
Samah AlKhuzaey; Floriana Grasso; Terry R. Payne; Valentina Tamma – International Journal of Artificial Intelligence in Education, 2024
Designing and constructing pedagogical tests that contain items (i.e. questions) which measure various types of skills for different levels of students equitably is a challenging task. Teachers and item writers alike need to ensure that the quality of assessment materials is consistent, if student evaluations are to be objective and effective.…
Descriptors: Test Items, Test Construction, Difficulty Level, Prediction
Krieglstein, Felix; Beege, Maik; Rey, Günter Daniel; Sanchez-Stockhammer, Christina; Schneider, Sascha – Educational Psychology Review, 2023
According to cognitive load theory, learning can only be successful when instructional materials and procedures are designed in accordance with human cognitive architecture. In this context, one of the biggest challenges is the accurate measurement of the different cognitive load types as these are associated with various activities during…
Descriptors: Test Construction, Test Validity, Questionnaires, Cognitive Processes
Yue Rong – International Journal of Web-Based Learning and Teaching Technologies, 2024
Mental health education in colleges and universities has made considerable progress, but the existing assessment model still faces challenges in terms of time overhead and rank indicators. In response, this paper proposes a new psychological education assessment model for colleges and universities, based on multimedia feature extraction…
Descriptors: Multimedia Instruction, Test Construction, Psychological Evaluation, Mental Health
Kam, Chester Chun Seng – Educational and Psychological Measurement, 2023
When constructing measurement scales, regular and reversed items are often used (e.g., "I am satisfied with my job"/"I am not satisfied with my job"). Some methodologists recommend excluding reversed items because they are more difficult to understand and therefore engender a second, artificial factor distinct from the…
Descriptors: Test Items, Difficulty Level, Test Construction, Construct Validity
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Ober, Teresa M.; Lu, Yikai; Blacklock, Chessley B.; Liu, Cheng; Cheng, Ying – Journal of Psychoeducational Assessment, 2023
We develop and validate a self-report measure of intrinsic and extrinsic cognitive load suitable for measuring the constructs in a variety of learning contexts. Data were collected from three independent samples of college students in the U.S. (N_total = 513; M_age = 21.13 years). Kane's (2013) framework was used to validate…
Descriptors: Test Construction, Test Validity, Cognitive Processes, Difficulty Level
Douglas-Morris, Jan; Ritchie, Helen; Willis, Catherine; Reed, Darren – Anatomical Sciences Education, 2021
Multiple-choice (MC) anatomy "spot-tests" (identification-based assessments on tagged cadaveric specimens) offer a practical alternative to traditional free-response (FR) spot-tests. Conversion of the two spot-tests in an upper limb musculoskeletal anatomy unit of study from FR to a novel MC format, where one of five tagged structures on…
Descriptors: Multiple Choice Tests, Anatomy, Test Reliability, Difficulty Level
Thompson, Kathryn N. – ProQuest LLC, 2023
It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…
Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores
Tino Endres; Lisa Bender; Stoo Sepp; Shirong Zhang; Louise David; Melanie Trypke; Dwayne Lieck; Juliette C. Désiron; Johanna Bohm; Sophia Weissgerber; Juan Cristobal Castro-Alonso; Fred Paas – Educational Psychology Review, 2025
Assessing cognitive demand is crucial for research on self-regulated learning; however, discrepancies in translating essential concepts across languages can hinder the comparison of research findings. Different languages often emphasize various components and interpret certain constructs differently. This paper aims to develop a translingual set…
Descriptors: Cognitive Processes, Difficulty Level, Metacognition, Translation
Ruying Li; Gaofeng Li – International Journal of Science and Mathematics Education, 2025
Systems thinking (ST) is an essential competence for future life and biology learning. Appropriate assessment is critical for collecting sufficient information to develop ST in biology education. This research offers an ST framework based on a comprehensive understanding of biological systems, encompassing four skills across three complexity…
Descriptors: Test Construction, Test Validity, Science Tests, Cognitive Tests
Alan Shaw – PASAA: Journal of Language Teaching and Learning in Thailand, 2023
Although the TOEFL iBT Listening test is sometimes used for other purposes, it was designed primarily for use as a college entrance examination. Item difficulty in TOEFL iBT Listening tests is the product of interactions between two sets of complex relationships: 1) relationships among numerous item characteristics themselves, and 2) relationships…
Descriptors: English (Second Language), Second Language Instruction, Listening Skills, Language Tests
Rushton, Nicky; Vitello, Sylvia; Suto, Irenka – Research Matters, 2021
It is important to define what an error in a question paper is, so that there is a common understanding and so that people's own conceptions do not affect the way in which they write or check question papers. We carried out an interview study to investigate our colleagues' definitions of error. We found that there is no single accepted definition…
Descriptors: Definitions, Tests, Foreign Countries, Problems
Jenna M. T. Vest – ProQuest LLC, 2024
This study focuses on creating a reliable and valid instrument to measure high school students' perceptions of academic challenge. The research is divided into four phases: qualitative analysis, item development, exploratory factor analysis (EFA), and validation. Initial data from college students' retrospective views and high school students'…
Descriptors: Test Construction, Test Validity, Student Attitudes, Academic Achievement