ERIC - Search Results

Publication Date

In 2026	0
Since 2025	20
Since 2022 (last 5 years)	84
Since 2017 (last 10 years)	235
Since 2007 (last 20 years)	369

Descriptor

Difficulty Level	369
Test Reliability	253
Test Items	212
Foreign Countries	181
Test Validity	149
Test Construction	103
Item Response Theory	86
Reliability	83
Psychometrics	71
Scores	62
Multiple Choice Tests	61
Item Analysis	59
Undergraduate Students	51
Correlation	48
Interrater Reliability	44
Science Tests	44
Statistical Analysis	44
Comparative Analysis	43
Measures (Individuals)	38
High School Students	37
Language Tests	37
Cognitive Processes	35
Scientific Concepts	35
Elementary School Students	34
Validity	34

Publication Type

Journal Articles	320
Reports - Research	316
Tests/Questionnaires	34
Reports - Evaluative	23
Dissertations/Theses -…	21
Numerical/Quantitative Data	6
Reports - Descriptive	5
Speeches/Meeting Papers	5
Information Analyses	4
Collected Works - Proceedings	1
Reports - General	1

Education Level

Higher Education	121
Postsecondary Education	105
Secondary Education	90
Elementary Education	75
High Schools	44
Middle Schools	37
Junior High Schools	24
Intermediate Grades	21
Early Childhood Education	19
Primary Education	18
Elementary Secondary Education	14
Grade 8	14
Grade 7	13
Kindergarten	11
Grade 4	10
Grade 6	10
Grade 1	9
Grade 5	9
Grade 2	8
Grade 3	7
Grade 9	7
Grade 10	5
Grade 12	5
Preschool Education	3
Grade 11	2

Audience

Administrators	1
Community	1
Counselors	1
Parents	1
Teachers	1

Location

Indonesia	27
Turkey	24
Germany	14
Florida	9
Nigeria	8
United Kingdom	8
United States	8
Canada	6
Iran	6
Japan	6
Jordan	6
South Korea	6
Taiwan	6
China	5
California	4
India	4
Israel	4
Philippines	4
Saudi Arabia	4
South Africa	4
Turkey (Istanbul)	4
United Kingdom (England)	4
Australia	3
Chile	3
Finland	3

Laws, Policies, & Programs

No Child Left Behind Act 2001	1
Pell Grant Program	1

What Works Clearinghouse Rating

Does not meet standards

Showing 1 to 15 of 369 results

Investigation of Response Aggregation Methods in Divergent Thinking Assessments

Peer reviewed

Janika Saretzki; Rosalie Andrae; Boris Forthmann; Mathias Benedek – Journal of Creative Behavior, 2025

Divergent thinking (DT) ability is widely regarded as a central cognitive capacity underlying creativity, but its assessment is challenged by the fact that DT tasks yield a variable number of responses. Various approaches for the scoring of DT tasks have been proposed, which differ in how responses are evaluated and aggregated within a task. The…

Descriptors: Creative Thinking, Creativity Tests, Scoring, Metacognition

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

Decision-Making Efficiency with Aided Information: The Impact of Automation Reliability and Task Difficulty

Peer reviewed

Hanshu Zhang; Ran Zhou; Cheng-You Cheng; Sheng-Hsu Huang; Ming-Hui Cheng; Cheng-Ta Yang – Cognitive Research: Principles and Implications, 2025

Although it is commonly believed that automation aids human decision-making, conflicting evidence raises questions about whether individuals would gain greater advantages from automation in difficult tasks. Our study examines the combined influence of task difficulty and automation reliability on aided decision-making. We assessed decision…

Descriptors: Task Analysis, Difficulty Level, Decision Making, Automation

Seeking the Real Reliability: Why the Traditional Estimators of Reliability Usually Fail in Achievement Testing and Why the Deflation-Corrected Coefficients Could Be Better Options

Peer reviewed

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023

Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…

Descriptors: Test Reliability, Achievement Tests, Computation, Test Items

Evaluating Mathematics Lessons for Cognitive Demand: Applying a Discursive Lens to the Process of Achieving Inter-Rater Reliability

Peer reviewed

Weingarden, Merav; Heyd-Metzuyanim, Einat – Journal of Mathematics Teacher Education, 2023

In this study, we examine "what went wrong" in our professional development program for encouraging cognitively demanding instruction, focusing on the difficulties we encountered in using an observational tool for evaluating this type of instruction and reaching inter-rater reliability. We do so through the lens of a discursive theory of…

Descriptors: Mathematics Instruction, Interrater Reliability, Cognitive Processes, Difficulty Level

Effect of Sample Length on MLU in Mandarin-Speaking Hard-of-Hearing Children

Peer reviewed

Chia-Ying Chu; Pei-Hua Chen; Yi-Shin Tsai; Chieh-An Chen; Yi-Chih Chan; Yan-Jhe Ciou – Journal of Deaf Studies and Deaf Education, 2024

This study investigated the impact of language sample length on mean length of utterance (MLU) and aimed to determine the minimum number of utterances required for a reliable MLU. Conversations were collected from Mandarin-speaking, hard-of-hearing and typical-hearing children aged 16-81 months. The MLUs were calculated using sample sizes ranging…

Descriptors: Foreign Countries, Mandarin Chinese, Young Children, Language Acquisition

Taxonomy of Digital Curation Activities That Promote Critical Thinking

Peer reviewed

Rivka Gadot; Dina Tsybulsky – Smart Learning Environments, 2025

Critical thinking (CT) consists of a deliberate and reflective process that can lead to informed decisions. It involves scrutinizing the trustworthiness and consistency of underlying assumptions, the sources of data, and the validity of other information. CT embodies deliberate, self-regulated judgment incorporating cognitive abilities such as…

Descriptors: Critical Thinking, Data Collection, Information Management, Decision Making Skills

Comparative Evaluation of C-Test Reliability Using Classical and Modern Psychometric Methods

Peer reviewed

Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025

This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…

Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests

Validation of an Elicited Imitation Test as a Measure of Korean Language Proficiency

Peer reviewed

Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024

This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…

Descriptors: Korean, Test Validity, Test Reliability, Imitation

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

Inventory of Galilean Transformation of Uniform Linear Motion in Position-Time Graphs

Peer reviewed

E. B. Merki; S. I. Hofer; A. Vaterlaus; A. Lichtenberger – Physical Review Physics Education Research, 2025

When describing motion in physics, the selection of a frame of reference is crucial. The graph of a moving object can look quite different based on the frame of reference. In recent years, various tests have been developed to assess the interpretation of kinematic graphs, but none of these tests have specifically addressed differences in reference…

Descriptors: Graphs, Motion, Physics, Secondary School Students

A Systematic Meta-Analysis of the Reliability and Validity of Subjective Cognitive Load Questionnaires in Experimental Multimedia Learning Research

Peer reviewed

Krieglstein, Felix; Beege, Maik; Rey, Günter Daniel; Ginns, Paul; Krell, Moritz; Schneider, Sascha – Educational Psychology Review, 2022

For more than three decades, cognitive load theory has been addressing learning from a cognitive perspective. Based on this instructional theory, design recommendations and principles have been derived to manage the load on working memory while learning. The increasing attention paid to cognitive load theory in educational science quickly…

Descriptors: Cognitive Processes, Difficulty Level, Learning Theories, Test Reliability

Improvised Progressive Model Based on Automatic Calibration of Difficulty Level: A Practical Solution of Competitive-Based Examination

Peer reviewed

Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024

Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…

Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction

Reliable but Multi-Dimensional Cognitive Demand in Operating Partially Automated Vehicles: Implications for Real-World Automation Research

Peer reviewed

Monika Lohani; Joel M. Cooper; Amy S. McDonnell; Gus G. Erickson; Trent G. Simmons; Amanda E. Carriero; Kaedyn W. Crabtree; David L. Strayer – Cognitive Research: Principles and Implications, 2024

The reliability of cognitive demand measures in controlled laboratory settings is well-documented; however, limited research has directly established their stability under real-life and high-stakes conditions, such as operating automated technology on actual highways. Partially automated vehicles have advanced to become an everyday mode of…

Descriptors: Cognitive Processes, Difficulty Level, Automation, Psychophysiology

The Influence of Representations on Task Difficulty in Organic Chemistry: An Exploration Using a Novel Paired-Items Test Instrument

Peer reviewed

Martin Steinbach; Carolin Eitemüller; Marc Rodemer; Maik Walpuski – International Journal of Science Education, 2025

The intricate relationship between representational competence and content knowledge in organic chemistry has been widely debated, and the ways in which representations contribute to task difficulty, particularly in assessment, remain unclear. This paper presents a multiple-choice test instrument for assessing individuals' knowledge of fundamental…

Descriptors: Organic Chemistry, Difficulty Level, Multiple Choice Tests, Fundamental Concepts

Source

ProQuest LLC	20
Online Submission	14
Grantee Submission	9
ETS Research Report Series	6
International Journal of…	6
Physical Review Physics…	6
Educational Research and…	5
International Journal of…	5
Journal of Education and…	5
Language Assessment Quarterly	5
Advances in Health Sciences…	4
Anatomical Sciences Education	4
Chemistry Education Research…	4
Cogent Education	4
Educational and Psychological…	4
International Journal of…	4
International Journal of…	4
Journal of Turkish Science…	4
Language Testing	4
SAGE Open	4
Applied Measurement in…	3
Assessment for Effective…	3
Behavioral Research and…	3
CBE - Life Sciences Education	3
Cognitive Research:…	3

Author

Schoen, Robert C.	6
Yang, Xiaotong	4
Al-Jarf, Reima	3
Alonzo, Julie	3
Anderson, Daniel	3
Paek, Insu	3
Prather, Edward E.	3
Tindal, Gerald	3
Alexander, Patricia A.	2
Atalmis, Erkan Hasan	2
Barniol, Pablo	2
Bauduin, Charity	2
Bauer, Daniel	2
Beach, Kristen D.	2
Benton, Tom	2
Bocian, Kathleen M.	2
Fischer, Martin R.	2
Gu, Jianjun	2
Hamby, Tyler	2
Istiyono, Edi	2
Jandaghi, Gholamreza	2
Krell, Moritz	2
Lee, Young-Sun	2
Liu, Sicong	2
Lubiano, Michael Leonard D.	2

Assessments and Surveys

Flesch Kincaid Grade Level…	3
Flesch Reading Ease Formula	3
SAT (College Admission Test)	3
Test of English as a Foreign…	3
Raven Progressive Matrices	2
Woodcock Johnson Tests of…	2
edTPA (Teacher Performance…	2
ACT Assessment	1
ACTFL Oral Proficiency…	1
Adult Attachment Interview	1
Ages and Stages Questionnaires	1
Career Decision Making…	1
Child Behavior Checklist	1
Clinical Evaluation of…	1
Dale Chall Readability Formula	1
Defining Issues Test	1
Dynamic Indicators of Basic…	1
Fry Readability Formula	1
Gates MacGinitie Reading Tests	1
Iowa Tests of Basic Skills	1
MacArthur Communicative…	1
Measures of Academic Progress	1
National Survey of Student…	1
Peabody Developmental Motor…	1
Peabody Individual…	1