ERIC - Search Results

Publication Date

In 2026	0
Since 2025	15
Since 2022 (last 5 years)	60
Since 2017 (last 10 years)	172
Since 2007 (last 20 years)	253

Descriptor

Difficulty Level	409
Test Reliability	409
Test Items	279
Test Validity	199
Test Construction	152
Foreign Countries	146
Item Analysis	89
Multiple Choice Tests	85
Item Response Theory	80
Psychometrics	70
Scores	54
Achievement Tests	44
Higher Education	44
Statistical Analysis	44
Science Tests	41
Undergraduate Students	41
Comparative Analysis	40
Correlation	40
Language Tests	39
High School Students	35
Computer Assisted Testing	33
Test Format	33
Factor Analysis	31
Elementary School Students	30
Mathematics Tests	28
More ▼

Publication Type

Reports - Research	319
Journal Articles	263
Speeches/Meeting Papers	44
Reports - Evaluative	38
Tests/Questionnaires	29
Reports - Descriptive	14
Dissertations/Theses -…	12
Numerical/Quantitative Data	7
Guides - Non-Classroom	6
Opinion Papers	4
Information Analyses	2
Collected Works - Proceedings	1
Collected Works - Serials	1
Computer Programs	1
Guides - Classroom - Teacher	1
Guides - General	1
Reports - General	1
More ▼

Education Level

Higher Education	87
Postsecondary Education	76
Secondary Education	74
Elementary Education	54
High Schools	35
Middle Schools	30
Junior High Schools	19
Intermediate Grades	15
Early Childhood Education	14
Primary Education	13
Elementary Secondary Education	10
Grade 1	9
Grade 6	9
Grade 7	9
Grade 8	9
Kindergarten	9
Grade 2	8
Grade 5	8
Grade 4	6
Grade 9	6
Grade 12	5
Grade 3	4
Grade 10	3
Grade 11	2
Preschool Education	2
More ▼

Audience

Researchers	8
Practitioners	4
Teachers	3
Administrators	1
Community	1
Parents	1

Location

Indonesia	25
Turkey	20
Germany	12
Florida	8
Nigeria	8
United Kingdom	7
United States	7
Australia	5
China	5
Japan	5
South Korea	5
Canada	4
India	4
Iran	4
United Kingdom (England)	4
Finland	3
Jordan	3
Mexico	3
New York	3
Norway	3
Philippines	3
Spain	3
Taiwan	3
Turkey (Istanbul)	3
Asia	2
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	2
No Child Left Behind Act 2001	1
Pell Grant Program	1

What Works Clearinghouse Rating

Showing 1 to 15 of 409 results Save | Export

Investigation of Response Aggregation Methods in Divergent Thinking Assessments

Peer reviewed

Direct link

Janika Saretzki; Rosalie Andrae; Boris Forthmann; Mathias Benedek – Journal of Creative Behavior, 2025

Divergent thinking (DT) ability is widely regarded as a central cognitive capacity underlying creativity, but its assessment is challenged by the fact that DT tasks yield a variable number of responses. Various approaches for the scoring of DT tasks have been proposed, which differ in how responses are evaluated and aggregated within a task. The…

Descriptors: Creative Thinking, Creativity Tests, Scoring, Metacognition

Seeking the Real Reliability: Why the Traditional Estimators of Reliability Usually Fail in Achievement Testing and Why the Deflation-Corrected Coefficients Could Be Better Options

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023

Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…

Descriptors: Test Reliability, Achievement Tests, Computation, Test Items

Effect of Sample Length on MLU in Mandarin-Speaking Hard-of-Hearing Children

Peer reviewed

Direct link

Chia-Ying Chu; Pei-Hua Chen; Yi-Shin Tsai; Chieh-An Chen; Yi-Chih Chan; Yan-Jhe Ciou – Journal of Deaf Studies and Deaf Education, 2024

This study investigated the impact of language sample length on mean length of utterance (MLU) and aimed to determine the minimum number of utterances required for a reliable MLU. Conversations were collected from Mandarin-speaking, hard-of-hearing and typical-hearing children aged 16-81 months. The MLUs were calculated using sample sizes ranging…

Descriptors: Foreign Countries, Mandarin Chinese, Young Children, Language Acquisition

Comparative Evaluation of C-Test Reliability Using Classical and Modern Psychometric Methods

Peer reviewed
PDF on ERIC

Download full text

Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025

This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…

Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests

Validation of an Elicited Imitation Test as a Measure of Korean Language Proficiency

Peer reviewed

Direct link

Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024

This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…

Descriptors: Korean, Test Validity, Test Reliability, Imitation

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Direct link

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

Inventory of Galilean Transformation of Uniform Linear Motion in Position-Time Graphs

Peer reviewed

Direct link

E.?B. Merki; S.?I. Hofer; A. Vaterlaus; A. Lichtenberger – Physical Review Physics Education Research, 2025

When describing motion in physics, the selection of a frame of reference is crucial. The graph of a moving object can look quite different based on the frame of reference. In recent years, various tests have been developed to assess the interpretation of kinematic graphs, but none of these tests have specifically addressed differences in reference…

Descriptors: Graphs, Motion, Physics, Secondary School Students

A Systematic Meta-Analysis of the Reliability and Validity of Subjective Cognitive Load Questionnaires in Experimental Multimedia Learning Research

Peer reviewed

Direct link

Krieglstein, Felix; Beege, Maik; Rey, Günter Daniel; Ginns, Paul; Krell, Moritz; Schneider, Sascha – Educational Psychology Review, 2022

For more than three decades, cognitive load theory has been addressing learning from a cognitive perspective. Based on this instructional theory, design recommendations and principles have been derived to manage the load on working memory while learning. The increasing attention paid to cognitive load theory in educational science quickly…

Descriptors: Cognitive Processes, Difficulty Level, Learning Theories, Test Reliability

Improvised Progressive Model Based on Automatic Calibration of Difficulty Level: A Practical Solution of Competitive-Based Examination

Peer reviewed

Direct link

Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024

Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…

Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction

The Influence of Representations on Task Difficulty in Organic Chemistry: An Exploration Using a Novel Paired-Items Test Instrument

Peer reviewed

Direct link

Martin Steinbach; Carolin Eitemüller; Marc Rodemer; Maik Walpuski – International Journal of Science Education, 2025

The intricate relationship between representational competence and content knowledge in organic chemistry has been widely debated, and the ways in which representations contribute to task difficulty, particularly in assessment, remain unclear. This paper presents a multiple-choice test instrument for assessing individuals' knowledge of fundamental…

Descriptors: Organic Chemistry, Difficulty Level, Multiple Choice Tests, Fundamental Concepts

Brief Research Report: Psychometric Properties of a Cognitive Load Measure When Assessing the Load Associated with a Course

Peer reviewed

Direct link

Miller, Dan J.; Noble, Prisca; Medlen, Sue; Jones, Karina; Munns, Suzanne L. – Journal of Experimental Education, 2023

The cognitive load imposed by instruction is an important consideration for instructional designers. Theoretical models have traditionally divided total cognitive load into intrinsic, extrinsic, and germane load. The 10-item Cognitive Load Inventory (CLI-10) is designed to measure these three types of cognitive load. It is typically administered…

Descriptors: Psychometrics, Cognitive Processes, Difficulty Level, Factor Analysis

Assessment of Item and Test Parameters: Cosine Similarity Approach

Peer reviewed
PDF on ERIC

Download full text

Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021

The paper proposes new measures of difficulty and discriminating values of binary items and test consisting of such items and find their relationships including estimation of test error variance and thereby the test reliability, as per definition using cosine similarities. The measures use entire data. Difficulty value of test and item is defined…

Descriptors: Test Items, Difficulty Level, Scores, Test Reliability

Assessing Lower-Secondary School Students' Critical Thinking Skills in Photosynthesis: A Rasch Model Approach

Peer reviewed
PDF on ERIC

Download full text

Suwita Suwita; Sulistyo Saputro; Sajidan Sajidan; Sutarno Sutarno – Journal of Baltic Science Education, 2024

The current study uses the Rasch Model to measure lower-secondary school students' critical thinking skills on photosynthesis topics. Critical thinking skills are considered essential in science education, but few valid and practical measurement instruments remain. The current study fills the gap by adapting the instrument from the Watson-Glaser…

Descriptors: Secondary School Students, Critical Thinking, Thinking Skills, Botany

Development of a Fraction Vocabulary Measure

Peer reviewed

Direct link

Xin Lin; Sarah R. Powell – Assessment for Effective Intervention, 2024

Developing mathematics proficiency requires an understanding of mathematics vocabulary. Although previous research has developed several measures of mathematics vocabulary at different grade levels, no study focused solely on fraction vocabularies. We developed and tested a measure of fraction vocabulary for students in Grade 4 to determine the…

Descriptors: Mathematics Education, Mathematics Skills, Fractions, Vocabulary

A Novel Examination of None-of-the-Above as It Influences Examinee Item Responses

Direct link

Thompson, Kathryn N. – ProQuest LLC, 2023

It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…

Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 28

Educational and Psychological…	12
ProQuest LLC	12
Online Submission	10
Journal of Educational…	8
Journal of Experimental…	7
Grantee Submission	6
Physical Review Physics…	6
Applied Psychological…	5
International Journal of…	5
Language Testing	5
Applied Measurement in…	4
Chemistry Education Research…	4
ETS Research Report Series	4
Educational Research and…	4
International Journal of…	4
Journal of Education and…	4
Journal of Turkish Science…	4
SAGE Open	4
Advances in Health Sciences…	3
Assessment for Effective…	3
Behavioral Research and…	3
Cogent Education	3
IEEE Transactions on Education	3
International Journal of…	3
International Journal of…	3
More ▼

Schoen, Robert C.	6
DiLuzio, Geneva J.	4
Yang, Xiaotong	4
Alonzo, Julie	3
Anderson, Daniel	3
Huck, Schuyler W.	3
Paek, Insu	3
Prather, Edward E.	3
Thompson, Bruce	3
Tindal, Gerald	3
Weiten, Wayne	3
Al-Jarf, Reima	2
Alexander, Patricia A.	2
Anderson, Paul S.	2
Atalmis, Erkan Hasan	2
Barniol, Pablo	2
Bauduin, Charity	2
Bauer, Daniel	2
Benson, Jeri	2
Cliff, Norman	2
Dorans, Neil J.	2
Feldt, Leonard S.	2
Fischer, Martin R.	2
Frisbie, David A.	2
More ▼

Test of English as a Foreign…	4
SAT (College Admission Test)	3
Comprehensive Tests of Basic…	2
Flesch Kincaid Grade Level…	2
Flesch Reading Ease Formula	2
Graduate Record Examinations	2
Metropolitan Achievement Tests	2
Raven Progressive Matrices	2
Stanford Achievement Tests	2
Test of English for…	2
ACT Assessment	1
ACTFL Oral Proficiency…	1
Adult Attachment Interview	1
Armed Services Vocational…	1
Bayley Scales of Infant…	1
Career Decision Making…	1
Cattell Culture Fair…	1
Child Behavior Checklist	1
Clinical Evaluation of…	1
Defining Issues Test	1
Dynamic Indicators of Basic…	1
Embedded Figures Test	1
Gates MacGinitie Reading Tests	1
Graduate Management Admission…	1
Hidden Figures Test	1
More ▼