ERIC - Search Results

Publication Date

In 2025	4
Since 2024	9
Since 2021 (last 5 years)	31
Since 2016 (last 10 years)	133
Since 2006 (last 20 years)	318

Descriptor

Comparative Analysis	790
Test Reliability	790
Test Validity	424
Foreign Countries	174
Test Construction	132
Correlation	119
Statistical Analysis	117
Scores	105
Higher Education	98
Psychometrics	91
Test Items	88
Factor Analysis	70
Language Tests	64
English (Second Language)	61
Item Analysis	60
Testing	60
College Students	58
Evaluation Methods	58
Measures (Individuals)	58
Measurement Techniques	57
Test Format	57
Computer Assisted Testing	52
Multiple Choice Tests	52
Rating Scales	51
Student Attitudes	48
More ▼

Education Level

Higher Education	91
Postsecondary Education	66
Elementary Education	41
Secondary Education	40
Elementary Secondary Education	22
High Schools	18
Middle Schools	16
Early Childhood Education	11
Intermediate Grades	9
Junior High Schools	9
Grade 8	8
Preschool Education	8
Kindergarten	7
Grade 10	5
Grade 12	5
Grade 4	5
Primary Education	5
Grade 1	4
Grade 11	4
Grade 5	4
Adult Education	3
Grade 2	3
Grade 6	3
Grade 7	3
Grade 9	3
More ▼

Audience

Researchers	18
Practitioners	17
Teachers	9
Administrators	4
Counselors	2
Policymakers	2
Parents	1
Support Staff	1

Location

United States	21
Turkey	20
Australia	16
China	11
United Kingdom (England)	11
Germany	9
Hong Kong	9
Iran	9
Taiwan	9
United Kingdom	9
Canada	8
Belgium	7
Spain	7
France	6
Greece	6
Japan	6
Indonesia	5
Israel	5
Ohio	5
Portugal	5
Illinois	4
New York	4
Singapore	4
South Korea	4
Texas	4
More ▼

Laws, Policies, & Programs

What Works Clearinghouse Rating

Comparative Analysis X

Showing 1 to 15 of 790 results Save | Export

A Comparison of Yen's Q3 Coefficient and Rasch Testlet Modeling for Identifying Local Item Dependence: Evidence from Two Vocabulary Matching Tests

Peer reviewed

Direct link

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025

This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…

Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis

A Methodological Review of Listening Comprehension Tests for Primary School Children

Peer reviewed

Direct link

Kiri Mealings; Kelly Miles; Joerg M. Buchholz – International Journal of Listening, 2025

A child's ability to comprehend speech in the mainstream classroom is vital for intellectual and social development. However, listening conditions are often sub-optimal; the presence of multiple talkers, high noise levels, and long reverberation times add to the challenge of listening with a developing auditory system. An assessment that captures…

Descriptors: Elementary School Students, Listening Comprehension Tests, Comparative Analysis, Speech Communication

Psychometric Properties of the Metacognitive Awareness Inventory (MAI): Standardization to an International Spanish with 12 Countries

Peer reviewed

Direct link

Antonio P. Gutierrez de Blume; Diana Marcela Montoya Londoño; Virginia Jiménez Rodríguez; Olivia Morán Núñez; Ariel Cuadro; Lilián Daset; Mauricio Molina Delgado; Claudia García de la Cadena; María Beatríz Beltrán Navarro; Aníbal Puente Ferreras; Sebastián Urquijo; Walter Lizandro Arias – Metacognition and Learning, 2024

Metacognition is defined as a higher-order thinking skill that enables individuals to monitor, control, and regulate their thinking and behavior. In education, this skill is important, as learners need to self-regulate their learning behaviors for successful lifelong learning. Thus, it is essential for educators and learners alike to know their…

Descriptors: Metacognition, Measures (Individuals), Psychometrics, Standards

Generating Social and Emotional Skill Items: Humans vs. ChatGPT. ACT Research. Issue Brief

Download full text

Kate E. Walton; Cristina Anguiano-Carrasco – ACT, Inc., 2024

Large language models (LLMs), such as ChatGPT, are becoming increasingly prominent. Their use is becoming more and more popular to assist with simple tasks, such as summarizing documents, translating languages, rephrasing sentences, or answering questions. Reports like McKinsey's (Chui, & Yee, 2023) estimate that by implementing LLMs,…

Descriptors: Artificial Intelligence, Man Machine Systems, Natural Language Processing, Test Construction

How to Evaluate Students' Decisions in a Data Comparison Problem: Correct Decision for the Wrong Reasons?

Peer reviewed

Direct link

Karel Kok; Sophia Chroszczinsky; Burkhard Priemer – Physical Review Physics Education Research, 2024

Data comparison problems are used in teaching and science education research that focuses on students' ability to compare datasets and their conceptual understanding of measurement uncertainties. However, the evaluation of students' decisions in these problems can pose a problem: e.g., students making a correct decision for the wrong reasons.…

Descriptors: Secondary School Students, Undergraduate Students, Comparative Analysis, Evaluation Methods

German, Portuguese and Spanish Versions of the Revised Short Form of the Physical Self-Inventory (PSI-S-"R")

Peer reviewed

Direct link

Maïano, Christophe; Morin, Alexandre J. S.; Tietjens, Maike; Bastos, Tânia; Luiggi, Maxime; Corredeira, Rui; Griffet, Jean; Sánchez-Oliva, David – Measurement in Physical Education and Exercise Science, 2023

The present study sought to examine the psychometric properties of new German, Portuguese, and Spanish versions of the Revised Short Form of the Physical Self-Inventory (PSI-S-"R"), and to contrast these properties against those from the original French version of this instrument. Participants (n = 1802) were 288 French youth, 177 German…

Descriptors: German, Portuguese, Spanish, Test Construction

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed
PDF on ERIC

Download full text

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

Are the Verbal TTCT Forms Actually Interchangeable?

Peer reviewed

Direct link

Grajzel, Katalin; Dumas, Denis; Acar, Selcuk – Journal of Creative Behavior, 2022

One of the best-known and most frequently used measures of creative idea generation is the Torrance Test of Creative Thinking (TTCT). The TTCT Verbal, assessing verbal ideation, contains two forms created to be used interchangeably by researchers and practitioners. However, the parallel forms reliability of the two versions of the TTCT Verbal has…

Descriptors: Test Reliability, Creative Thinking, Creativity Tests, Verbal Ability

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Direct link

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

Short-Term Test-Retest Reliability of Contralateral Suppression of Click-Evoked Otoacoustic Emissions in Normal-Hearing Subjects

Peer reviewed

Direct link

Keppler, Hannah; Degeest, Sofie; Vinck, Bart – Journal of Speech, Language, and Hearing Research, 2021

Purpose: The objective of the current study was to investigate the short-term test-retest reliability of contralateral suppression (CS) of click-evoked otoacoustic emissions (CEOAEs) using commercially available otoacoustic emission equipment. Method: Twenty-three young normal-hearing subjects were tested. An otoscopic evaluation, admittance…

Descriptors: Test Reliability, Hearing (Physiology), Acoustics, Auditory Tests

The Effectiveness of the Predict-Explain-Enact-Observe-Reflect (PEEOR) Instructional Strategy on Conceptual Understanding and Motivation in Motion and Force Topic

Peer reviewed
PDF on ERIC

Download full text

Amssalu Wondmagegn Getu; Fikadu Edhetu Gashaw; Menberu Mengesha Woldemariam – Shanlax International Journal of Education, 2024

The study aimed to assess the effectiveness of the Predict-Explain-Enact-Observe-Reflect (PEEOR) instructional strategy on general science students' conceptual understanding and motivation in the topic of motion and force. The research employed a pre-test post-test quasi-experimental design. The sample consisted of 107 general science summer, year…

Descriptors: Physics, Science Instruction, Learning Motivation, Reflection

Treatments of Differential Item Functioning: A Comparison of Four Methods

Peer reviewed

Direct link

Liu, Xiaowen; Jane Rogers, H. – Educational and Psychological Measurement, 2022

Test fairness is critical to the validity of group comparisons involving gender, ethnicities, culture, or treatment conditions. Detection of differential item functioning (DIF) is one component of efforts to ensure test fairness. The current study compared four treatments for items that have been identified as showing DIF: deleting, ignoring,…

Descriptors: Item Analysis, Comparative Analysis, Culture Fair Tests, Test Validity

Integration of Interactive Computer Simulations in Teaching and Learning Chemical Reaction: Students' Performance and Concept Retention

Peer reviewed
PDF on ERIC

Download full text

Jane Batamuliza; Gonzague Habinshuti; Jean Baptiste Nkurunziza – Journal of Technology and Science Education, 2024

This current study presents the effects of interactive computer simulations on students' performance and concept retention in the unit of chemical reactions. Purposive sampling was used to select four schools with a sample population of 320. The Achievement test on chemical reactions was developed, validated, and checked for reliability. The…

Descriptors: Chemistry, Science Instruction, Teaching Methods, Comparative Analysis

Item Response Theory, Computer Adaptive Testing and the Risk of Self-Deception

Download full text

Benton, Tom – Research Matters, 2021

Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…

Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level

The Effect of Gersmehl's Spatial Learning on Students' Disaster Spatial Literacy

Peer reviewed
PDF on ERIC

Download full text

Purwanto; Hidayah, Niswatul; Wagistina, Satti – International Journal of Educational Methodology, 2023

Learning geography in Indonesia philosophically aims to develop spatial literacy. Students must improve spatial literacy to form reasoning skills and apply spatial concepts in real life. Applying Gersmehl's spatial learning can improve students' spatial literacy through syntax arranged based on spatial aspects. The use of google earth helps…

Descriptors: Spatial Ability, Natural Disasters, Geography Instruction, Teaching Methods

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 53

Educational and Psychological…	31
Journal of Educational…	13
Measurement and Evaluation in…	13
Language Testing	12
Psychology in the Schools	12
Journal of Consulting and…	10
Measurement in Physical…	10
ETS Research Report Series	9
Journal of Autism and…	8
Online Submission	8
ProQuest LLC	8
Research in Developmental…	8
Journal of Experimental…	7
Psychometrika	7
Advances in Health Sciences…	5
International Journal of…	5
Journal of Clinical Psychology	5
Journal of Speech, Language,…	5
Research Quarterly for…	5
Assessment	4
Educational Research and…	4
Hispanic Journal of…	4
International Education…	4
Journal of Attention Disorders	4
Journal of Chemical Education	4
More ▼

Reckase, Mark D.	5
Bashaw, W. L.	3
Bennett, Randy Elliot	3
Benson, Jeri	3
Crehan, Kevin D.	3
Ebel, Robert L.	3
Frisbie, David A.	3
Hakstian, A. Ralph	3
Henk, William A.	3
Weiss, David J.	3
Winke, Paula	3
Algozzine, Bob	2
August, Diane	2
Baron-Cohen, Simon	2
Bauer, Christopher F.	2
Bauer, Daniel	2
Betz, Nancy E.	2
Brennan, Robert L.	2
Brown, James Dean	2
Byrne, Brian	2
Cheung, Ping Chung	2
Christine, Charles T.	2
Clare, Isabel C. H.	2
Crabtree, Jason	2
More ▼

Reports - Research	505
Journal Articles	449
Reports - Evaluative	108
Speeches/Meeting Papers	91
Reports - Descriptive	27
Tests/Questionnaires	23
Information Analyses	17
Opinion Papers	10
Dissertations/Theses -…	9
Books	4
Dissertations/Theses -…	4
Numerical/Quantitative Data	4
Collected Works - General	3
Collected Works - Proceedings	3
Guides - Non-Classroom	3
Collected Works - Serials	2
Book/Product Reviews	1
Dissertations/Theses	1
Guides - Classroom - Teacher	1
Guides - General	1
Non-Print Media	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Wechsler Intelligence Scale…	14
Wechsler Adult Intelligence…	8
Minnesota Multiphasic…	7
SAT (College Admission Test)	6
Peabody Picture Vocabulary…	5
Torrance Tests of Creative…	5
Wide Range Achievement Test	5
General Educational…	4
Metropolitan Achievement Tests	4
Strong Campbell Interest…	4
Trends in International…	4
Center for Epidemiologic…	3
Childrens Manifest Anxiety…	3
Graduate Record Examinations	3
Iowa Tests of Basic Skills	3
McCarthy Scales of Childrens…	3
National Assessment of…	3
Raven Progressive Matrices	3
Rosenberg Self Esteem Scale	3
Self Directed Search	3
Stanford Binet Intelligence…	3
Test of English as a Foreign…	3
ACT Assessment	2
ACTFL Oral Proficiency…	2
Armed Services Vocational…	2
More ▼