ERIC - Search Results

Publication Date

In 2026	0
Since 2025	12
Since 2022 (last 5 years)	114
Since 2017 (last 10 years)	375
Since 2007 (last 20 years)	1130

Descriptor

Comparative Analysis	1943
Reliability	880
Test Reliability	792
Foreign Countries	554
Test Validity	443
Correlation	350
Validity	332
Interrater Reliability	327
Statistical Analysis	321
Scores	280
Measures (Individuals)	236
Evaluation Methods	212
Higher Education	201
Psychometrics	180
Questionnaires	165
Factor Analysis	161
Test Construction	160
College Students	159
English (Second Language)	149
Student Attitudes	141
Test Items	136
Second Language Learning	133
Scoring	130
Rating Scales	127
Student Evaluation	125
More ▼

Education Level

Higher Education	360
Postsecondary Education	285
Secondary Education	150
Elementary Education	135
Elementary Secondary Education	73
High Schools	68
Middle Schools	61
Early Childhood Education	41
Junior High Schools	34
Grade 8	29
Preschool Education	25
Grade 7	24
Intermediate Grades	24
Grade 4	22
Grade 5	20
Grade 6	20
Kindergarten	20
Primary Education	20
Adult Education	19
Grade 10	16
Grade 11	12
Grade 12	10
Grade 2	10
Grade 3	10
Grade 9	10
More ▼

Audience

Researchers	35
Practitioners	29
Teachers	15
Administrators	9
Policymakers	6
Counselors	2
Media Staff	2
Parents	1
Support Staff	1

Location

Turkey	59
United States	47
Australia	36
China	33
Canada	32
United Kingdom (England)	32
United Kingdom	28
Germany	25
Netherlands	24
Taiwan	22
Hong Kong	20
Iran	20
Spain	17
Belgium	15
California	15
Florida	13
Finland	12
Greece	12
Sweden	12
Texas	12
Indonesia	11
Japan	11
Jordan	11
Malaysia	11
Portugal	11
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	6
Every Student Succeeds Act…	2
Individuals with Disabilities…	2
Americans with Disabilities…	1
Comprehensive Employment and…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Race to the Top	1
Temporary Assistance for…	1

What Works Clearinghouse Rating

Meets WWC Standards with or without Reservations	1
Does not meet standards	1

Comparative Analysis X

Showing 31 to 45 of 1,943 results Save | Export

Generating Social and Emotional Skill Items: Humans vs. ChatGPT. ACT Research. Issue Brief

Download full text

Kate E. Walton; Cristina Anguiano-Carrasco – ACT, Inc., 2024

Large language models (LLMs), such as ChatGPT, are becoming increasingly prominent. Their use is becoming more and more popular to assist with simple tasks, such as summarizing documents, translating languages, rephrasing sentences, or answering questions. Reports like McKinsey's (Chui, & Yee, 2023) estimate that by implementing LLMs,…

Descriptors: Artificial Intelligence, Man Machine Systems, Natural Language Processing, Test Construction

How to Evaluate Students' Decisions in a Data Comparison Problem: Correct Decision for the Wrong Reasons?

Peer reviewed

Direct link

Karel Kok; Sophia Chroszczinsky; Burkhard Priemer – Physical Review Physics Education Research, 2024

Data comparison problems are used in teaching and science education research that focuses on students' ability to compare datasets and their conceptual understanding of measurement uncertainties. However, the evaluation of students' decisions in these problems can pose a problem: e.g., students making a correct decision for the wrong reasons.…

Descriptors: Secondary School Students, Undergraduate Students, Comparative Analysis, Evaluation Methods

German, Portuguese and Spanish Versions of the Revised Short Form of the Physical Self-Inventory (PSI-S-"R")

Peer reviewed

Direct link

Maïano, Christophe; Morin, Alexandre J. S.; Tietjens, Maike; Bastos, Tânia; Luiggi, Maxime; Corredeira, Rui; Griffet, Jean; Sánchez-Oliva, David – Measurement in Physical Education and Exercise Science, 2023

The present study sought to examine the psychometric properties of new German, Portuguese, and Spanish versions of the Revised Short Form of the Physical Self-Inventory (PSI-S-"R"), and to contrast these properties against those from the original French version of this instrument. Participants (n = 1802) were 288 French youth, 177 German…

Descriptors: German, Portuguese, Spanish, Test Construction

Analytic or Holistic: A Study of Agreement between Different Grading Models

Peer reviewed
PDF on ERIC

Download full text

Jönsson, Anders; Balan, Andreia – Practical Assessment, Research & Evaluation, 2018

Research on teachers' grading has shown that there is great variability among teachers regarding both the process and product of grading, resulting in low comparability and issues of inequality when using grades for selection purposes. Despite this situation, not much is known about the merits or disadvantages of different models for grading. In…

Descriptors: Grading, Models, Reliability, Validity

Revisiting the Academic Self-Concept Transcultural Measurement Model: The Case of Spain and China

Peer reviewed

Direct link

Igor Esnaola; Albert Sesé; Lorea Azpiazu; Yina Wang – British Journal of Educational Psychology, 2024

Background: Modelling academic self-concept through second-order factors or bifactor structures is an important issue with substantive and practical implications; besides, the bifactor model has not been analysed with a Chinese sample and cross-cultural studies in the academic self-concept are scarce. Likewise, latent structure validity evidence…

Descriptors: Academic Achievement, Self Concept, Psychometrics, Validity

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

A Comparison of Two Learning Approach Inventories and Their Utility in Predicting Examination Performance and Study Habits

Peer reviewed

Direct link

Andrew R. Thompson – Advances in Physiology Education, 2024

The revised two-factor Study Process Questionnaire and the Approaches and Study Skills Inventory for Students are two instruments commonly used to measure student learning approach. Although they are designed to measure similar constructs, it is unclear whether the metrics they provide differ in terms of their real-world classification of learning…

Descriptors: Comparative Analysis, Anatomy, Classification, Cognitive Style

Initial Evidence Supporting Interpretations of Scores from the Enhanced ACT Test. ACT Research. Research Report. R2425

Download full text

Jeff Allen; Ty Cruce – ACT Education Corp., 2025

This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…

Descriptors: College Entrance Examinations, Testing, Change, Scores

A Comparison of Reliability Estimation Based on Confirmatory Factor Analysis and Exploratory Structural Equation Models

Peer reviewed

Direct link

Fu, Yuanshu; Wen, Zhonglin; Wang, Yang – Educational and Psychological Measurement, 2022

Composite reliability, or coefficient omega, can be estimated using structural equation modeling. Composite reliability is usually estimated under the basic independent clusters model of confirmatory factor analysis (ICM-CFA). However, due to the existence of cross-loadings, the model fit of the exploratory structural equation model (ESEM) is…

Descriptors: Comparative Analysis, Structural Equation Models, Factor Analysis, Reliability

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed
PDF on ERIC

Download full text

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

Graders of the Future: Comparing the Consistency and Accuracy of GPT4 and Pre-Service Teachers in Physics Essay Question Assessments

Peer reviewed
PDF on ERIC

Download full text

Yubin Xu; Lin Liu; Jianwen Xiong; Guangtian Zhu – Journal of Baltic Science Education, 2025

As the development and application of large language models (LLMs) in physics education progress, the well-known AI-based chatbot ChatGPT4 has presented numerous opportunities for educational assessment. Investigating the potential of AI tools in practical educational assessment carries profound significance. This study explored the comparative…

Descriptors: Physics, Artificial Intelligence, Computer Software, Accuracy

The Effect of Sampling Context on Preschoolers' Finite Verb Morphology Composite Scores

Peer reviewed

Direct link

Brian Weiler; Ling-Yu Guo – Language, Speech, and Hearing Services in Schools, 2024

Purpose: The finite verb morphology composite (FVMC) is a valid measure for charting children's tense development and for differentiating children with and without language impairment during preschool and early elementary years. However, it is unclear whether FVMC scores vary as a function of language sample elicitation contexts. The current study…

Descriptors: Verbs, Preschool Children, Morphology (Languages), Accuracy

Are the Verbal TTCT Forms Actually Interchangeable?

Peer reviewed

Direct link

Grajzel, Katalin; Dumas, Denis; Acar, Selcuk – Journal of Creative Behavior, 2022

One of the best-known and most frequently used measures of creative idea generation is the Torrance Test of Creative Thinking (TTCT). The TTCT Verbal, assessing verbal ideation, contains two forms created to be used interchangeably by researchers and practitioners. However, the parallel forms reliability of the two versions of the TTCT Verbal has…

Descriptors: Test Reliability, Creative Thinking, Creativity Tests, Verbal Ability

Reliability and Validity of a Digital Goniometer for Measuring Knee Joint Range of Motion

Peer reviewed

Direct link

Lind, Veronika; Svensson, Melanie; Harringe, Marita L. – Measurement in Physical Education and Exercise Science, 2022

Goniometry is commonly used to evaluate joint range of motion (ROM). The most widespread method, a manual universal goniometer (UG), is considered time-consuming and difficult to handle. The digital goniometer EasyAngle (EA) was developed to improve and simplify the evaluation of ROM. This study aimed to evaluate the reliability and validity of EA…

Descriptors: Motor Reactions, Measurement Techniques, Comparative Analysis, Measurement Equipment

Adaptive Pairwise Comparison for Educational Measurement

Peer reviewed

Direct link

Crompvoets, Elise A. V.; Béguin, Anton A.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2020

Pairwise comparison is becoming increasingly popular as a holistic measurement method in education. Unfortunately, many comparisons are required for reliable measurement. To reduce the number of required comparisons, we developed an adaptive selection algorithm (ASA) that selects the most informative comparisons while taking the uncertainty of the…

Descriptors: Comparative Analysis, Statistical Analysis, Mathematics, Measurement

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 130

Educational and Psychological…	64
ProQuest LLC	59
Journal of Speech, Language,…	31
Online Submission	27
Journal of Educational…	22
Language Testing	21
Measurement in Physical…	21
ETS Research Report Series	17
Journal of Autism and…	16
Journal of Psychoeducational…	16
Educational Research and…	15
Assessment & Evaluation in…	14
Measurement and Evaluation in…	14
Psychology in the Schools	14
Journal of Consulting and…	12
International Education…	11
Journal of Education and…	11
Psychological Assessment	11
Research in Developmental…	11
Applied Measurement in…	10
Applied Psychological…	10
Educational Sciences: Theory…	10
Advances in Health Sciences…	9
Assessment in Education:…	9
Psychometrika	9
More ▼

Reckase, Mark D.	6
Attali, Yigal	5
Coniam, David	5
Brennan, Robert L.	4
Crehan, Kevin D.	4
Feldt, Leonard S.	4
Hakstian, A. Ralph	4
Jones, Ian	4
Kolen, Michael J.	4
Lunz, Mary E.	4
August, Diane	3
Bashaw, W. L.	3
Bennett, Randy Elliot	3
Benson, Jeri	3
Betz, Nancy E.	3
Ebel, Robert L.	3
Fletcher, Jack M.	3
Francis, David J.	3
Frisbie, David A.	3
Haberman, Shelby	3
Haladyna, Tom	3
Hambleton, Ronald K.	3
Henk, William A.	3
Iwata, Brian A.	3
More ▼

Journal Articles	1365
Reports - Research	1333
Reports - Evaluative	286
Speeches/Meeting Papers	165
Tests/Questionnaires	81
Reports - Descriptive	63
Dissertations/Theses -…	61
Information Analyses	55
Opinion Papers	30
Numerical/Quantitative Data	19
Collected Works - General	8
Books	7
Collected Works - Proceedings	5
Guides - Non-Classroom	5
Book/Product Reviews	4
Dissertations/Theses -…	4
Collected Works - Serials	3
Guides - General	2
Collected Works - Serial	1
Dissertations/Theses	1
Guides - Classroom - Teacher	1
Historical Materials	1
Non-Print Media	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Wechsler Intelligence Scale…	16
Peabody Picture Vocabulary…	13
Woodcock Johnson Tests of…	11
SAT (College Admission Test)	10
Test of English as a Foreign…	10
Wechsler Adult Intelligence…	10
Program for International…	9
Minnesota Multiphasic…	8
National Assessment of…	8
Torrance Tests of Creative…	7
Trends in International…	7
Wide Range Achievement Test	7
Autism Diagnostic Observation…	6
ACT Assessment	5
Raven Progressive Matrices	5
Self Directed Search	5
Center for Epidemiologic…	4
Dynamic Indicators of Basic…	4
Early Childhood Environment…	4
General Educational…	4
Graduate Record Examinations	4
Iowa Tests of Basic Skills	4
Metropolitan Achievement Tests	4
Rosenberg Self Esteem Scale	4
Social Skills Rating System	4
More ▼