ERIC - Search Results

Publication Date

In 2025	8
Since 2024	18
Since 2021 (last 5 years)	32
Since 2016 (last 10 years)	57
Since 2006 (last 20 years)	79

Descriptor

Error of Measurement	123
Test Reliability	123
Test Validity	123
Foreign Countries	28
Test Construction	26
Psychometrics	25
Scores	23
Scoring	20
Item Analysis	19
Item Response Theory	19
Correlation	18
Factor Analysis	18
Measurement Techniques	16
Test Items	16
Academic Achievement	15
Student Evaluation	14
Testing Problems	14
Test Bias	13
Evaluation Methods	12
Interrater Reliability	12
Questionnaires	12
Achievement Tests	11
Sampling	11
Standardized Tests	11
Testing	11
More ▼

Publication Type

Reports - Research	77
Journal Articles	68
Reports - Evaluative	14
Reports - Descriptive	13
Speeches/Meeting Papers	11
Numerical/Quantitative Data	10
Opinion Papers	3
Tests/Questionnaires	3
Guides - Non-Classroom	2
Collected Works - Serials	1
Guides - General	1
Information Analyses	1
Reference Materials -…	1
More ▼

Education Level

Secondary Education	19
Elementary Education	17
Higher Education	16
Postsecondary Education	13
Junior High Schools	9
Middle Schools	9
Elementary Secondary Education	8
Grade 3	8
Grade 4	7
Early Childhood Education	6
Grade 5	6
High Schools	6
Intermediate Grades	6
Primary Education	6
Grade 6	5
Grade 7	5
Grade 8	5
Grade 10	2
Grade 11	2
Grade 12	2
Grade 9	2
Kindergarten	2
More ▼

Audience

Administrators	2
Researchers	2
Teachers	1

Location

New York	5
Canada	4
Germany	3
Australia	2
Indonesia	2
Netherlands	2
New Mexico	2
South Africa	2
Spain	2
Turkey	2
Belgium	1
Ethiopia	1
Florida	1
France	1
India	1
Iran	1
Italy	1
Luxembourg	1
Maine	1
New Zealand	1
Norway	1
South Korea	1
United Kingdom	1
United Kingdom (England)	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	1
Race to the Top	1

Assessments and Surveys

Early Childhood Longitudinal…	3
ACT Assessment	2
General Educational…	2
Iowa Tests of Basic Skills	2
Program for International…	2
California Achievement Tests	1
Cognitive Abilities Test	1
Cornell Critical Thinking Test	1
Dimensions of Self Concept	1
Florida Comprehensive…	1
Metropolitan Achievement Tests	1
Motivated Strategies for…	1
Peabody Picture Vocabulary…	1
Satisfaction With Life Scale	1
Student Teacher Relationship…	1
Watson Glaser Critical…	1
Wechsler Intelligence Scale…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 123 results Save | Export

A Theoretical Suggestion on Testing Measurement Invariance in Adapting Parametric Measurement Tools

Peer reviewed
PDF on ERIC

Download full text

Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024

This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…

Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures

Measurement Invariance of the Action Competence in Sustainable Development Questionnaire: Can We Compare between Groups?

Peer reviewed

Direct link

M. Van Harskamp; S. De Maeyer; W. Sass; P. Van Petegem; J. Boeve-de Pauw – Environmental Education Research, 2025

There is a need for valid and reliable instruments to assess learning outcomes in education for sustainable development (ESD). Measurement invariance (MI) needs to be established before results of these instruments can be validly compared between groups. Despite its importance, establishing MI is an often overlooked validation step. To provide an…

Descriptors: Measurement, Sustainable Development, Error of Measurement, Questionnaires

Do Different Devices Perform Equally Well with Different Numbers of Scale Points and Response Formats? A Test of Measurement Invariance and Reliability

Peer reviewed

Direct link

Natalja Menold; Vera Toepoel – Sociological Methods & Research, 2024

Research on mixed devices in web surveys is in its infancy. Using a randomized experiment, we investigated device effects (desktop PC, tablet and mobile phone) for six response formats and four different numbers of scale points. N = 5,077 members of an online access panel participated in the experiment. An exact test of measurement invariance and…

Descriptors: Online Surveys, Handheld Devices, Telecommunications, Test Reliability

How Did Spain Perform in PISA 2018? New Estimates of Children's PISA Reading Scores

Peer reviewed

Direct link

John Jerrim; Luis Alejandro Lopez-Agudo; Oscar David Marcenaro-Gutierrez – British Journal of Educational Studies, 2024

International large-scale assessments have gained much attention since the beginning of the twenty-first century, influencing education legislation in many countries. This includes Spain, where they have been used by successive governments to justify education policy change. Unfortunately, there was a problem with the PISA 2018 reading scores for…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

Validation of the Higher Education Student Engagement Scale in Use for Program Evaluation

Peer reviewed

Direct link

Stella Y. Kim; Carl Westine; Tong Wu; Derek Maher – Journal of College Student Retention: Research, Theory & Practice, 2024

The primary purpose of this study is to validate a student engagement measure for its use in evaluation of a learning assistant (LA) program. A series of psychometric evaluations were made for both the original scale of Higher Education Student Engagement Scale (HESES) and its adapted version designed to be used in gauging the effectiveness of…

Descriptors: Learner Engagement, Teaching Assistants, Test Validity, Test Reliability

Comparing Measurement Reliability Estimation Techniques: Correlation Coefficient vs. Bland-Altman Plot

Peer reviewed

Direct link

Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024

The aim of this study is to compare the results of correlation coefficient estimation of reliability with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A linear and high-level relationship was found between the scale scores obtained from the halved forms.…

Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing

Evaluating Measurement Invariance of Students' Practices Regarding Online Information Questionnaire in PISA 2022: A Comparative Study Using MGCFA and Alignment Method

Peer reviewed

Direct link

Esra Sözer Boz – Education and Information Technologies, 2025

International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…

Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students

Are the Signs of Factor Loadings Arbitrary in Confirmatory Factor Analysis? Problems and Solutions

Peer reviewed

Direct link

Dandan Tang; Steven M. Boker; Xin Tong – Structural Equation Modeling: A Multidisciplinary Journal, 2025

The replication crisis in social and behavioral sciences has raised concerns about the reliability and validity of empirical studies. While research in the literature has explored contributing factors to this crisis, the issues related to analytical tools have received less attention. This study focuses on a widely used analytical tool -…

Descriptors: Test Validity, Factor Analysis, Replication (Evaluation), Social Science Research

The Vague Language Use Scale: Clinical Utility and Psychometrics from Adults with Traumatic Brain Injury

Peer reviewed

Direct link

Kathryn J. Greenslade; Julia K. Bushell; Emily F. Dillon; Amy E. Ramage – International Journal of Language & Communication Disorders, 2025

Background: Pragmatic communication difficulties encompass many distinct behaviours, including the use of vague and/or insufficient language, a common characteristic following traumatic brain injury (TBI) that negatively impacts psychosocial outcomes. Existing assessments evaluate pragmatic communication broadly, often with only one or two items…

Descriptors: Neurological Impairments, Head Injuries, Language Impairments, Language Tests

Evidence-Based Evaluation of Student and Marker Performances in Assessment and Examination

Peer reviewed

Direct link

Ole J. Kemi – Advances in Physiology Education, 2025

Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…

Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards

How Not to Fool Ourselves about Heterogeneity of Treatment Effects. EdWorkingPaper No. 25-1116

Download full text

Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025

Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…

Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences

Can Life Satisfaction Be Measured Fairly for Different Groups of South African First-Year Students? Testing the Satisfaction with Life Scale

Peer reviewed
PDF on ERIC

Download full text

van Rensburg, Clarisse; Mostert, Karina – Journal of Student Affairs in Africa, 2023

Student well-being has gradually become a topic of interest in higher education, and the accurate, valid, and reliable measure of well-being constructs is crucial in the South African context. This study examined item bias and configural, metric and scalar invariance of the Satisfaction with Life Scale (SWLS) for South African first-year…

Descriptors: Life Satisfaction, Measures (Individuals), Foreign Countries, College Freshmen

Initial Evidence Supporting Interpretations of Scores from the Enhanced ACT Test. ACT Research. Research Report. R2425

Download full text

Jeff Allen; Ty Cruce – ACT Education Corp., 2025

This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…

Descriptors: College Entrance Examinations, Testing, Change, Scores

Measurement Invariance across Race and Gender for the Force Concept Inventory

Peer reviewed

Direct link

Morley, Alicen; Nissen, Jayson M.; Van Dusen, Ben – Physical Review Physics Education Research, 2023

Instructors and researchers often use research-based assessments to identify the impact of instructional activities. These investigations often focus on issues of diversity, equity, and inclusions by comparing outcomes across social identity groups (e.g., gender, race, and class). Comparisons across groups assume the assessments measure the same…

Descriptors: Error of Measurement, Racial Differences, Gender Differences, Test Validity

The Short Inventory of Creative Activities (S-ICA): Compiling a Short Scale Using Ant Colony Optimization

Peer reviewed

Direct link

D. Steger; S. Weiss; O. Wilhelm – Creativity Research Journal, 2023

Creativity can be measured with a variety of methods including self-reports, others reports, and ability tests. While typical self-reports are best understood as weak proxies of creativity, biographical reports that assess previous creative activities seem more promising. Drawbacks of such measures -- including skewed item distributions, a lack of…

Descriptors: Creativity, Creativity Tests, Test Construction, Algorithms

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Educational and Psychological…	6
Grantee Submission	6
Journal of Educational…	5
New York State Education…	5
Journal of Psychoeducational…	4
Educational Measurement:…	3
Journal of Experimental…	3
IDEA Center, Inc.	2
International Journal of…	2
Measurement in Physical…	2
National Center for Education…	2
New Mexico Public Education…	2
Structural Equation Modeling:…	2
ACT Education Corp.	1
ACT, Inc.	1
Advances in Physiology…	1
Annenberg Institute for…	1
Applied Measurement in…	1
Biochemistry and Molecular…	1
British Journal of…	1
Cogent Education	1
Creativity Research Journal	1
ETS Research Institute	1
Educ Psychol Meas	1
Education and Information…	1
More ▼

Haladyna, Tom	3
Anna-Maria Fall	2
Benton, Stephen L.	2
Beula M. Magimairaj	2
Blaker, Lisa	2
Brennan, Robert L.	2
Dedrick, Robert F.	2
Greg Roberts	2
Lê, Thanh	2
Michael, William B.	2
Najarian, Michelle	2
Nord, Christine	2
Paek, Insu	2
Philip Capin	2
Roid, Gale	2
Ronald B. Gillam	2
Sandra L. Gillam	2
Schoen, Robert C.	2
Sharon Vaughn	2
Shaunessy-Dedrick, Elizabeth	2
Suldo, Shannon M.	2
Tourangeau, Karen	2
Vaden-Kiernan, Nancy	2
Wallner-Allen, Kathleen	2
More ▼