ERIC - Search Results

Publication Date

In 2026	0
Since 2025	6
Since 2022 (last 5 years)	36
Since 2017 (last 10 years)	69
Since 2007 (last 20 years)	207

Descriptor

Error of Measurement	357
Reliability	357
Scores	105
Correlation	67
Statistical Analysis	64
Validity	64
Psychometrics	48
Generalizability Theory	44
Measurement Techniques	44
Computation	39
Evaluation Methods	36
Foreign Countries	36
Factor Analysis	35
Sampling	35
Comparative Analysis	32
Research Methodology	32
True Scores	30
Measurement	29
Measures (Individuals)	28
Sample Size	28
Item Response Theory	27
Models	27
Simulation	27
Structural Equation Models	26
Academic Achievement	25
More ▼

Publication Type

Journal Articles	258
Reports - Research	186
Reports - Evaluative	84
Reports - Descriptive	46
Speeches/Meeting Papers	34
Dissertations/Theses -…	8
Opinion Papers	6
Information Analyses	5
Tests/Questionnaires	5
Guides - Non-Classroom	4
Numerical/Quantitative Data	4
Book/Product Reviews	3
ERIC Digests in Full Text	2
ERIC Publications	2
Books	1
Collected Works - Serial	1
Guides - General	1
Legal/Legislative/Regulatory…	1
More ▼

Education Level

Higher Education	39
Postsecondary Education	26
Elementary Education	24
Secondary Education	17
Middle Schools	15
Elementary Secondary Education	14
Junior High Schools	11
Early Childhood Education	8
High Schools	8
Grade 4	7
Intermediate Grades	7
Grade 3	5
Primary Education	5
Grade 5	4
Grade 8	4
Adult Education	3
Preschool Education	3
Grade 1	2
Grade 6	2
Kindergarten	2
Grade 2	1
Grade 7	1
High School Equivalency…	1
More ▼

Audience

Researchers	9
Policymakers	1
Practitioners	1
Students	1
Teachers	1

Location

United States	7
Canada	6
North Carolina	5
Pennsylvania	5
Portugal	4
Spain	4
Turkey	4
Australia	3
California	3
China	3
Germany	3
Florida	2
New York	2
Philippines	2
Texas	2
United Kingdom (England)	2
Africa	1
Arkansas	1
Chile	1
China (Beijing)	1
Czech Republic	1
Finland	1
Georgia	1
Hong Kong	1
Iowa	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	3
Elementary and Secondary…	1
Elementary and Secondary…	1
Guaranteed Student Loan…	1
Race to the Top	1

What Works Clearinghouse Rating

Showing 1 to 15 of 357 results Save | Export

Brief Research Report: Effects of Sampling Error and Categorization on Estimation of Measure of Sampling Adequacy

Peer reviewed

Direct link

Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024

The second version of Kaiser's Measure of Sampling Adequacy (MSA[subscript 2]) has been widely applied to assess the factorability of data in psychological research. The MSA[subscript 2] is developed in the population and little is known about its behavior in finite samples. If estimated MSA[subscript 2]s are biased due to sampling errors,…

Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Direct link

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

New Tests of Rater Drift in Trend Scoring

Peer reviewed

Direct link

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Using Regularization to Identify Measurement Bias across Multiple Background Characteristics: A Penalized Expectation-Maximization Algorithm

Peer reviewed

Direct link

William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024

Testing for differential item functioning (DIF) has undergone rapid statistical developments recently. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity, etc.), and regularization has shown promising results for identifying DIF among…

Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement

The Reliability of Simultaneous versus Individual Data Collection during Stuttering Assessment

Peer reviewed

Direct link

Davidow, Jason H.; Ye, Jun; Edge, Robin L. – International Journal of Language & Communication Disorders, 2023

Background: Speech-language pathologists often multitask in order to be efficient with their commonly large caseloads. In stuttering assessment, multitasking often involves collecting multiple measures simultaneously. Aims: The present study sought to determine reliability when collecting multiple measures simultaneously versus individually.…

Descriptors: Graduate Students, Measurement, Reliability, Group Activities

An R Package for Optimizing the Composite Reliability in Multivariate Nested Designs

Peer reviewed
PDF on ERIC

Download full text

Joyce M. W. Moonen-van Loon; Jeroen Donkers – Practical Assessment, Research & Evaluation, 2025

The reliability of assessment tools is critical for accurately monitoring student performance in various educational contexts. When multiple assessments are combined to form an overall evaluation, each assessment serves as a data point contributing to the student's performance within a broader educational framework. Determining composite…

Descriptors: Programming Languages, Reliability, Evaluation Methods, Student Evaluation

A Meta-Analysis of Self-Assessment and Language Performance in Language Testing and Assessment

Peer reviewed

Direct link

Li, Minzi; Zhang, Xian – Language Testing, 2021

This meta-analysis explores the correlation between self-assessment (SA) and language performance. Sixty-seven studies with 97 independent samples involving more than 68,500 participants were included in our analysis. It was found that the overall correlation between SA and language performance was 0.466 (p < 0.01). Moderator analysis was…

Descriptors: Meta Analysis, Self Evaluation (Individuals), Likert Scales, Research Reports

The Impact of Measurement Model Misspecification on Coefficient Omega Estimates of Composite Reliability

Peer reviewed

Direct link

Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024

Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…

Descriptors: Influences, Models, Measurement Techniques, Reliability

The Analysis of Marking Reliability through the Approach of Gauge Repeatability and Reproducibility (GR&R) Study: A Case of English-Speaking Test

Peer reviewed

Direct link

Pornphan Sureeyatanapas; Panitas Sureeyatanapas; Uthumporn Panitanarak; Jittima Kraisriwattana; Patchanan Sarootyanapat; Daniel O'Connell – Language Testing in Asia, 2024

Ensuring consistent and reliable scoring is paramount in education, especially in performance-based assessments. This study delves into the critical issue of marking consistency, focusing on speaking proficiency tests in English language learning, which often face greater reliability challenges. While existing literature has explored various…

Descriptors: Foreign Countries, Students, English Language Learners, Speech

Twenty Years of Network Meta-Analysis: Continuing Controversies and Recent Developments

Peer reviewed

Direct link

A. E. Ades; Nicky J. Welton; Sofia Dias; David M. Phillippo; Deborah M. Caldwell – Research Synthesis Methods, 2024

Network meta-analysis (NMA) is an extension of pairwise meta-analysis (PMA) which combines evidence from trials on multiple treatments in connected networks. NMA delivers internally consistent estimates of relative treatment efficacy, needed for rational decision making. Over its first 20 years NMA's use has grown exponentially, with applications…

Descriptors: Network Analysis, Meta Analysis, Medicine, Clinical Experience

Combined Logistic and Confined Exponential Growth Models: Estimation Using SEM Software

Peer reviewed

Direct link

Phillip K. Wood – Structural Equation Modeling: A Multidisciplinary Journal, 2024

The logistic and confined exponential curves are frequently used in studies of growth and learning. These models, which are nonlinear in their parameters, can be estimated using structural equation modeling software. This paper proposes a single combined model, a weighted combination of both models. Mplus, Proc Calis, and lavaan code for the model…

Descriptors: Structural Equation Models, Computation, Computer Software, Weighted Scores

On the Benefits of Using Maximal Reliability in Educational and Behavioral Research

Peer reviewed

Direct link

Tenko Raykov – Educational and Psychological Measurement, 2024

This note is concerned with the benefits that can result from the use of the maximal reliability and optimal linear combination concepts in educational and psychological research. Within the widely used framework of unidimensional multi-component measuring instruments, it is demonstrated that the linear combination of their components that…

Descriptors: Educational Research, Behavioral Science Research, Reliability, Error of Measurement

Using Multiple Imputation to Account for the Uncertainty Due to Missing Data in the Context of Factor Retention

Peer reviewed

Direct link

Yan Xia; Selim Havan – Educational and Psychological Measurement, 2024

Although parallel analysis has been found to be an accurate method for determining the number of factors in many conditions with complete data, its application under missing data is limited. The existing literature recommends that, after using an appropriate multiple imputation method, researchers either apply parallel analysis to every imputed…

Descriptors: Data Interpretation, Factor Analysis, Statistical Inference, Research Problems

Frequentist and Bayesian Factorial Invariance Using R

Peer reviewed
PDF on ERIC

Download full text

Teck Kiang Tan – Practical Assessment, Research & Evaluation, 2024

The procedures of carrying out factorial invariance to validate a construct were well developed to ensure the reliability of the construct that can be used across groups for comparison and analysis, yet mainly restricted to the frequentist approach. This motivates an update to incorporate the growing Bayesian approach for carrying out the Bayesian…

Descriptors: Bayesian Statistics, Factor Analysis, Programming Languages, Reliability

Integrating Bifactor Models into a Generalizability Theory Based Structural Equation Modeling Framework

Peer reviewed

Direct link

Vispoel, Walter P.; Lee, Hyeryung; Xu, Guanlan; Hong, Hyeri – Journal of Experimental Education, 2023

Although generalizability theory (GT) designs have traditionally been analyzed within an ANOVA framework, identical results can be obtained with structural equation models (SEMs) but extended to represent multiple sources of both systematic and measurement error variance, include estimation methods less likely to produce negative variance…

Descriptors: Generalizability Theory, Structural Equation Models, Programming Languages, Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 24

Educational and Psychological…	38
Applied Psychological…	17
Journal of Educational…	16
Applied Measurement in…	10
ProQuest LLC	8
Structural Equation Modeling:…	8
Advances in Health Sciences…	7
ETS Research Report Series	6
Multivariate Behavioral…	6
Psychological Methods	6
International Journal of…	5
Journal of Experimental…	5
Journal of Psychoeducational…	5
Measurement and Evaluation in…	5
Grantee Submission	4
Journal of Educational and…	4
Practical Assessment,…	4
Assessment & Evaluation in…	3
Assessment for Effective…	3
Educational Testing Service	3
International Journal of…	3
Language Testing	3
Psychometrika	3
Regional Educational…	3
Research Synthesis Methods	3
More ▼

Raykov, Tenko	11
Henson, Robin K.	7
Kolen, Michael J.	5
Livingston, Samuel A.	5
Sijtsma, Klaas	5
Fan, Xitao	4
Haberman, Shelby J.	4
Marcoulides, George A.	4
Capraro, Robert M.	3
Feldt, Leonard S.	3
Lee, Guemin	3
Lee, Won-Chan	3
Moses, Tim	3
Onwuegbuzie, Anthony J.	3
Thompson, Bruce	3
Vacha-Haase, Tammi	3
Wang, Tianyou	3
Williams, Richard H.	3
Zimmerman, Donald W.	3
Al Otaiba, Stephanie	2
Alonso, Ariel	2
Brennan, Robert L.	2
Camilli, Gregory	2
Capraro, Mary Margaret	2
More ▼

ACT Assessment	4
Iowa Tests of Basic Skills	3
SAT (College Admission Test)	3
Advanced Placement…	2
National Household Education…	2
Rosenberg Self Esteem Scale	2
Teacher Efficacy Scale	2
Work Keys (ACT)	2
Alabama High School…	1
Beck Depression Inventory	1
Bem Sex Role Inventory	1
Big Five Inventory	1
California Learning…	1
Cognitive Abilities Test	1
College Student Experiences…	1
Depression Anxiety and Stress…	1
Dynamic Indicators of Basic…	1
Early Childhood Longitudinal…	1
Eysenck Personality Inventory	1
Flesch Kincaid Grade Level…	1
International English…	1
Learning Style Inventory	1
Marlowe Crowne Social…	1
Mathematics Anxiety Rating…	1
Motivated Strategies for…	1
More ▼