ERIC - Search Results

Publication Date

In 2025	2
Since 2024	4
Since 2021 (last 5 years)	10
Since 2016 (last 10 years)	21
Since 2006 (last 20 years)	71

Descriptor

Error of Measurement	104
Reliability	104
Scores	104
Correlation	19
Psychometrics	18
Generalizability Theory	17
Validity	15
Foreign Countries	14
Statistical Analysis	13
Measurement	12
Measurement Techniques	12
Measures (Individuals)	12
Computation	10
Elementary School Students	10
Factor Analysis	10
Academic Achievement	9
Comparative Analysis	9
Scoring	9
Test Construction	9
Test Theory	9
Generalization	8
Item Response Theory	8
Models	8
Test Items	8
Tests	8
More ▼

Publication Type

Journal Articles	78
Reports - Research	58
Reports - Evaluative	24
Reports - Descriptive	15
Speeches/Meeting Papers	10
Dissertations/Theses -…	4
Opinion Papers	3
Book/Product Reviews	2
Numerical/Quantitative Data	2
Tests/Questionnaires	2
ERIC Digests in Full Text	1
ERIC Publications	1
Guides - General	1
Guides - Non-Classroom	1
Information Analyses	1
More ▼

Education Level

Higher Education	13
Postsecondary Education	12
Elementary Education	11
Secondary Education	11
Middle Schools	9
Junior High Schools	8
High Schools	5
Elementary Secondary Education	3
Grade 8	3
Intermediate Grades	3
Grade 4	2
Grade 5	2
Adult Education	1
Early Childhood Education	1
Grade 3	1
Kindergarten	1
Preschool Education	1
Primary Education	1
More ▼

Audience

Policymakers	1
Practitioners	1
Researchers	1
Teachers	1

Location

Pennsylvania	3
United States	3
Australia	2
Canada	2
Portugal	2
Arkansas	1
Chile	1
China	1
China (Beijing)	1
Finland	1
Germany	1
Jordan	1
Maryland	1
Spain	1
Spain (Madrid)	1
Texas (Houston)	1
Turkey	1
United Kingdom (England)	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

What Works Clearinghouse Rating

Showing 1 to 15 of 104 results Save | Export

On the Benefits of Using Maximal Reliability in Educational and Behavioral Research

Peer reviewed

Direct link

Tenko Raykov – Educational and Psychological Measurement, 2024

This note is concerned with the benefits that can result from the use of the maximal reliability and optimal linear combination concepts in educational and psychological research. Within the widely used framework of unidimensional multi-component measuring instruments, it is demonstrated that the linear combination of their components that…

Descriptors: Educational Research, Behavioral Science Research, Reliability, Error of Measurement

New Tests of Rater Drift in Trend Scoring

Peer reviewed

Direct link

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Integrating Bifactor Models into a Generalizability Theory Based Structural Equation Modeling Framework

Peer reviewed

Direct link

Vispoel, Walter P.; Lee, Hyeryung; Xu, Guanlan; Hong, Hyeri – Journal of Experimental Education, 2023

Although generalizability theory (GT) designs have traditionally been analyzed within an ANOVA framework, identical results can be obtained with structural equation models (SEMs) but extended to represent multiple sources of both systematic and measurement error variance, include estimation methods less likely to produce negative variance…

Descriptors: Generalizability Theory, Structural Equation Models, Programming Languages, Scores

Psychometric Properties of the Depression, Anxiety, and Stress Scale-21 (DASS-21) across Nine Countries/Regions

Peer reviewed

Direct link

Cristian Zanon; Nan Zhao; Nursel Topkaya; Ertugrul Sahin; David L. Vogel; Melissa M. Ertl; Samineh Sanatkar; Hsin-Ya Liao; Mark Rubin; Makilim N. Baptista; Winnie W. S. Mak; Fatima Rashed Al-Darmaki; Georg Schomerus; Ying-Fen Wang; Dalia Nasvytiene – International Journal of Testing, 2025

Examinations of the internal structure of the Depression, Anxiety, and Stress Scale-21 (DASS-21) have yielded inconsistent conclusions within and across cultural contexts. This study examined the dimensionality and reliability of the DASS-21 across three theoretically plausible factor structures (i.e., unidimensional, oblique three-factor, and…

Descriptors: Anxiety, Depression (Psychology), Psychometrics, Cultural Context

Linear and Nonlinear Indices of Score Accuracy and Item Effectiveness for Measures That Contain Locally Dependent Items

Peer reviewed

Direct link

Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025

The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…

Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis

Thematic Content Analysis of Studies Using Generalizability Theory

Peer reviewed
PDF on ERIC

Download full text

Teker, Gülsen Tasdelen; Güler, Nese – International Journal of Assessment Tools in Education, 2019

One of the important theories in education and psychology is Generalizability (G) Theory and various properties distinguish it from the other measurement theories. To better understand methodological trends of G theory, a thematic content analysis was conducted. This study analyzes the studies using generalizability theory in the field of…

Descriptors: Generalizability Theory, Content Analysis, Foreign Countries, Education

Psychometric Properties and Measurement Invariance of the Academic Procrastination Scale-Short Form in Spanish Children and Adolescents

Peer reviewed

Direct link

Martín-Puga, M. Eva; Pelegrina, Santiago; Gómez-Pérez, M. Mar; Justicia-Galiano, M. José – Journal of Psychoeducational Assessment, 2022

The objectives were to examine the factorial structure of the Academic Procrastination Scale-Short Form (APS-S) and the measurement invariance across gender and educational levels, to determine possible differences in procrastination across gender, educational levels, and grades. The sample was formed of 1486 Spanish primary and secondary school…

Descriptors: Psychometrics, Measures (Individuals), Study Habits, Scores

Exploring Cross-Cultural and Gender Differences in Test Anxiety among U.S. and Canadian College Students

Peer reviewed

Direct link

Lowe, Patricia A. – Journal of Psychoeducational Assessment, 2019

Existing measures of test anxiety used with the college student population are old with old norms and old items, and they do not capture the multiple dimensions of the test anxiety construct or assess facilitating anxiety. In the present study, the validity of the scores of a new, multidimensional measure of test anxiety with a facilitating…

Descriptors: Cross Cultural Studies, Gender Differences, Test Anxiety, Foreign Countries

Stabilizing Subgroup Proficiency Results to Improve the Identification of Low-Performing Schools. REL 2023-001

Peer reviewed
PDF on ERIC

Download full text

Forrow, Lauren; Starling, Jennifer; Gill, Brian – Regional Educational Laboratory Mid-Atlantic, 2023

The Every Student Succeeds Act requires states to identify schools with low-performing student subgroups for Targeted Support and Improvement or Additional Targeted Support and Improvement. Random differences between students' true abilities and their test scores, also called measurement error, reduce the statistical reliability of the performance…

Descriptors: At Risk Students, Low Achievement, Error of Measurement, Measurement Techniques

Stabilizing Subgroup Proficiency Results to Improve the Identification of Low-Performing Schools. Study Snapshot. REL 2023-001

Peer reviewed
PDF on ERIC

Download full text

Regional Educational Laboratory Mid-Atlantic, 2023

This Snapshot highlights key findings from a study that used Bayesian stabilization to improve the reliability (long-term stability) of subgroup proficiency measures that the Pennsylvania Department of Education (PDE) uses to identify schools for Targeted Support and Improvement (TSI) or Additional Targeted Support and Improvement (ATSI). The…

Descriptors: At Risk Students, Low Achievement, Error of Measurement, Measurement Techniques

Stabilizing Subgroup Proficiency Results to Improve the Identification of Low-Performing Schools. Appendixes. REL 2023-001

Peer reviewed
PDF on ERIC

Download full text

Regional Educational Laboratory Mid-Atlantic, 2023

The "Stabilizing Subgroup Proficiency Results to Improve the Identification of Low-Performing Schools" study used Bayesian stabilization to improve the reliability (long-term stability) of subgroup proficiency measures that the Pennsylvania Department of Education (PDE) uses to identify schools for Targeted Support and Improvement (TSI)…

Descriptors: At Risk Students, Low Achievement, Error of Measurement, Measurement Techniques

Developing Situated Measures of Science Instruction through an Innovative Electronic Portfolio App for Mobile Devices: Reliability, Validity, and Feasibility

Peer reviewed

Direct link

Martínez, José Felipe; Kloser, Matt; Srinivasan, Jayashri; Stecher, Brian; Edelman, Amanda – Educational and Psychological Measurement, 2022

Adoption of new instructional standards in science demands high-quality information about classroom practice. Teacher portfolios can be used to assess instructional practice and support teacher self-reflection anchored in authentic evidence from classrooms. This study investigated a new type of electronic portfolio tool that allows efficient…

Descriptors: Science Instruction, Academic Standards, Instructional Innovation, Electronic Publishing

Estimating Variance Components from Sparse Data Matrices in Large-Scale Educational Assessments

Peer reviewed

Direct link

DeMars, Christine – Applied Measurement in Education, 2015

In generalizability theory studies in large-scale testing contexts, sometimes a facet is very sparsely crossed with the object of measurement. For example, when assessments are scored by human raters, it may not be practical to have every rater score all students. Sometimes the scoring is systematically designed such that the raters are…

Descriptors: Educational Assessment, Measurement, Data, Generalizability Theory

Working with Sparse Data in Rated Language Tests: Generalizability Theory Applications

Peer reviewed

Direct link

Lin, Chih-Kai – Language Testing, 2017

Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…

Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy

The Spanish Version of the Empathy Questionnaire (EmQue): Evidence for Longitudinal Measurement Invariance and Relationship with Emotional Regulation

Peer reviewed

Direct link

Lucas-Molina, Beatriz; Sarmento, Renata; Quintanilla, Laura; Giménez-Dasí, Marta – Early Education and Development, 2018

Research Findings: Empathy, or the ability to understand what others are thinking or feeling, can be observed in early developmental stages. The purpose of this study was to validate the Spanish version of the Empathy Questionnaire (EmQue) and examine its longitudinal measurement invariance (LMI) at 2 time points. Parents of 103 children completed…

Descriptors: Spanish, Empathy, Questionnaires, Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Educational and Psychological…	18
Journal of Educational…	6
Applied Psychological…	4
ETS Research Report Series	4
ProQuest LLC	4
Advances in Health Sciences…	3
Applied Measurement in…	3
Educational Testing Service	3
International Journal of…	3
Journal of Experimental…	3
Regional Educational…	3
Assessment for Effective…	2
Early Education and…	2
Educational Measurement:…	2
Journal of Psychoeducational…	2
Psychometrika	2
School Psychology Quarterly	2
Society for Research on…	2
ACT, Inc.	1
Assessment	1
Assessment & Evaluation in…	1
Assessment Update	1
College Board	1
Developmental Medicine &…	1
Education and the Public…	1
More ▼

Henson, Robin K.	5
Haberman, Shelby J.	4
Kolen, Michael J.	4
Capraro, Robert M.	3
Capraro, Mary Margaret	2
Casabianca, Jodi M.	2
Cook, Thomas D.	2
Fan, Xitao	2
Graham, James M.	2
Harris, Deborah J.	2
Lee, Won-Chan	2
Livingston, Samuel A.	2
McCaffrey, Daniel F.	2
Raymond, Mark R.	2
Schafer, William D.	2
Shadish, William R.	2
Sijtsma, Klaas	2
Steiner, Peter M.	2
Vacha-Haase, Tammi	2
Wang, Tianyou	2
Williams, Richard H.	2
Zimmerman, Donald W.	2
Abu-Hamour, Bashir	1
Acklie, Teresa J.	1
More ▼

ACT Assessment	2
Advanced Placement…	2
SAT (College Admission Test)	2
Beck Depression Inventory	1
Bem Sex Role Inventory	1
Big Five Inventory	1
Depression Anxiety and Stress…	1
Flesch Kincaid Grade Level…	1
Iowa Tests of Basic Skills	1
Learning Style Inventory	1
Mathematics Anxiety Rating…	1
Motivated Strategies for…	1
Myers Briggs Type Indicator	1
National Merit Scholarship…	1
Praxis Series	1
Preliminary Scholastic…	1
Teacher Efficacy Scale	1
United States Medical…	1
Work Keys (ACT)	1
More ▼