ERIC - Search Results

Publication Date

In 2026	0
Since 2025	59
Since 2022 (last 5 years)	416
Since 2017 (last 10 years)	919
Since 2007 (last 20 years)	1970

Descriptor

Error of Measurement	3316
Statistical Analysis	602
Scores	511
Item Response Theory	449
Correlation	434
Comparative Analysis	424
Foreign Countries	418
Test Reliability	412
Computation	407
Simulation	370
Reliability	357
Sample Size	355
Models	353
Evaluation Methods	350
Test Items	349
Measurement Techniques	318
Factor Analysis	311
Sampling	301
Statistical Bias	300
Research Methodology	288
Goodness of Fit	260
Psychometrics	260
Monte Carlo Methods	259
Regression (Statistics)	246
Mathematical Models	241
More ▼

Author

Raykov, Tenko	23
Brennan, Robert L.	19
Kolen, Michael J.	19
Lord, Frederic M.	17
Thompson, Bruce	16
Zimmerman, Donald W.	16
Lee, Won-Chan	15
Livingston, Samuel A.	14
McCaffrey, Daniel F.	14
Yuan, Ke-Hai	14
van der Linden, Wim J.	14
Cai, Li	13
Moses, Tim	13
Beretvas, S. Natasha	12
Marsh, Herbert W.	12
Zwick, Rebecca	12
Algina, James	11
Ferron, John M.	11
Lee, Guemin	11
Lockwood, J. R.	11
Marcoulides, George A.	11
Reardon, Sean F.	11
DeMars, Christine E.	10
Henson, Robin K.	10
More ▼

Education Level

Higher Education	271
Secondary Education	201
Postsecondary Education	197
Elementary Education	194
Elementary Secondary Education	126
Middle Schools	98
High Schools	82
Junior High Schools	78
Early Childhood Education	61
Grade 4	48
Intermediate Grades	44
Primary Education	42
Grade 8	40
Grade 3	39
Grade 5	39
Grade 7	33
Kindergarten	24
Adult Education	23
Grade 6	19
Grade 2	17
Preschool Education	16
Grade 1	15
Grade 10	12
Grade 9	12
Two Year Colleges	6
More ▼

Audience

Researchers	93
Practitioners	23
Teachers	22
Policymakers	10
Administrators	5
Students	4
Counselors	2
Parents	2
Community	1

Location

United States	47
Germany	42
Australia	34
Canada	27
Turkey	27
California	22
United Kingdom (England)	20
Netherlands	18
China	17
New York	15
United Kingdom	15
North Carolina	14
Texas	14
Italy	12
South Korea	12
Florida	11
Indonesia	11
New Zealand	11
Pennsylvania	11
Spain	11
Japan	10
Taiwan	10
Iran	9
Norway	9
Portugal	9
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	11
Race to the Top	6
Elementary and Secondary…	4
Aid to Families with…	1
Elementary and Secondary…	1
Every Student Succeeds Act…	1
Family Educational Rights and…	1
Guaranteed Student Loan…	1
Head Start	1
Individuals with Disabilities…	1
Job Training Partnership Act…	1
Strengthening Career and…	1
More ▼

What Works Clearinghouse Rating

Does not meet standards

Showing 211 to 225 of 3,316 results Save | Export

Interval Estimation of Item Response Probabilities along Studied Latent Dimensions

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A.; Pusic, Martin – Measurement: Interdisciplinary Research and Perspectives, 2021

An interval estimation procedure is discussed that can be used to evaluate the probability of a particular response for a binary or binary scored item at a pre-specified point along an underlying latent continuum. The item is assumed to: (a) be part of a unidimensional multi-component measuring instrument that may contain also polytomous items,…

Descriptors: Item Response Theory, Computation, Probability, Test Items

How Do Social and Economic Status Impact Measurement Error?

Peer reviewed

Direct link

Alexandru Cernat; Vera Toepoel – International Journal of Social Research Methodology, 2024

Most of the social science research is based on the implied assumption that measurement error is the same across key socio-demographic groups and all differences in key statistics of interest are real. Nevertheless, there is evidence that this is not the case. In this paper, the authors tackle this important topic by investigating if data quality…

Descriptors: Error of Measurement, Low Income Groups, Probability, Foreign Countries

Stabilizing School Performance Indicators in New Jersey to Reduce the Effect of Random Error. Appendixes. REL 2025-009

Peer reviewed
PDF on ERIC

Download full text

Regional Educational Laboratory Mid-Atlantic, 2024

These are the appendixes for the report, "Stabilizing School Performance Indicators in New Jersey to Reduce the Effect of Random Error." This study applied a stabilization model called Bayesian hierarchical modeling to group-level data (with groups assigned according to demographic designations) within schools in New Jersey with the aim…

Descriptors: Institutional Evaluation, Elementary Secondary Education, Bayesian Statistics, Test Reliability

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Direct link

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

Impacts of Differences in Group Abilities and Anchor Test Features on Three Non-IRT Test Equating Methods

Peer reviewed
PDF on ERIC

Download full text

Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024

The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Postratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…

Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests

Measuring Autism-Associated Traits in the General Population: Factor Structure and Measurement Invariance across Sex and Diagnosis Status of the Social Communication Questionnaire

Peer reviewed

Direct link

Laura Hegemann; Ragna Bugge Askeland; Stian Barbo Valand; Anne-Siri Øyen; Synnve Schjølberg; Vanessa H. Bal; Somer L. Bishop; Camilla Stoltenberg; Tilmann von Soest; Laurie J. Hannigan; Alexandra Havdahl – Autism: The International Journal of Research and Practice, 2024

Autism screening questionnaires are sometimes used as a measure of "autism-associated traits" in samples drawn from the general population, even though such tools are primarily developed and designed for use in samples of children diagnosed with or being assessed for autism. Here, we explore the psychometric properties of the Social…

Descriptors: Autism Spectrum Disorders, Measurement, Clinical Diagnosis, Sex

How Not to Fool Ourselves about Heterogeneity of Treatment Effects. EdWorkingPaper No. 25-1116

Download full text

Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025

Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…

Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences

Factor Structure and Measurement Invariances of the PHQ-9 in Chinese Students across Gender and Age

Peer reviewed

Direct link

Yanjing Cao; Chenchen Xu; Shan Lu; Qi Li; Jing Xiao – Psychology in the Schools, 2025

The patient health questionnaire-9 (PHQ-9) is widely utilized in assessing individuals' depression levels. Nevertheless, research regarding its factor structure and measurement invariance remains inadequate. The aim of this study was to delve into the factor structure of the PHQ-9 and to further investigate its measurement invariance across gender…

Descriptors: Factor Structure, Error of Measurement, Factor Analysis, Age Differences

Psychometric Properties of the Depression, Anxiety, and Stress Scale-21 (DASS-21) across Nine Countries/Regions

Peer reviewed

Direct link

Cristian Zanon; Nan Zhao; Nursel Topkaya; Ertugrul Sahin; David L. Vogel; Melissa M. Ertl; Samineh Sanatkar; Hsin-Ya Liao; Mark Rubin; Makilim N. Baptista; Winnie W. S. Mak; Fatima Rashed Al-Darmaki; Georg Schomerus; Ying-Fen Wang; Dalia Nasvytiene – International Journal of Testing, 2025

Examinations of the internal structure of the Depression, Anxiety, and Stress Scale-21 (DASS-21) have yielded inconsistent conclusions within and across cultural contexts. This study examined the dimensionality and reliability of the DASS-21 across three theoretically plausible factor structures (i.e., unidimensional, oblique three-factor, and…

Descriptors: Anxiety, Depression (Psychology), Psychometrics, Cultural Context

Factorial Validity and Gender Invariance of the Patient Health Questionnaire (PHQ-9) in Student-Athletes

Peer reviewed

Direct link

Sebastian Harenberg; Lindsey Keenan; Yvette Ingram; Sayre Wilson; Justine Vosloo; Miranda Kaye – Journal of American College Health, 2025

Background/purpose: Depressive symptoms are prevalent in student-athletes. Evidence for the factorial validity of measures assessing depressive symptoms in student-athletes is presently absent from the literature. This study examined the best fitting factorial structure and invariance across sexes of the PHQ-9. Methods: Data were collected from…

Descriptors: Student Athletes, Depression (Psychology), Symptoms (Individual Disorders), Gender Differences

Examiner Drift: What It Is and How It Can Be Avoided in Your Assessment Practices

Direct link

Antoniuk, Andrea; Cormier, Damien C. – Communique, 2020

School psychologists may experience examiner drift--a deviation from standardized administration and scoring procedures that occurs slowly over time. The purpose of this article is to explain how examiner drift occurs, outline how it can be assessed, and how it can be prevented.

Descriptors: Error of Measurement, Standardized Tests, School Psychologists, Skill Development

Comparison of Respiratory Calibration Methods for the Estimation of Lung Volume in Children with and without Neuromotor Disorders

Peer reviewed

Direct link

Darling-White, Meghan – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The primary purpose of this study was to validate common respiratory calibration methods for estimating lung volume in children. Method: Respiratory kinematic data were collected via inductive plethysmography from 81 typically developing children and nine children with neuromotor disorders. Correction factors for the rib cage and abdomen…

Descriptors: Physiology, Human Body, Psychomotor Skills, Neurological Impairments

On the Merits of Longitudinal Multiple Group Modelling: An Alternative to Multilevel Modelling for Intervention Evaluations

Peer reviewed

Direct link

Little, Todd D.; Bontempo, Daniel; Rioux, Charlie; Tracy, Allison – International Journal of Research & Method in Education, 2022

Multilevel modelling (MLM) is the most frequently used approach for evaluating interventions with clustered data. MLM, however, has some limitations that are associated with numerous obstacles to model estimation and valid inferences. Longitudinal multiple-group (LMG) modelling is a longstanding approach for testing intervention effects using…

Descriptors: Longitudinal Studies, Hierarchical Linear Modeling, Alternative Assessment, Intervention

Effects of Using Double Ratings as Item Scores on IRT Proficiency Estimation

Peer reviewed

Direct link

Song, Yoon Ah; Lee, Won-Chan – Applied Measurement in Education, 2022

This article presents the performance of item response theory (IRT) models when double ratings are used as item scores over single ratings when rater effects are present. Study 1 examined the influence of the number of ratings on the accuracy of proficiency estimation in the generalized partial credit model (GPCM). Study 2 compared the accuracy of…

Descriptors: Item Response Theory, Item Analysis, Scores, Accuracy

Performance of Infit and Outfit Confidence Intervals Calculated via Parametric Bootstrapping

Peer reviewed

Direct link

Silva Diaz, John Alexander; Köhler, Carmen; Hartig, Johannes – Applied Measurement in Education, 2022

Testing item fit is central in item response theory (IRT) modeling, since a good fit is necessary to draw valid inferences from estimated model parameters. "Infit" and "outfit" fit statistics, widespread indices for detecting deviations from the Rasch model, are affected by data factors, such as sample size. Consequently, the…

Descriptors: Intervals, Item Response Theory, Item Analysis, Inferences

« Previous Page | Next Page »

Pages: 1 | ... | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | ... | 222

Educational and Psychological…	259
Journal of Educational…	117
ProQuest LLC	95
Applied Psychological…	85
Journal of Educational and…	85
Psychometrika	82
Structural Equation Modeling:…	76
Grantee Submission	71
Journal of Experimental…	70
ETS Research Report Series	59
Multivariate Behavioral…	54
Applied Measurement in…	50
Sociological Methods &…	47
Journal of Psychoeducational…	38
Psychological Methods	33
Society for Research on…	33
Educational Measurement:…	32
Research Synthesis Methods	32
Online Submission	29
Practical Assessment,…	27
International Journal of…	26
Journal of Educational…	26
National Center for Education…	25
Psychology in the Schools	25
International Journal of…	23
More ▼

Journal Articles	2363
Reports - Research	1909
Reports - Evaluative	704
Reports - Descriptive	344
Speeches/Meeting Papers	329
Dissertations/Theses -…	95
Numerical/Quantitative Data	86
Opinion Papers	77
Information Analyses	72
Tests/Questionnaires	47
Guides - Non-Classroom	27
Guides - Classroom - Teacher	12
Book/Product Reviews	10
Reports - General	9
ERIC Publications	8
ERIC Digests in Full Text	7
Guides - General	7
Books	6
Guides - Classroom - Learner	4
Collected Works - General	3
Legal/Legislative/Regulatory…	3
Historical Materials	2
Collected Works - Proceedings	1
Collected Works - Serial	1
Collected Works - Serials	1
More ▼

Program for International…	45
National Assessment of…	40
SAT (College Admission Test)	24
Trends in International…	24
ACT Assessment	20
Wechsler Intelligence Scale…	20
Early Childhood Longitudinal…	19
Wechsler Adult Intelligence…	12
Iowa Tests of Basic Skills	10
Schools and Staffing Survey…	10
Test of English as a Foreign…	9
Child Behavior Checklist	7
Graduate Record Examinations	7
National Longitudinal Survey…	7
Progress in International…	7
Beck Depression Inventory	6
Advanced Placement…	5
Armed Services Vocational…	5
Cognitive Abilities Test	5
Longitudinal Surveys of…	5
National Household Education…	5
Rosenberg Self Esteem Scale	5
Dynamic Indicators of Basic…	4
Law School Admission Test	4
Motivated Strategies for…	4
More ▼