ERIC - Search Results

Showing 196 to 210 of 3,295 results

How Do Social and Economic Status Impact Measurement Error?

Peer reviewed

Alexandru Cernat; Vera Toepoel – International Journal of Social Research Methodology, 2024

Most of the social science research is based on the implied assumption that measurement error is the same across key socio-demographic groups and all differences in key statistics of interest are real. Nevertheless, there is evidence that this is not the case. In this paper, the authors tackle this important topic by investigating if data quality…

Descriptors: Error of Measurement, Low Income Groups, Probability, Foreign Countries

Stabilizing School Performance Indicators in New Jersey to Reduce the Effect of Random Error. Appendixes. REL 2025-009

Peer reviewed

Regional Educational Laboratory Mid-Atlantic, 2024

These are the appendixes for the report, "Stabilizing School Performance Indicators in New Jersey to Reduce the Effect of Random Error." This study applied a stabilization model called Bayesian hierarchical modeling to group-level data (with groups assigned according to demographic designations) within schools in New Jersey with the aim…

Descriptors: Institutional Evaluation, Elementary Secondary Education, Bayesian Statistics, Test Reliability

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

Impacts of Differences in Group Abilities and Anchor Test Features on Three Non-IRT Test Equating Methods

Peer reviewed

Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024

The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Poststratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…

Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests

Measuring Autism-Associated Traits in the General Population: Factor Structure and Measurement Invariance across Sex and Diagnosis Status of the Social Communication Questionnaire

Peer reviewed

Laura Hegemann; Ragna Bugge Askeland; Stian Barbo Valand; Anne-Siri Øyen; Synnøve Schjølberg; Vanessa H. Bal; Somer L. Bishop; Camilla Stoltenberg; Tilmann von Soest; Laurie J. Hannigan; Alexandra Havdahl – Autism: The International Journal of Research and Practice, 2024

Autism screening questionnaires are sometimes used as a measure of "autism-associated traits" in samples drawn from the general population, even though such tools are primarily developed and designed for use in samples of children diagnosed with or being assessed for autism. Here, we explore the psychometric properties of the Social…

Descriptors: Autism Spectrum Disorders, Measurement, Clinical Diagnosis, Sex

Psychometric Properties of the Depression, Anxiety, and Stress Scale-21 (DASS-21) across Nine Countries/Regions

Peer reviewed

Cristian Zanon; Nan Zhao; Nursel Topkaya; Ertugrul Sahin; David L. Vogel; Melissa M. Ertl; Samineh Sanatkar; Hsin-Ya Liao; Mark Rubin; Makilim N. Baptista; Winnie W. S. Mak; Fatima Rashed Al-Darmaki; Georg Schomerus; Ying-Fen Wang; Dalia Nasvytiene – International Journal of Testing, 2025

Examinations of the internal structure of the Depression, Anxiety, and Stress Scale-21 (DASS-21) have yielded inconsistent conclusions within and across cultural contexts. This study examined the dimensionality and reliability of the DASS-21 across three theoretically plausible factor structures (i.e., unidimensional, oblique three-factor, and…

Descriptors: Anxiety, Depression (Psychology), Psychometrics, Cultural Context

How Not to Fool Ourselves about Heterogeneity of Treatment Effects. EdWorkingPaper No. 25-1116

Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025

Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…

Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences

Factor Structure and Measurement Invariances of the PHQ-9 in Chinese Students across Gender and Age

Peer reviewed

Yanjing Cao; Chenchen Xu; Shan Lu; Qi Li; Jing Xiao – Psychology in the Schools, 2025

The patient health questionnaire-9 (PHQ-9) is widely utilized in assessing individuals' depression levels. Nevertheless, research regarding its factor structure and measurement invariance remains inadequate. The aim of this study was to delve into the factor structure of the PHQ-9 and to further investigate its measurement invariance across gender…

Descriptors: Factor Structure, Error of Measurement, Factor Analysis, Age Differences

Comparison of Respiratory Calibration Methods for the Estimation of Lung Volume in Children with and without Neuromotor Disorders

Peer reviewed

Darling-White, Meghan – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The primary purpose of this study was to validate common respiratory calibration methods for estimating lung volume in children. Method: Respiratory kinematic data were collected via inductive plethysmography from 81 typically developing children and nine children with neuromotor disorders. Correction factors for the rib cage and abdomen…

Descriptors: Physiology, Human Body, Psychomotor Skills, Neurological Impairments

On the Merits of Longitudinal Multiple Group Modelling: An Alternative to Multilevel Modelling for Intervention Evaluations

Peer reviewed

Little, Todd D.; Bontempo, Daniel; Rioux, Charlie; Tracy, Allison – International Journal of Research & Method in Education, 2022

Multilevel modelling (MLM) is the most frequently used approach for evaluating interventions with clustered data. MLM, however, has some limitations that are associated with numerous obstacles to model estimation and valid inferences. Longitudinal multiple-group (LMG) modelling is a longstanding approach for testing intervention effects using…

Descriptors: Longitudinal Studies, Hierarchical Linear Modeling, Alternative Assessment, Intervention

Effects of Using Double Ratings as Item Scores on IRT Proficiency Estimation

Peer reviewed

Song, Yoon Ah; Lee, Won-Chan – Applied Measurement in Education, 2022

This article presents the performance of item response theory (IRT) models when double ratings are used as item scores over single ratings when rater effects are present. Study 1 examined the influence of the number of ratings on the accuracy of proficiency estimation in the generalized partial credit model (GPCM). Study 2 compared the accuracy of…

Descriptors: Item Response Theory, Item Analysis, Scores, Accuracy

Performance of Infit and Outfit Confidence Intervals Calculated via Parametric Bootstrapping

Peer reviewed

Silva Diaz, John Alexander; Köhler, Carmen; Hartig, Johannes – Applied Measurement in Education, 2022

Testing item fit is central in item response theory (IRT) modeling, since a good fit is necessary to draw valid inferences from estimated model parameters. "Infit" and "outfit" fit statistics, widespread indices for detecting deviations from the Rasch model, are affected by data factors, such as sample size. Consequently, the…

Descriptors: Intervals, Item Response Theory, Item Analysis, Inferences

Assessing Inter-Rater Reliability with Heterogeneous Variance Components Models: Flexible Approach Accounting for Contextual Variables

Peer reviewed

Martinková, Patrícia; Bartoš, František; Brabec, Marek – Journal of Educational and Behavioral Statistics, 2023

Inter-rater reliability (IRR), which is a prerequisite of high-quality ratings and assessments, may be affected by contextual variables, such as the rater's or ratee's gender, major, or experience. Identification of such heterogeneity sources in IRR is important for the implementation of policies with the potential to decrease measurement error…

Descriptors: Interrater Reliability, Bayesian Statistics, Statistical Inference, Hierarchical Linear Modeling

Relative Robustness of CDMs and (M)IRT in Measuring Growth in Latent Skills

Peer reviewed

Huang, Qi; Bolt, Daniel M. – Educational and Psychological Measurement, 2023

Previous studies have demonstrated evidence of latent skill continuity even in tests intentionally designed for measurement of binary skills. In addition, the assumption of binary skills when continuity is present has been shown to potentially create a lack of invariance in item and latent ability parameters that may undermine applications. In…

Descriptors: Item Response Theory, Test Items, Skill Development, Robustness (Statistics)

Combining Estimators in Interlaboratory Studies and Meta-Analyses

Peer reviewed

Huang, Hening – Research Synthesis Methods, 2023

Many statistical methods (estimators) are available for estimating the consensus value (or average effect) and heterogeneity variance in interlaboratory studies or meta-analyses. These estimators are all valid because they are developed from or supported by certain statistical principles. However, no estimator can be perfect and must have error or…

Descriptors: Statistical Analysis, Computation, Measurement Techniques, Meta Analysis

