ERIC - Search Results

Publication Date

In 2025	6
Since 2024	21
Since 2021 (last 5 years)	63
Since 2016 (last 10 years)	137
Since 2006 (last 20 years)	350

Descriptor

Error of Measurement	504
Scores	504
Reliability	104
Correlation	92
Item Response Theory	77
Test Reliability	76
Foreign Countries	71
Statistical Analysis	71
Comparative Analysis	69
Psychometrics	68
Computation	60
Models	57
Test Items	57
Academic Achievement	54
Factor Analysis	54
Measurement Techniques	47
Test Validity	46
Simulation	42
Evaluation Methods	41
Measurement	41
Regression (Statistics)	40
Sample Size	40
Generalizability Theory	39
Measures (Individuals)	38
Elementary School Students	37
More ▼

Publication Type

Journal Articles	338
Reports - Research	300
Reports - Evaluative	115
Speeches/Meeting Papers	50
Reports - Descriptive	37
Dissertations/Theses -…	25
Numerical/Quantitative Data	11
Opinion Papers	11
Tests/Questionnaires	7
Guides - Non-Classroom	6
Information Analyses	4
Book/Product Reviews	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Books	1
Collected Works - Serials	1
Reports - General	1
More ▼

Education Level

Higher Education	62
Postsecondary Education	54
Secondary Education	44
Elementary Education	43
Elementary Secondary Education	30
Middle Schools	25
High Schools	19
Junior High Schools	18
Grade 8	11
Grade 4	9
Intermediate Grades	9
Grade 5	8
Early Childhood Education	7
Kindergarten	7
Grade 7	6
Grade 3	5
Primary Education	5
Grade 10	4
Grade 9	4
Grade 2	3
Grade 6	3
Grade 11	2
Grade 12	2
Preschool Education	2
Adult Education	1
More ▼

Audience

Researchers	14
Policymakers	4
Teachers	3
Practitioners	2
Administrators	1
Community	1
Counselors	1

Location

Germany	7
United Kingdom (England)	7
United States	7
Canada	6
Netherlands	6
Australia	5
California	5
North Carolina	5
Turkey	5
China	4
New York	4
Tennessee	4
Indonesia	3
Iran	3
Pennsylvania	3
Portugal	3
Spain	3
District of Columbia	2
Florida	2
Georgia	2
Ohio	2
Saudi Arabia	2
South Africa	2
South Korea	2
Sudan	2
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	5
Race to the Top	4
Head Start	1

What Works Clearinghouse Rating

Showing 1 to 15 of 504 results Save | Export

Using Item Scores and Distractors in Person-Fit Assessment

Peer reviewed

Direct link

Gorney, Kylie; Wollack, James A. – Journal of Educational Measurement, 2023

In order to detect a wide range of aberrant behaviors, it can be useful to incorporate information beyond the dichotomous item scores. In this paper, we extend the l[subscript z] and l*[subscript z] person-fit statistics so that unusual behavior in item scores and unusual behavior in item distractors can be used as indicators of aberrance. Through…

Descriptors: Test Items, Scores, Goodness of Fit, Statistics

On the Benefits of Using Maximal Reliability in Educational and Behavioral Research

Peer reviewed

Direct link

Tenko Raykov – Educational and Psychological Measurement, 2024

This note is concerned with the benefits that can result from the use of the maximal reliability and optimal linear combination concepts in educational and psychological research. Within the widely used framework of unidimensional multi-component measuring instruments, it is demonstrated that the linear combination of their components that…

Descriptors: Educational Research, Behavioral Science Research, Reliability, Error of Measurement

Sample Size Calculation and Optimal Design for Multivariate Regression-Based Norming

Peer reviewed

Direct link

Francesco Innocenti; Math J. J. M. Candel; Frans E. S. Tan; Gerard J. P. van Breukelen – Journal of Educational and Behavioral Statistics, 2024

Normative studies are needed to obtain norms for comparing individuals with the reference population on relevant clinical or educational measures. Norms can be obtained in an efficient way by regressing the test score on relevant predictors, such as age and sex. When several measures are normed with the same sample, a multivariate regression-based…

Descriptors: Sample Size, Multivariate Analysis, Error of Measurement, Regression (Statistics)

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

How Did Spain Perform in PISA 2018? New Estimates of Children's PISA Reading Scores

Peer reviewed

Direct link

John Jerrim; Luis Alejandro Lopez-Agudo; Oscar David Marcenaro-Gutierrez – British Journal of Educational Studies, 2024

International large-scale assessments have gained much attention since the beginning of the twenty-first century, influencing education legislation in many countries. This includes Spain, where they have been used by successive governments to justify education policy change. Unfortunately, there was a problem with the PISA 2018 reading scores for…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

New Tests of Rater Drift in Trend Scoring

Peer reviewed

Direct link

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Detecting Careless Responding in Multidimensional Forced-Choice Questionnaires

Peer reviewed

Direct link

Rebekka Kupffer; Susanne Frick; Eunike Wetzel – Educational and Psychological Measurement, 2024

The multidimensional forced-choice (MFC) format is an alternative to rating scales in which participants rank items according to how well the items describe them. Currently, little is known about how to detect careless responding in MFC data. The aim of this study was to adapt a number of indices used for rating scales to the MFC format and…

Descriptors: Measurement Techniques, Alternative Assessment, Rating Scales, Questionnaires

Improving the Precision of Classroom Observation Scores Using a Multi-Rater and Multi-Timepoint Item Response Theory Model

Peer reviewed

Direct link

Kelly Edwards; James Soland – Educational Assessment, 2024

Classroom observational protocols, in which raters observe and score the quality of teachers' instructional practices, are often used to evaluate teachers for consequential purposes despite evidence that scores from such protocols are frequently driven by factors, such as rater and temporal effects, that have little to do with teacher quality. In…

Descriptors: Classroom Observation Techniques, Teacher Evaluation, Accuracy, Scores

Integrating Bifactor Models into a Generalizability Theory Based Structural Equation Modeling Framework

Peer reviewed

Direct link

Vispoel, Walter P.; Lee, Hyeryung; Xu, Guanlan; Hong, Hyeri – Journal of Experimental Education, 2023

Although generalizability theory (GT) designs have traditionally been analyzed within an ANOVA framework, identical results can be obtained with structural equation models (SEMs) but extended to represent multiple sources of both systematic and measurement error variance, include estimation methods less likely to produce negative variance…

Descriptors: Generalizability Theory, Structural Equation Models, Programming Languages, Scores

Examining the Psychometric Properties of the Student Behavior Checklist-Brief's Subject Scores: Tests of Measurement Invariance

Peer reviewed

Direct link

Jake C. Steggerda; Sandra Yu Rueger; Ana J. Bridges – Children & Schools, 2024

Authors evaluated the Student Behavior Checklist-Brief (SBC-B) to test whether teacher-reports of student learning approach (i.e., learned helplessness [LH] and mastery orientation [MO]) were invariant across academic subjects. The current sample includes ethnically diverse seventh and eighth grade students (N = 145; 53 percent male) and six teams…

Descriptors: Psychometrics, Student Behavior, Check Lists, Scores

Comparing Factor Score Approaches to SEM in Multigroup Models with Small Samples

Peer reviewed

Direct link

Emma Somer; Carl Falk; Milica Miocevic – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Factor Score Regression (FSR) is increasingly employed as an alternative to structural equation modeling (SEM) in small samples. Despite its popularity in psychology, the performance of FSR in multigroup models with small samples remains relatively unknown. The goal of this study was to examine the performance of FSR, namely Croon's correction and…

Descriptors: Scores, Structural Equation Models, Comparative Analysis, Sample Size

Likelihood-Based Estimation of Model-Derived Oral Reading Fluency

Peer reviewed

Direct link

Cornelis Potgieter; Xin Qiao; Akihito Kamata; Yusuf Kara – Grantee Submission, 2024

As part of the effort to develop an improved oral reading fluency (ORF) assessment system, Kara et al. (2020) estimated the ORF scores based on a latent variable psychometric model of accuracy and speed for ORF data via a fully Bayesian approach. This study further investigates likelihood-based estimators for the model-derived ORF scores,…

Descriptors: Oral Reading, Reading Fluency, Scores, Psychometrics

Likelihood-Based Estimation of Model-Derived Oral Reading Fluency

Peer reviewed

Direct link

Cornelis Potgieter; Xin Qiao; Akihito Kamata; Yusuf Kara – Journal of Educational Measurement, 2024

As part of the effort to develop an improved oral reading fluency (ORF) assessment system, Kara et al. estimated the ORF scores based on a latent variable psychometric model of accuracy and speed for ORF data via a fully Bayesian approach. This study further investigates likelihood-based estimators for the model-derived ORF scores, including…

Descriptors: Oral Reading, Reading Fluency, Scores, Psychometrics

Standard Errors of Variance Components, Measurement Errors and Generalizability Coefficients for Crossed Designs

Peer reviewed

Direct link

Almehrizi, Rashid S. – Journal of Educational Measurement, 2021

Estimates of various variance components, universe score variance, measurement error variances, and generalizability coefficients, like all statistics, are subject to sampling variability, particularly in small samples. Such variability is quantified traditionally through estimated standard errors and/or confidence intervals. The paper derived new…

Descriptors: Error of Measurement, Statistics, Design, Generalizability Theory

Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model

Download full text

Custer, Michael; Kim, Jongpil – Online Submission, 2023

This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Batelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…

Descriptors: Sample Size, Item Response Theory, Test Items, Computation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 34

Educational and Psychological…	45
Journal of Educational…	31
ProQuest LLC	25
Applied Psychological…	18
Journal of Educational and…	14
ETS Research Report Series	13
Grantee Submission	13
Applied Measurement in…	11
Journal of Psychoeducational…	9
Educational Measurement:…	8
International Journal of…	8
Society for Research on…	8
Educational Assessment	7
Psychometrika	7
Online Submission	6
Educational Testing Service	5
Journal of Experimental…	5
Assessment	4
Assessment for Effective…	4
Language Testing	4
Measurement and Evaluation in…	4
Psychological Assessment	4
Psychology in the Schools	4
Structural Equation Modeling:…	4
ACT, Inc.	3
More ▼

Lee, Won-Chan	10
Kolen, Michael J.	7
Haberman, Shelby J.	6
McCaffrey, Daniel F.	6
Reardon, Sean F.	6
Henson, Robin K.	5
Lockwood, J. R.	5
Zimmerman, Donald W.	5
Brennan, Robert L.	4
Ho, Andrew D.	4
Kane, Michael	4
Moses, Tim	4
Zwick, Rebecca	4
Alderman, Donald L.	3
Cai, Li	3
Capraro, Robert M.	3
Cho, Sun-Joo	3
Davison, Mark L.	3
DeMars, Christine E.	3
Floyd, Randy G.	3
Isenberg, Eric	3
Kamata, Akihito	3
Lee, Guemin	3
Livingston, Samuel A.	3
More ▼

SAT (College Admission Test)	13
ACT Assessment	11
Program for International…	7
Test of English as a Foreign…	7
Early Childhood Longitudinal…	5
Graduate Record Examinations	5
Wechsler Adult Intelligence…	5
Iowa Tests of Basic Skills	4
National Assessment of…	4
Trends in International…	4
Wechsler Intelligence Scale…	4
Advanced Placement…	3
General Educational…	3
Behavior Assessment System…	2
Big Five Inventory	2
Cognitive Abilities Test	2
Mathematics Anxiety Rating…	2
Motivated Strategies for…	2
National Merit Scholarship…	2
New Jersey College Basic…	2
Praxis Series	2
Preliminary Scholastic…	2
Aberrant Behavior Checklist	1
Alabama High School…	1
Armed Forces Qualification…	1
More ▼