ERIC - Search Results

Publication Date

In 2025	0
Since 2024	3
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	14

Descriptor

Data Analysis	24
Error of Measurement	24
Reliability	24
Evaluation Methods	7
Statistical Analysis	7
Sampling	6
Correlation	5
Validity	5
Generalizability Theory	4
Research Methodology	4
Scores	4
Analysis of Variance	3
Computation	3
Measurement Techniques	3
Predictor Variables	3
Research Problems	3
Scoring Rubrics	3
Surveys	3
True Scores	3
Accuracy	2
Classroom Observation…	2
Coding	2
Data Collection	2
Data Interpretation	2
Evaluators	2
More ▼

Source

Educational and Psychological…	3
Applied Measurement in…	1
Canadian Journal of Program…	1
Economics of Education Review	1
Educational Researcher	1
Eurasian Journal of…	1
Evaluation and Program…	1
Grantee Submission	1
Journal of Educational Data…	1
Language Testing	1
Language Testing in Asia	1
Measurement:…	1
National Assessment Governing…	1
Occupational Therapy Journal…	1
ProQuest LLC	1
School Psychology Quarterly	1
More ▼

Publication Type

Journal Articles	15
Reports - Research	9
Reports - Evaluative	6
Reports - Descriptive	3
Dissertations/Theses -…	1
Information Analyses	1
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Elementary Education	3
Middle Schools	2
Grade 6	1
Grade 7	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Secondary Education	1

Audience

Location

New York	1
Texas (Houston)	1
Thailand	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

The Analysis of Marking Reliability through the Approach of Gauge Repeatability and Reproducibility (GR&R) Study: A Case of English-Speaking Test

Peer reviewed

Direct link

Pornphan Sureeyatanapas; Panitas Sureeyatanapas; Uthumporn Panitanarak; Jittima Kraisriwattana; Patchanan Sarootyanapat; Daniel O'Connell – Language Testing in Asia, 2024

Ensuring consistent and reliable scoring is paramount in education, especially in performance-based assessments. This study delves into the critical issue of marking consistency, focusing on speaking proficiency tests in English language learning, which often face greater reliability challenges. While existing literature has explored various…

Descriptors: Foreign Countries, Students, English Language Learners, Speech

Using Multiple Imputation to Account for the Uncertainty Due to Missing Data in the Context of Factor Retention

Peer reviewed

Direct link

Yan Xia; Selim Havan – Educational and Psychological Measurement, 2024

Although parallel analysis has been found to be an accurate method for determining the number of factors in many conditions with complete data, its application under missing data is limited. The existing literature recommends that, after using an appropriate multiple imputation method, researchers either apply parallel analysis to every imputed…

Descriptors: Data Interpretation, Factor Analysis, Statistical Inference, Research Problems

New Tests of Rater Drift in Trend Scoring

Peer reviewed

Direct link

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Reliability Estimation in Longitudinal Studies Using Latent Growth Curve Modeling

Peer reviewed

Direct link

Marcoulides, Katerina M. – Measurement: Interdisciplinary Research and Perspectives, 2019

Longitudinal data analysis has received widespread interest throughout educational, behavioral, and social science research, with latent growth curve modeling currently being one of the most popular methods of analysis. Despite the popularity of latent growth curve modeling, limited attention has been directed toward understanding the issues of…

Descriptors: Reliability, Longitudinal Studies, Growth Models, Structural Equation Models

The Effects of Sample Size and Missing Data Rates on Generalizability Coefficients

Peer reviewed
PDF on ERIC

Download full text

Soysal, Sumeyra; Karaman, Haydar; Dogan, Nuri – Eurasian Journal of Educational Research, 2018

Purpose of the Study: Missing data are a common problem encountered while implementing measurement instruments. Yet the extent to which reliability, validity, average discrimination and difficulty of the test results are affected by the missing data has not been studied much. Since it is inevitable that missing data have an impact on the…

Descriptors: Sample Size, Data Analysis, Research Problems, Error of Measurement

Working with Sparse Data in Rated Language Tests: Generalizability Theory Applications

Peer reviewed

Direct link

Lin, Chih-Kai – Language Testing, 2017

Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…

Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy

Generalizability Theory Reliability of Written Expression Curriculum-Based Measurement in Universal Screening

Peer reviewed

Direct link

Keller-Margulis, Milena A.; Mercer, Sterett H.; Thomas, Erin L. – School Psychology Quarterly, 2016

The purpose of this study was to examine the reliability of written expression curriculum-based measurement (WE-CBM) in the context of universal screening from a generalizability theory framework. Students in second through fifth grade (n = 145) participated in the study. The sample included 54% female students, 49% White students, 23% African…

Descriptors: Generalizability Theory, Reliability, Written Language, Curriculum Based Assessment

Describing Profiles of Instructional Practice: A New Approach to Analyzing Classroom Observation Data

Peer reviewed

Direct link

Halpin, Peter F.; Kieffer, Michael J. – Educational Researcher, 2015

The authors outline the application of latent class analysis (LCA) to classroom observational instruments. LCA offers diagnostic information about teachers' instructional strengths and weaknesses, along with estimates of measurement error for individual teachers, while remaining relatively straightforward to implement and interpret. It is…

Descriptors: Multivariate Analysis, Classroom Observation Techniques, Data Analysis, Error of Measurement

Describing Profiles of Instructional Practice: A New Approach to Analyzing Classroom Observation Data

Peer reviewed
PDF on ERIC

Download full text

Halpin, Peter F.; Kieffer, Michael J. – Grantee Submission, 2015

Descriptors: Multivariate Analysis, Classroom Observation Techniques, Data Analysis, Error of Measurement

False Positives in Multiple Regression: Unanticipated Consequences of Measurement Error in the Predictor Variables

Peer reviewed

Direct link

Shear, Benjamin R.; Zumbo, Bruno D. – Educational and Psychological Measurement, 2013

Type I error rates in multiple regression, and hence the chance for false positive research findings, can be drastically inflated when multiple regression models are used to analyze data that contain random measurement error. This article shows the potential for inflated Type I error rates in commonly encountered scenarios and provides new…

Descriptors: Error of Measurement, Multiple Regression Analysis, Data Analysis, Computer Simulation

Metrics for Evaluation of Student Models

Peer reviewed
PDF on ERIC

Download full text

Pelanek, Radek – Journal of Educational Data Mining, 2015

Researchers use many different metrics for evaluation of performance of student models. The aim of this paper is to provide an overview of commonly used metrics, to discuss properties, advantages, and disadvantages of different metrics, to summarize current practice in educational data mining, and to provide guidance for evaluation of student…

Descriptors: Models, Data Analysis, Data Processing, Evaluation Criteria

Investigating the Reliability and Validity of the Consortium on Reading Excellence (CORE) Phonics Survey

Direct link

Brandt, Lorilynn – ProQuest LLC, 2010

Phonics was identified as one of the critical components in reading development by the National Reading Panel. Over time, research has repeatedly identified phonics as important to early reading development. Given the compelling evidence supporting the teaching of phonics in early reading, it is critical to make sure that instructional decisions…

Descriptors: Generalizability Theory, Phonics, Early Reading, Validity

Correction for Attenuation with Biased Reliability Estimates and Correlated Errors in Populations and Samples

Peer reviewed

Direct link

Zimmerman, Donald W. – Educational and Psychological Measurement, 2007

Properties of the Spearman correction for attenuation were investigated using Monte Carlo methods, under conditions where correlations between error scores exist as a population parameter and also where correlated errors arise by chance in random sampling. Equations allowing for all possible dependence among true and error scores on two tests at…

Descriptors: Monte Carlo Methods, Correlation, Sampling, Data Analysis

Measurement Error, Education Production and Data Envelopment Analysis

Peer reviewed

Direct link

Ruggiero, John – Economics of Education Review, 2006

Data Envelopment Analysis has become a popular tool for evaluating the efficiency of decision making units. The nonparametric approach has been widely applied to educational production. The approach is, however, deterministic and leads to biased estimates of performance in the presence of measurement error. Numerous simulation studies confirm the…

Descriptors: Data Analysis, Decision Making, Efficiency, Productivity

What Is Reliability Generalization, and Why Is It Important?

Download full text

Cousin, Sherri L.; Henson, Robin K. – 2000

Researchers consistently fail to report reliability estimates for data used in their studies. This lack of reporting hinders appropriate evaluation and interpretation of data and may lead to inappropriate conclusions. Because reliability is inured to scores obtained from a test, and not the test itself, it is important to report score reliability…

Descriptors: Data Analysis, Error of Measurement, Estimation (Mathematics), Generalization

Previous Page | Next Page »

Pages: 1 | 2

Halpin, Peter F.	2
Kieffer, Michael J.	2
Brandt, Lorilynn	1
Carol Eckerly	1
Cohen, Patricia	1
Conley, Valerie	1
Cousin, Sherri L.	1
Creighton, Cynthia L.	1
Daniel O'Connell	1
Dijkers, Marcel P. J. M.	1
Dogan, Nuri	1
Edwards, Keith J.	1
Evans, Brian	1
Fink, Steven	1
Haertel, Edward H.	1
Henson, Robin K.	1
Hornik, Robert	1
Jittima Kraisriwattana	1
John R. Donoghue	1
Karaman, Haydar	1
Keller-Margulis, Milena A.	1
Lin, Chih-Kai	1
Livingston, Samuel A.	1
Marcoulides, Katerina M.	1
McMorris, Robert F.	1
More ▼