ERIC - Search Results

Publication Date

In 2025	0
Since 2024	5
Since 2021 (last 5 years)	15
Since 2016 (last 10 years)	388
Since 2006 (last 20 years)	998

Descriptor

Reliability	1276
Statistical Analysis	1276
Foreign Countries	528
Validity	485
Correlation	339
Questionnaires	301
Measures (Individuals)	280
Factor Analysis	250
Student Attitudes	187
Scores	164
Comparative Analysis	160
Gender Differences	149
Likert Scales	145
College Students	137
Psychometrics	114
Research Methodology	111
Predictor Variables	101
Academic Achievement	99
Teacher Attitudes	98
Models	97
Teaching Methods	97
Elementary School Students	82
Qualitative Research	82
Rating Scales	80
Construct Validity	77
More ▼

Education Level

Higher Education	363
Postsecondary Education	299
Secondary Education	168
Elementary Education	134
High Schools	72
Middle Schools	62
Junior High Schools	43
Elementary Secondary Education	40
Early Childhood Education	39
Grade 6	27
Grade 5	25
Grade 8	25
Grade 7	22
Primary Education	21
Intermediate Grades	20
Grade 3	18
Grade 4	18
Preschool Education	17
Adult Education	16
Grade 9	13
Grade 11	12
Grade 1	10
Grade 10	10
Kindergarten	10
Grade 2	9
More ▼

Audience

Researchers	16
Practitioners	9
Teachers	6
Students	5
Administrators	4
Counselors	2
Parents	1
Policymakers	1

Location

Turkey	105
Nigeria	51
Taiwan	25
Jordan	24
Australia	19
Canada	17
Iran	16
India	14
Florida	13
Greece	12
China	11
Malaysia	11
Saudi Arabia	11
California	10
New York	10
Texas	10
Germany	9
Netherlands	9
South Korea	9
United Kingdom	9
Indonesia	8
Finland	7
Kuwait	7
Norway	7
Ohio	7
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	6
Individuals with Disabilities…	4
Race to the Top	2
Americans with Disabilities…	1
Debra P v Turlington	1
Reading Excellence Act	1
Rehabilitation Act 1973…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 1,276 results Save | Export

New Tests of Rater Drift in Trend Scoring

Peer reviewed

Direct link

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Reliability of Measuring Constructs in Applied Linguistics Research: A Comparative Study of Domestic and International Graduate Theses

Peer reviewed

Direct link

Razavipour, Kioumars; Raji, Behnaz – Language Testing in Asia, 2022

The credibility of conclusions arrived at in quantitative research depends, to a large extent, on the quality of data collection instruments used to quantify language and non-language constructs. Despite this, research into data collection instruments used in Applied Linguistics and particularly in the thesis genre remains limited. This study…

Descriptors: Applied Linguistics, Test Reliability, Language Tests, Credibility

Designing Multisite Randomized Trials to Detect (Moderated) Mediation Effects

Peer reviewed

Direct link

Fangxing Bai; Ben Kelcey; Amota Ataneka; Yanli Xie; Kyle Cox; Nianbo Dong – Society for Research on Educational Effectiveness, 2024

Purpose: Multisite mediation studies are a cornerstone in mapping out developmental processes because they probe the mechanisms of a treatment while creating key opportunities to learn from and about variation in those mechanisms across sites. Despite the prevalence of multisite studies, a significant gap in the literature is how to plan such…

Descriptors: Randomized Controlled Trials, Mediation Theory, Statistical Analysis, Robustness (Statistics)

On the Importance of Coefficient Alpha for Measurement Research: Loading Equality Is Not Necessary for Alpha's Utility as a Scale Reliability Index

Peer reviewed

Direct link

Raykov, Tenko; Anthony, James C.; Menold, Natalja – Educational and Psychological Measurement, 2023

The population relationship between coefficient alpha and scale reliability is studied in the widely used setting of unidimensional multicomponent measuring instruments. It is demonstrated that for any set of component loadings on the common factor, regardless of the extent of their inequality, the discrepancy between alpha and reliability can be…

Descriptors: Correlation, Evaluation Research, Reliability, Measurement Techniques

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

Twenty Years of Network Meta-Analysis: Continuing Controversies and Recent Developments

Peer reviewed

Direct link

A. E. Ades; Nicky J. Welton; Sofia Dias; David M. Phillippo; Deborah M. Caldwell – Research Synthesis Methods, 2024

Network meta-analysis (NMA) is an extension of pairwise meta-analysis (PMA) which combines evidence from trials on multiple treatments in connected networks. NMA delivers internally consistent estimates of relative treatment efficacy, needed for rational decision making. Over its first 20 years NMA's use has grown exponentially, with applications…

Descriptors: Network Analysis, Meta Analysis, Medicine, Clinical Experience

Application of Model Averaging for Measurement in the Presence of Unknown Familiarization Phase or Fatigue Phase

Peer reviewed

Direct link

Steven Kim; Stephanie Lara-Sotelo; Eric Martin – Measurement in Physical Education and Exercise Science, 2024

A number of familiarization trials are needed for reliable measurement, particularly for inexperienced subjects. Researchers have studied and developed familiarization protocols that vary by exercise and study population. The pace of familiarization and fatigue may be an individual-level characteristic, so a population-level protocol may not fit…

Descriptors: Familiarity, Physical Education, Fatigue (Biology), Reliability

Six Solutions for More Reliable Infant Research

Peer reviewed

Direct link

Byers-Heinlein, Krista; Bergmann, Christina; Savalei, Victoria – Infant and Child Development, 2022

Infant research is often underpowered, undermining the robustness and replicability of our findings. Improving the reliability of infant studies offers a solution for increasing statistical power independent of sample size. Here, we discuss two senses of the term reliability in the context of infant research: reliable (large) effects and reliable…

Descriptors: Infants, Research, Reliability, Effect Size

"statcheck": Automatically Detect Statistical Reporting Inconsistencies to Increase Reproducibility of Meta-Analyses

Peer reviewed

Direct link

Nuijten, Michèle B.; Polanin, Joshua R. – Research Synthesis Methods, 2020

We present the R package and web app "statcheck" to automatically detect statistical reporting inconsistencies in primary studies and meta-analyses. Previous research has shown a high prevalence of reported p-values that are inconsistent--meaning a re-calculated p-value, based on the reported test statistic and degrees of freedom, does…

Descriptors: Meta Analysis, Statistical Analysis, Reliability, Replication (Evaluation)

Designing and Evaluating Tasks to Measure Individual Differences in Experimental Psychology: A Tutorial

Peer reviewed

Direct link

Marc Brysbaert – Cognitive Research: Principles and Implications, 2024

Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…

Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis

Adaptive Pairwise Comparison for Educational Measurement

Peer reviewed

Direct link

Crompvoets, Elise A. V.; Béguin, Anton A.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2020

Pairwise comparison is becoming increasingly popular as a holistic measurement method in education. Unfortunately, many comparisons are required for reliable measurement. To reduce the number of required comparisons, we developed an adaptive selection algorithm (ASA) that selects the most informative comparisons while taking the uncertainty of the…

Descriptors: Comparative Analysis, Statistical Analysis, Mathematics, Measurement

Validation of Rubric Evaluation for Programming Education

Peer reviewed
PDF on ERIC

Download full text

Saito, Daisuke; Yajima, Risei; Washizaki, Hironori; Fukazawa, Yoshiaki – Education Sciences, 2021

In evaluating the learning achievement of programming-thinking skills, the method of using a rubric that describes evaluation items and evaluation stages is widely employed. However, few studies have evaluated the reliability, validity, and consistency of the rubrics themselves. In this study, we introduced a statistical method for evaluating the…

Descriptors: Scoring Rubrics, Computer Science Education, Programming, Reliability

The Mathematical Quality of Instruction (MQI) in Kindergarten: An Evaluation of the Stability of the MQI Using Generalizability Theory

Peer reviewed

Direct link

Mantzicopoulos, Panayota; French, Brian F.; Patrick, Helen – Early Education and Development, 2018

Research Findings: We evaluated the score stability of the Mathematical Quality of Instruction (MQI), an observational measure of mathematics instruction. Three raters each scored, independently, 100 video-recorded lessons taught by 20 kindergarten teachers in the spring. Using generalizability theory analyses, we decomposed the MQI's score…

Descriptors: Kindergarten, Mathematics Instruction, Educational Quality, Classroom Observation Techniques

Estimating Hazard Ratios from Published Kaplan-Meier Survival Curves: A Methods Validation Study

Peer reviewed

Direct link

Saluja, Ronak; Cheng, Sierra; delos Santos, Keemo Althea; Chan, Kelvin K. W. – Research Synthesis Methods, 2019

Objective: Various statistical methods have been developed to estimate hazard ratios (HRs) from published Kaplan-Meier (KM) curves for the purpose of performing meta-analyses. The objective of this study was to determine the reliability, accuracy, and precision of four commonly used methods by Guyot, Williamson, Parmar, and Hoyle and Henley.…

Descriptors: Meta Analysis, Reliability, Accuracy, Randomized Controlled Trials

Assessing the Consistency Assumptions Underlying Network Meta-Regression Using Aggregate Data

Peer reviewed

Direct link

Donegan, Sarah; Dias, Sofia; Welton, Nicky J. – Research Synthesis Methods, 2019

When numerous treatments exist for a disease (Treatments 1, 2, 3, etc), network meta-regression (NMR) examines whether each relative treatment effect (eg, mean difference for 2 vs 1, 3 vs 1, and 3 vs 2) differs according to a covariate (eg, disease severity). Two consistency assumptions underlie NMR: consistency of the treatment effects at the…

Descriptors: Reliability, Regression (Statistics), Outcomes of Treatment, Statistical Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 86

ProQuest LLC	58
Journal of Education and…	47
Educational Research and…	37
Educational and Psychological…	28
Online Submission	28
International Education…	18
Research on Social Work…	17
English Language Teaching	16
Journal of Education and…	16
Applied Psychological…	11
ETS Research Report Series	11
Education	11
Higher Education Studies	11
Measurement in Physical…	11
Research Quarterly for…	11
EURASIA Journal of…	10
Educational Sciences: Theory…	10
Psychological Assessment	10
Eurasian Journal of…	9
First Language	9
International Journal of…	9
Journal of Experimental…	9
Universal Journal of…	9
Assessment & Evaluation in…	8
Journal of Education and…	8
More ▼

Price, Gary G.	12
Alonzo, Julie	4
Tindal, Gerald	4
Anderson, Daniel	3
Brennan, Robert L.	3
Fan, Xitao	3
Fletcher, Jack M.	3
Forsyth, Robert A.	3
Hakstian, A. Ralph	3
Knapp, Thomas R.	3
Lai, Cheng-Fei	3
Liou, Pey-Yan	3
Miciak, Jeremy	3
Nese, Joseph F. T.	3
Padilla, Miguel A.	3
Raykov, Tenko	3
Stuebing, Karla K.	3
Abdi, Ali	2
Abell, Neil	2
Acar-Ciftci, Yasemin	2
Aryadoust, Vahid	2
Attali, Yigal	2
Bodkin-Andrews, Gawaian H.	2
More ▼

Journal Articles	985
Reports - Research	951
Reports - Evaluative	102
Tests/Questionnaires	87
Dissertations/Theses -…	59
Reports - Descriptive	46
Speeches/Meeting Papers	45
Information Analyses	40
Numerical/Quantitative Data	16
Opinion Papers	14
Guides - Non-Classroom	10
Books	6
Guides - Classroom - Learner	4
Guides - General	3
Reports - General	3
Book/Product Reviews	2
Collected Works - General	2
Collected Works - Proceedings	1
Collected Works - Serial	1
Dissertations/Theses -…	1
Guides - Classroom - Teacher	1
Historical Materials	1
Legal/Legislative/Regulatory…	1
Non-Print Media	1
More ▼

Motivated Strategies for…	6
Stanford Achievement Tests	6
Autism Diagnostic Observation…	4
Marlowe Crowne Social…	4
Peabody Picture Vocabulary…	4
SAT (College Admission Test)	4
Strengths and Difficulties…	4
Test of English as a Foreign…	4
Torrance Tests of Creative…	4
Beck Depression Inventory	3
Childrens Manifest Anxiety…	3
Clinical Evaluation of…	3
Learning Style Inventory	3
National Longitudinal Study…	3
National Longitudinal Survey…	3
Wechsler Intelligence Scale…	3
Woodcock Johnson Tests of…	3
ACT Assessment	2
Behavior Assessment System…	2
Center for Epidemiologic…	2
Child Behavior Checklist	2
Early Childhood Environment…	2
Early Childhood Longitudinal…	2
Eysenck Personality Inventory	2
Flesch Kincaid Grade Level…	2
More ▼