Publication Date
In 2025 | 0 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 15 |
Since 2016 (last 10 years) | 388 |
Since 2006 (last 20 years) | 998 |
Descriptor
Reliability | 1276 |
Statistical Analysis | 1276 |
Foreign Countries | 528 |
Validity | 485 |
Correlation | 339 |
Questionnaires | 301 |
Measures (Individuals) | 280 |
Factor Analysis | 250 |
Student Attitudes | 187 |
Scores | 164 |
Comparative Analysis | 160 |
More ▼ |
Source
Author
Price, Gary G. | 12 |
Alonzo, Julie | 4 |
Tindal, Gerald | 4 |
Anderson, Daniel | 3 |
Brennan, Robert L. | 3 |
Fan, Xitao | 3 |
Fletcher, Jack M. | 3 |
Forsyth, Robert A. | 3 |
Hakstian, A. Ralph | 3 |
Knapp, Thomas R. | 3 |
Lai, Cheng-Fei | 3 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 16 |
Practitioners | 9 |
Teachers | 6 |
Students | 5 |
Administrators | 4 |
Counselors | 2 |
Parents | 1 |
Policymakers | 1 |
Location
Turkey | 105 |
Nigeria | 51 |
Taiwan | 25 |
Jordan | 24 |
Australia | 19 |
Canada | 17 |
Iran | 16 |
India | 14 |
Florida | 13 |
Greece | 12 |
China | 11 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 6 |
Individuals with Disabilities… | 4 |
Race to the Top | 2 |
Americans with Disabilities… | 1 |
Debra P v Turlington | 1 |
Reading Excellence Act | 1 |
Rehabilitation Act 1973… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
Razavipour, Kioumars; Raji, Behnaz – Language Testing in Asia, 2022
The credibility of conclusions arrived at in quantitative research depends, to a large extent, on the quality of data collection instruments used to quantify language and non-language constructs. Despite this, research into data collection instruments used in Applied Linguistics and particularly in the thesis genre remains limited. This study…
Descriptors: Applied Linguistics, Test Reliability, Language Tests, Credibility
Fangxing Bai; Ben Kelcey; Amota Ataneka; Yanli Xie; Kyle Cox; Nianbo Dong – Society for Research on Educational Effectiveness, 2024
Purpose: Multisite mediation studies are a cornerstone in mapping out developmental processes because they probe the mechanisms of a treatment while creating key opportunities to learn from and about variation in those mechanisms across sites. Despite the prevalence of multisite studies, a significant gap in the literature is how to plan such…
Descriptors: Randomized Controlled Trials, Mediation Theory, Statistical Analysis, Robustness (Statistics)
Raykov, Tenko; Anthony, James C.; Menold, Natalja – Educational and Psychological Measurement, 2023
The population relationship between coefficient alpha and scale reliability is studied in the widely used setting of unidimensional multicomponent measuring instruments. It is demonstrated that for any set of component loadings on the common factor, regardless of the extent of their inequality, the discrepancy between alpha and reliability can be…
Descriptors: Correlation, Evaluation Research, Reliability, Measurement Techniques
Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022
Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…
Descriptors: Reliability, Scores, Scaling, Statistical Analysis
A. E. Ades; Nicky J. Welton; Sofia Dias; David M. Phillippo; Deborah M. Caldwell – Research Synthesis Methods, 2024
Network meta-analysis (NMA) is an extension of pairwise meta-analysis (PMA) which combines evidence from trials on multiple treatments in connected networks. NMA delivers internally consistent estimates of relative treatment efficacy, needed for rational decision making. Over its first 20 years NMA's use has grown exponentially, with applications…
Descriptors: Network Analysis, Meta Analysis, Medicine, Clinical Experience
Steven Kim; Stephanie Lara-Sotelo; Eric Martin – Measurement in Physical Education and Exercise Science, 2024
A number of familiarization trials are needed for reliable measurement, particularly for inexperienced subjects. Researchers have studied and developed familiarization protocols that vary by exercise and study population. The pace of familiarization and fatigue may be an individual-level characteristic, so a population-level protocol may not fit…
Descriptors: Familiarity, Physical Education, Fatigue (Biology), Reliability
Byers-Heinlein, Krista; Bergmann, Christina; Savalei, Victoria – Infant and Child Development, 2022
Infant research is often underpowered, undermining the robustness and replicability of our findings. Improving the reliability of infant studies offers a solution for increasing statistical power independent of sample size. Here, we discuss two senses of the term reliability in the context of infant research: reliable (large) effects and reliable…
Descriptors: Infants, Research, Reliability, Effect Size
Nuijten, Michèle B.; Polanin, Joshua R. – Research Synthesis Methods, 2020
We present the R package and web app "statcheck" to automatically detect statistical reporting inconsistencies in primary studies and meta-analyses. Previous research has shown a high prevalence of reported p-values that are inconsistent--meaning a re-calculated p-value, based on the reported test statistic and degrees of freedom, does…
Descriptors: Meta Analysis, Statistical Analysis, Reliability, Replication (Evaluation)
Marc Brysbaert – Cognitive Research: Principles and Implications, 2024
Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…
Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis
Crompvoets, Elise A. V.; Béguin, Anton A.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2020
Pairwise comparison is becoming increasingly popular as a holistic measurement method in education. Unfortunately, many comparisons are required for reliable measurement. To reduce the number of required comparisons, we developed an adaptive selection algorithm (ASA) that selects the most informative comparisons while taking the uncertainty of the…
Descriptors: Comparative Analysis, Statistical Analysis, Mathematics, Measurement
Saito, Daisuke; Yajima, Risei; Washizaki, Hironori; Fukazawa, Yoshiaki – Education Sciences, 2021
In evaluating the learning achievement of programming-thinking skills, the method of using a rubric that describes evaluation items and evaluation stages is widely employed. However, few studies have evaluated the reliability, validity, and consistency of the rubrics themselves. In this study, we introduced a statistical method for evaluating the…
Descriptors: Scoring Rubrics, Computer Science Education, Programming, Reliability
Mantzicopoulos, Panayota; French, Brian F.; Patrick, Helen – Early Education and Development, 2018
Research Findings: We evaluated the score stability of the Mathematical Quality of Instruction (MQI), an observational measure of mathematics instruction. Three raters each scored, independently, 100 video-recorded lessons taught by 20 kindergarten teachers in the spring. Using generalizability theory analyses, we decomposed the MQI's score…
Descriptors: Kindergarten, Mathematics Instruction, Educational Quality, Classroom Observation Techniques
Saluja, Ronak; Cheng, Sierra; delos Santos, Keemo Althea; Chan, Kelvin K. W. – Research Synthesis Methods, 2019
Objective: Various statistical methods have been developed to estimate hazard ratios (HRs) from published Kaplan-Meier (KM) curves for the purpose of performing meta-analyses. The objective of this study was to determine the reliability, accuracy, and precision of four commonly used methods by Guyot, Williamson, Parmar, and Hoyle and Henley.…
Descriptors: Meta Analysis, Reliability, Accuracy, Randomized Controlled Trials
Donegan, Sarah; Dias, Sofia; Welton, Nicky J. – Research Synthesis Methods, 2019
When numerous treatments exist for a disease (Treatments 1, 2, 3, etc), network meta-regression (NMR) examines whether each relative treatment effect (eg, mean difference for 2 vs 1, 3 vs 1, and 3 vs 2) differs according to a covariate (eg, disease severity). Two consistency assumptions underlie NMR: consistency of the treatment effects at the…
Descriptors: Reliability, Regression (Statistics), Outcomes of Treatment, Statistical Analysis