NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)97
Since 2006 (last 20 years)236
What Works Clearinghouse Rating
Does not meet standards1
Showing 1 to 15 of 321 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Crompvoets, Elise A. V.; Béguin, Anton A.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2020
Pairwise comparison is becoming increasingly popular as a holistic measurement method in education. Unfortunately, many comparisons are required for reliable measurement. To reduce the number of required comparisons, we developed an adaptive selection algorithm (ASA) that selects the most informative comparisons while taking the uncertainty of the…
Descriptors: Comparative Analysis, Statistical Analysis, Mathematics, Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Saluja, Ronak; Cheng, Sierra; delos Santos, Keemo Althea; Chan, Kelvin K. W. – Research Synthesis Methods, 2019
Objective: Various statistical methods have been developed to estimate hazard ratios (HRs) from published Kaplan-Meier (KM) curves for the purpose of performing meta-analyses. The objective of this study was to determine the reliability, accuracy, and precision of four commonly used methods by Guyot, Williamson, Parmar, and Hoyle and Henley.…
Descriptors: Meta Analysis, Reliability, Accuracy, Randomized Controlled Trials
Peer reviewed Peer reviewed
Direct linkDirect link
De Raadt, Alexandra; Warrens, Matthijs J.; Bosker, Roel J.; Kiers, Henk A. L. – Educational and Psychological Measurement, 2019
Cohen's kappa coefficient is commonly used for assessing agreement between classifications of two raters on a nominal scale. Three variants of Cohen's kappa that can handle missing data are presented. Data are considered missing if one or both ratings of a unit are missing. We study how well the variants estimate the kappa value for complete data…
Descriptors: Interrater Reliability, Data, Statistical Analysis, Statistical Bias
Benton, Tom; Leech, Tony; Hughes, Sarah – Cambridge Assessment, 2020
In the context of examinations, the phrase "maintaining standards" usually refers to any activity designed to ensure that it is no easier (or harder) to achieve a given grade in one year than in another. Specifically, it tends to mean activities associated with setting examination grade boundaries. Benton et al (2020) describes a method…
Descriptors: Mathematics Tests, Equated Scores, Comparative Analysis, Difficulty Level
Peer reviewed Peer reviewed
Direct linkDirect link
Bailey, Bruce W.; LeCheminant, Gabrielle; Hope, Timothy; Bell, Mathew; Tucker, Larry A. – Measurement in Physical Education and Exercise Science, 2018
The study compared the agreement, internal consistency, and measurement stability of the GE iDXA, BOD POD, and InBody 720. Body composition of 43 men and 37 women (31.4 ± 10.7 years; 90% Caucasian and 10% other) was assessed in triplicate using each method over two different days. Mean percent body fat (% BF) of the participants was different for…
Descriptors: Body Composition, Measurement Equipment, Reliability, Comparative Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Thawabieh, Ahmad M. – Journal of Curriculum and Teaching, 2017
This study aimed to compare between the students' self-assessment and teachers' assessment. The study sample consisted of 71 students at Tafila Technical University studying Introduction to Psychology course. The researcher used 2 students' self-assessment tools and 2 tests. The results indicated that students can assess themselves accurately if…
Descriptors: Comparative Analysis, Self Evaluation (Individuals), Student Evaluation, Psychology
Peer reviewed Peer reviewed
Direct linkDirect link
Al-Hoorie, Ali H.; Vitta, Joseph P. – Language Teaching Research, 2019
This report presents a review of the statistical practices of 30 journals representative of the second language field. A review of 150 articles showed a number of prevalent statistical violations including incomplete reporting of reliability, validity, non-significant results, effect sizes, and assumption checks as well as making inferences from…
Descriptors: Periodicals, Second Language Learning, Second Language Instruction, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Grané, Aurea; Romera, Rosario – Sociological Methods & Research, 2018
Survey data are usually of mixed type (quantitative, multistate categorical, and/or binary variables). Multidimensional scaling (MDS) is one of the most extended methodologies to visualize the profile structure of the data. Since the past 60s, MDS methods have been introduced in the literature, initially in publications in the psychometrics area.…
Descriptors: Surveys, Data, Multidimensional Scaling, Robustness (Statistics)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Lehan, Tara; Hussey, Heather; Mika, Eva – Journal of University Teaching and Learning Practice, 2016
Throughout the dissertation process, the chair and committee members provide feedback regarding quality to help the doctoral candidate to produce the highest-quality document and become an independent scholar. Nevertheless, results of previous research suggest that overall dissertation quality generally is poor. Because much of the feedback about…
Descriptors: Graduate Students, Doctoral Dissertations, Student Evaluation, Feedback (Response)
Peer reviewed Peer reviewed
Direct linkDirect link
Kieftenbeld, Vincent; Boyer, Michelle – Applied Measurement in Education, 2017
Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…
Descriptors: Automation, Scoring, Comparative Analysis, Test Items
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Jaikaew, Pimpilai; Damrongpanit, Suntonrapot – Universal Journal of Educational Research, 2018
The research was designed to examine the effects of question setting using different conditions into 10 sets on the validity of structural equation modeling for factors affecting job morale. The data was collected from 690 personnel working in regional Statistical Offices around Thailand by using cluster random sampling. The tool used in…
Descriptors: Structural Equation Models, Questionnaires, Reliability, Multivariate Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
McKie, Greg L.; Islam, Hashim; Townsend, Logan K.; Howe, Greg J.; Hazell, Tom J. – Measurement in Physical Education and Exercise Science, 2018
This study examined the validity and reliability of a 30-second running sprint test using two non-motorized treadmills compared to the established Wingate Anaerobic Test. Twenty-four participants completed three sessions in a randomized order on a: (1) manual mode treadmill (Woodway); (2) specialized interval training treadmill (HiTrainer); and…
Descriptors: Exercise, Physical Activities, Correlation, Exercise Physiology
Peer reviewed Peer reviewed
Direct linkDirect link
Kelleher, Leila K.; Beach, Tyson A. C.; Frost, David M.; Johnson, Andrew M.; Dickey, James P. – Measurement in Physical Education and Exercise Science, 2018
The scoring scheme for the functional movement screen implicitly assumes that the factor structure is consistent, stable, and congruent across different populations. To determine if this is the case, we compared principal components analyses of three samples: a healthy, general population (n = 100), a group of varsity athletes (n = 101), and a…
Descriptors: Factor Structure, Test Reliability, Screening Tests, Motion
Peer reviewed Peer reviewed
Direct linkDirect link
Pulford, Briony D.; Woodward, Bethan; Taylor, Eve – Social Psychology of Education: An International Journal, 2018
This paper reports the development of an Academic Social Comparison Scale (ASCS) to measure students' tendencies to socially compare themselves with other students in an educational setting. The 27-item ASCS was then measured in relation to academic self-confidence in a sample of University students, using the Individual Learning Profile…
Descriptors: Measures (Individuals), Comparative Analysis, College Students, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Padilla, Miguel A.; Divers, Jasmin – Educational and Psychological Measurement, 2016
Coefficient omega and alpha are both measures of the composite reliability for a set of items. Unlike coefficient alpha, coefficient omega remains unbiased with congeneric items with uncorrelated errors. Despite this ability, coefficient omega is not as widely used and cited in the literature as coefficient alpha. Reasons for coefficient omega's…
Descriptors: Reliability, Computation, Statistical Analysis, Comparative Analysis
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  22