Wheeler, Jordan M.; Cohen, Allan S.; Wang, Shiyu – Journal of Educational and Behavioral Statistics, 2024
Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…
Descriptors: Semantics, Educational Assessment, Evaluators, Reliability
Lind, Veronika; Svensson, Melanie; Harringe, Marita L. – Measurement in Physical Education and Exercise Science, 2022
Goniometry is commonly used to evaluate joint range of motion (ROM). The most widespread method, a manual universal goniometer (UG), is considered time-consuming and difficult to handle. The digital goniometer EasyAngle (EA) was developed to improve and simplify the evaluation of ROM. This study aimed to evaluate the reliability and validity of EA…
Descriptors: Motor Reactions, Measurement Techniques, Comparative Analysis, Measurement Equipment
A Comparison of Procedures for Estimating Person Reliability Parameters in the Graded Response Model
LaHuis, David M.; Bryant-Lees, Kinsey B.; Hakoyama, Shotaro; Barnes, Tyler; Wiemann, Andrea – Journal of Educational Measurement, 2018
Person reliability parameters (PRPs) model temporary changes in individuals' attribute level perceptions when responding to self-report items (higher levels of PRPs represent less fluctuation). PRPs could be useful in measuring careless responding and traitedness. However, it is unclear how well current procedures for estimating PRPs can recover…
Descriptors: Comparative Analysis, Reliability, Error of Measurement, Measurement Techniques
Arslan Mancar, Sinem; Gulleroglu, H. Deniz – International Journal of Assessment Tools in Education, 2022
The aim of this study is to analyse the importance of the number of raters and compare the results obtained by techniques based on Classical Test Theory (CTT) and Generalizability (G) Theory. The Kappa and Krippendorff alpha techniques based on CTT were used to determine the inter-rater reliability. In this descriptive research data consists of…
Descriptors: Comparative Analysis, Interrater Reliability, Advanced Placement, Scoring Rubrics
Pavelko, Stacey L.; Price, Larry R.; Owens, Robert E. – Language, Speech, and Hearing Services in Schools, 2020
Purpose: The goal of this study was to determine whether the results obtained from a 25-utterance conversational language sample were as reliable as those obtained from a 50-utterance sample. Method: Robust conversational language samples from 220 children with typically developing language (106 boys, 114 girls) ranging in age from 3;2 to 7;10…
Descriptors: Grammar, Sampling, Speech Communication, Preschool Children
Bailey, Bruce W.; LeCheminant, Gabrielle; Hope, Timothy; Bell, Mathew; Tucker, Larry A. – Measurement in Physical Education and Exercise Science, 2018
The study compared the agreement, internal consistency, and measurement stability of the GE iDXA, BOD POD, and InBody 720. Body composition of 43 men and 37 women (31.4 ± 10.7 years; 90% Caucasian and 10% other) was assessed in triplicate using each method over two different days. Mean percent body fat (% BF) of the participants was different for…
Descriptors: Body Composition, Measurement Equipment, Reliability, Comparative Analysis
McClaskey, Carolyn M.; Dias, James W.; Dubno, Judy R.; Harris, Kelly C. – Journal of Speech, Language, and Hearing Research, 2018
Purpose: Human auditory nerve (AN) activity estimated from the amplitude of the first prominent negative peak (N1) of the compound action potential (CAP) is typically quantified using either a peak-to-peak measurement or a baseline-corrected measurement. However, the reliability of these 2 common measurement techniques has not been evaluated but…
Descriptors: Comparative Analysis, Correlation, Measurement Techniques, Test Reliability
Moeller, Julia; Viljaranta, Jaana; Kracke, Bärbel; Dietrich, Julia – Frontline Learning Research, 2020
This article proposes a study design developed to disentangle the objective characteristics of a learning situation from individuals' subjective perceptions of that situation. The term objective characteristics refers to the agreement across students, whereas subjective perceptions refers to inter-individual heterogeneity. We describe a novel…
Descriptors: Student Attitudes, College Students, Lecture Method, Student Interests
Tobar-Henríquez, Anita; Rabagliati, Hugh; Branigan, Holly P. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2020
Language use is intrinsically variable, such that the words we use vary widely across speakers and communicative situations. For instance, we can call the same entity "refrigerator" or "fridge." However, attempts to understand individual differences in how we process language have made surprisingly little progress, perhaps…
Descriptors: Individual Differences, Language Processing, Pictorial Stimuli, Language Usage
Wu, Shu-Ling; Tio, Yee Pin; Ortega, Lourdes – Studies in Second Language Acquisition, 2022
Elicited imitation (EI), a short-cut measure of global proficiency in second language (L2) research, requires participants to listen to sentences and repeat them as closely as possible. To support instrument sharing and assessment of L2 proficiency for longitudinal and crosslinguistic research, we created a parallel form of an EI task (EIT) for L2…
Descriptors: Imitation, Second Language Learning, Second Language Instruction, Language Proficiency
McKie, Greg L.; Islam, Hashim; Townsend, Logan K.; Howe, Greg J.; Hazell, Tom J. – Measurement in Physical Education and Exercise Science, 2018
This study examined the validity and reliability of a 30-second running sprint test using two non-motorized treadmills compared to the established Wingate Anaerobic Test. Twenty-four participants completed three sessions in a randomized order on a: (1) manual mode treadmill (Woodway); (2) specialized interval training treadmill (HiTrainer); and…
Descriptors: Exercise, Physical Activities, Correlation, Exercise Physiology
Prihar, Ethan; Heffernan, Neil – International Educational Data Mining Society, 2021
Similar content has tremendous utility in classroom and online learning environments. For example, similar content can be used to combat cheating, track students' learning over time, and model students' latent knowledge. These different use cases for similar content all rely on different notions of similarity, which make it difficult to determine…
Descriptors: Computer Software, Middle School Teachers, Mathematics Teachers, College Students
Park, Jinhee; Pados, Britt Frisk; Thoyre, Suzanne M.; Estrem, Hayley H.; McComish, Cara – Journal of Early Intervention, 2019
The purpose of this study was to identify the factor structure of the Child Oral and Motor Proficiency Scale (ChOMPS) and to evaluate the psychometric properties, including internal consistency reliability, test-retest reliability, and construct validity as measured by convergent and known-groups validity. Principal component analysis with varimax…
Descriptors: Factor Structure, Factor Analysis, Psychometrics, Reliability
Ford, Jeremy W.; Conoyer, Sarah J.; Lembke, Erica S.; Smith, R. Alex; Hosp, John L. – Assessment for Effective Intervention, 2018
In the present study, two types of curriculum-based measurement (CBM) tools in science, Vocabulary Matching (VM) and Statement Verification for Science (SV-S), a modified Sentence Verification Technique, were compared. Specifically, this study aimed to determine whether the format of information presented (i.e., SV-S vs. VM) produces differences…
Descriptors: Curriculum Based Assessment, Evaluation Methods, Measurement Techniques, Comparative Analysis
Kim, Ha Yeon; Gjicali, Kalina; Wu, Zezhen; Tubbs Dolan, Carly – Journal on Education in Emergencies, 2021
Rigorous evaluation of social and emotional learning programs requires the use of measures that provide reliable and valid information on the meaningful differences in children's social emotional skills across treatment and control groups, as well as changes over time. In contexts affected by conflict and crisis, few measures can provide the…
Descriptors: Teacher Attitudes, Social Emotional Learning, Psychometrics, Conflict