ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	23
Since 2006 (last 20 years)	61

Descriptor

Comparative Analysis	124
Measurement Techniques	124
Test Reliability	57
Reliability	55
Test Validity	35
Statistical Analysis	31
Foreign Countries	27
Correlation	26
Validity	22
Evaluation Methods	18
Interrater Reliability	18
Psychometrics	17
Test Construction	15
Scores	13
Factor Analysis	12
Questionnaires	12
Rating Scales	12
Adults	10
Item Analysis	10
Sampling	10
Student Evaluation	10
Higher Education	9
Measures (Individuals)	9
Data Analysis	8
Achievement Tests	7
More ▼

Publication Type

Reports - Research	79
Journal Articles	73
Reports - Evaluative	22
Speeches/Meeting Papers	11
Opinion Papers	5
Dissertations/Theses -…	4
Tests/Questionnaires	4
Reports - Descriptive	3
Numerical/Quantitative Data	2
Collected Works - Proceedings	1
Information Analyses	1
More ▼

Education Level

Higher Education	16
Postsecondary Education	11
Secondary Education	9
Elementary Education	7
High Schools	5
Middle Schools	5
Elementary Secondary Education	3
Grade 7	2
Intermediate Grades	2
Junior High Schools	2
Adult Education	1
Early Childhood Education	1
Grade 2	1
Grade 5	1
Grade 6	1
Grade 8	1
Primary Education	1
More ▼

Audience

Researchers	9
Administrators	1
Practitioners	1

Location

Australia	4
China	3
United States	3
Germany	2
Hong Kong	2
Portugal	2
Sweden	2
Taiwan	2
Asia	1
Brazil	1
Canada	1
Chile (Santiago)	1
European Union	1
Iceland	1
Kansas	1
Lebanon	1
Maryland	1
Netherlands	1
New York (New York)	1
New Zealand	1
Pakistan	1
Singapore	1
South Carolina	1
Spain	1
Syria	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Autism Diagnostic Observation…	2
Childrens Manifest Anxiety…	2
SAT (College Admission Test)	2
ACT Assessment	1
Basic Reading Inventory	1
Early Childhood Longitudinal…	1
General Educational…	1
Group Assessment of Logical…	1
Kaufman Brief Intelligence…	1
Minnesota Multiphasic…	1
Motivated Strategies for…	1
NEO Personality Inventory	1
Self Description Questionnaire	1
Self Perception Profile for…	1
Test of Science Related…	1
Torrance Tests of Creative…	1
Wechsler Intelligence Scale…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 124 results Save | Export

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Reliability and Validity of a Digital Goniometer for Measuring Knee Joint Range of Motion

Peer reviewed

Direct link

Lind, Veronika; Svensson, Melanie; Harringe, Marita L. – Measurement in Physical Education and Exercise Science, 2022

Goniometry is commonly used to evaluate joint range of motion (ROM). The most widespread method, a manual universal goniometer (UG), is considered time-consuming and difficult to handle. The digital goniometer EasyAngle (EA) was developed to improve and simplify the evaluation of ROM. This study aimed to evaluate the reliability and validity of EA…

Descriptors: Motor Reactions, Measurement Techniques, Comparative Analysis, Measurement Equipment

A Comparison of Procedures for Estimating Person Reliability Parameters in the Graded Response Model

Peer reviewed

Direct link

LaHuis, David M.; Bryant-Lees, Kinsey B.; Hakoyama, Shotaro; Barnes, Tyler; Wiemann, Andrea – Journal of Educational Measurement, 2018

Person reliability parameters (PRPs) model temporary changes in individuals' attribute level perceptions when responding to self-report items (higher levels of PRPs represent less fluctuation). PRPs could be useful in measuring careless responding and traitedness. However, it is unclear how well current procedures for estimating PRPs can recover…

Descriptors: Comparative Analysis, Reliability, Error of Measurement, Measurement Techniques

Comparison of Inter-Rater Reliability Techniques in Performance-Based Assessment

Peer reviewed
PDF on ERIC

Download full text

Arslan Mancar, Sinem; Gulleroglu, H. Deniz – International Journal of Assessment Tools in Education, 2022

The aim of this study is to analyse the importance of the number of raters and compare the results obtained by techniques based on Classical Test Theory (CTT) and Generalizability (G) Theory. The Kappa and Krippendorff alpha techniques based on CTT were used to determine the inter-rater reliability. In this descriptive research data consists of…

Descriptors: Comparative Analysis, Interrater Reliability, Advanced Placement, Scoring Rubrics

Revisiting Reliability: Using Sampling Utterances and Grammatical Analysis Revised (SUGAR) to Compare 25- and 50-Utterance Language Samples

Peer reviewed

Direct link

Pavelko, Stacey L.; Price, Larry R.; Owens, Robert E. – Language, Speech, and Hearing Services in Schools, 2020

Purpose: The goal of this study was to determine whether the results obtained from a 25-utterance conversational language sample were as reliable as those obtained from a 50-utterance sample. Method: Robust conversational language samples from 220 children with typically developing language (106 boys, 114 girls) ranging in age from 3;2 to 7;10…

Descriptors: Grammar, Sampling, Speech Communication, Preschool Children

A Comparison of the Agreement, Internal Consistency, and 2-Day Test Stability of the InBody 720, GE iDXA, and BOD POD® Gold Standard for Assessing Body Composition

Peer reviewed

Direct link

Bailey, Bruce W.; LeCheminant, Gabrielle; Hope, Timothy; Bell, Mathew; Tucker, Larry A. – Measurement in Physical Education and Exercise Science, 2018

The study compared the agreement, internal consistency, and measurement stability of the GE iDXA, BOD POD, and InBody 720. Body composition of 43 men and 37 women (31.4 ± 10.7 years; 90% Caucasian and 10% other) was assessed in triplicate using each method over two different days. Mean percent body fat (% BF) of the participants was different for…

Descriptors: Body Composition, Measurement Equipment, Reliability, Comparative Analysis

Reliability of Measures of N1 Peak Amplitude of the Compound Action Potential in Younger and Older Adults

Peer reviewed

Direct link

McClaskey, Carolyn M.; Dias, James W.; Dubno, Judy R.; Harris, Kelly C. – Journal of Speech, Language, and Hearing Research, 2018

Purpose: Human auditory nerve (AN) activity estimated from the amplitude of the first prominent negative peak (N1) of the compound action potential (CAP) is typically quantified using either a peak-to-peak measurement or a baseline-corrected measurement. However, the reliability of these 2 common measurement techniques has not been evaluated but…

Descriptors: Comparative Analysis, Correlation, Measurement Techniques, Test Reliability

Disentangling Objective Characteristics of Learning Situations from Subjective Perceptions Thereof, Using an Experience Sampling Method Design

Peer reviewed
PDF on ERIC

Download full text

Moeller, Julia; Viljaranta, Jaana; Kracke, Bärbel; Dietrich, Julia – Frontline Learning Research, 2020

This article proposes a study design developed to disentangle the objective characteristics of a learning situation from individuals' subjective perceptions of that situation. The term objective characteristics refers to the agreement across students, whereas subjective perceptions refers to inter-individual heterogeneity. We describe a novel…

Descriptors: Student Attitudes, College Students, Lecture Method, Student Interests

Lexical Entrainment Reflects a Stable Individual Trait: Implications for Individual Differences in Language Processing

Peer reviewed

Direct link

Tobar-Henríquez, Anita; Rabagliati, Hugh; Branigan, Holly P. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2020

Language use is intrinsically variable, such that the words we use vary widely across speakers and communicative situations. For instance, we can call the same entity "refrigerator" or "fridge." However, attempts to understand individual differences in how we process language have made surprisingly little progress, perhaps…

Descriptors: Individual Differences, Language Processing, Pictorial Stimuli, Language Usage

Elicited Imitation as a Measure of L2 Proficiency: New Insights from a Comparison of Two L2 English Parallel Forms

Peer reviewed

Direct link

Wu, Shu-Ling; Tio, Yee Pin; Ortega, Lourdes – Studies in Second Language Acquisition, 2022

Elicited imitation (EI), a short-cut measure of global proficiency in second language (L2) research, requires participants to listen to sentences and repeat them as closely as possible. To support instrument sharing and assessment of L2 proficiency for longitudinal and crosslinguistic research, we created a parallel form of an EI task (EIT) for L2…

Descriptors: Imitation, Second Language Learning, Second Language Instruction, Language Proficiency

Establishing a Practical Treadmill Sprint as an Alternative to the Wingate Anaerobic Test

Peer reviewed

Direct link

McKie, Greg L.; Islam, Hashim; Townsend, Logan K.; Howe, Greg J.; Hazell, Tom J. – Measurement in Physical Education and Exercise Science, 2018

This study examined the validity and reliability of a 30-second running sprint test using two non-motorized treadmills compared to the established Wingate Anaerobic Test. Twenty-four participants completed three sessions in a randomized order on a: (1) manual mode treadmill (Woodway); (2) specialized interval training treadmill (HiTrainer); and…

Descriptors: Exercise, Physical Activities, Correlation, Exercise Physiology

A Novel Algorithm for Aggregating Crowdsourced Opinions

Peer reviewed
PDF on ERIC

Download full text

Prihar, Ethan; Heffernan, Neil – International Educational Data Mining Society, 2021

Similar content has tremendous utility in classroom and online learning environments. For example, similar content can be used to combat cheating, track students' learning over time, and model students' latent knowledge. These different use cases for similar content all rely on different notions of similarity, which make it difficult to determine…

Descriptors: Computer Software, Middle School Teachers, Mathematics Teachers, College Students

Factor Structure and Psychometric Properties of the Child Oral and Motor Proficiency Scale

Peer reviewed

Direct link

Park, Jinhee; Pados, Britt Frisk; Thoyre, Suzanne M.; Estrem, Hayley H.; McComish, Cara – Journal of Early Intervention, 2019

The purpose of this study was to identify the factor structure of the Child Oral and Motor Proficiency Scale (ChOMPS) and to evaluate the psychometric properties, including internal consistency reliability, test-retest reliability, and construct validity as measured by convergent and known-groups validity. Principal component analysis with varimax…

Descriptors: Factor Structure, Factor Analysis, Psychometrics, Reliability

A Comparison of Two Content Area Curriculum-Based Measurement Tools

Peer reviewed

Direct link

Ford, Jeremy W.; Conoyer, Sarah J.; Lembke, Erica S.; Smith, R. Alex; Hosp, John L. – Assessment for Effective Intervention, 2018

In the present study, two types of curriculum-based measurement (CBM) tools in science, Vocabulary Matching (VM) and Statement Verification for Science (SV-S), a modified Sentence Verification Technique, were compared. Specifically, this study aimed to determine whether the format of information presented (i.e., SV-S vs. VM) produces differences…

Descriptors: Curriculum Based Assessment, Evaluation Methods, Measurement Techniques, Comparative Analysis

Teachers' Observations of Learners' Social and Emotional Learning: Psychometric Evidence for Program Evaluation in Education in Emergencies

Peer reviewed

Direct link

Kim, Ha Yeon; Gjicali, Kalina; Wu, Zezhen; Tubbs Dolan, Carly – Journal on Education in Emergencies, 2021

Rigorous evaluation of social and emotional learning programs requires the use of measures that provide reliable and valid information on the meaningful differences in children's social emotional skills across treatment and control groups, as well as changes over time. In contexts affected by conflict and crisis, few measures can provide the…

Descriptors: Teacher Attitudes, Social Emotional Learning, Psychometrics, Conflict

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Journal of Speech, Language,…	5
Measurement in Physical…	5
ProQuest LLC	4
Applied Psychological…	2
Assessment	2
Assessment & Evaluation in…	2
Educational and Psychological…	2
Frontline Learning Research	2
Journal of Autism and…	2
Journal of Communication…	2
Journal of Consulting and…	2
Research Quarterly	2
Research Quarterly for…	2
Social Indicators Research	2
American Journal of…	1
American Journal on Mental…	1
Assessment for Effective…	1
Autism: The International…	1
British Journal of…	1
Child & Youth Care Forum	1
Child Abuse & Neglect: The…	1
Child Welfare	1
Cogent Education	1
College Board	1
Comparative Education Review	1
More ▼

ANDERSON, JAMES A.	1
Adams, R. J.	1
Albion, Peter R.	1
Allan S. Cohen	1
Allen, Melissa A.	1
Almehrizi, Rashid S.	1
Alsawalmeh, Yousef M.	1
Alvermann, Donna E.	1
Ang, Rebecca P.	1
Argulewicz, Ed N.	1
Arnold, Mariah	1
Arslan Mancar, Sinem	1
Bailey, Bruce W.	1
Baird, Christopher	1
Ballantine, Joan A.	1
Barnes, Tyler	1
Bauman, Kurt J.	1
Bejerot, Susanne	1
Bell, Mathew	1
Bermúdez, María Olga Escandell	1
Betz, Nancy E.	1
Bhola, Dennison S.	1
Bjornsdottir, Gyda	1
Black, Maureen M.	1
More ▼