ERIC - Search Results

Publication Date

In 2025	2
Since 2024	17
Since 2021 (last 5 years)	60
Since 2016 (last 10 years)	131
Since 2006 (last 20 years)	218

Descriptor

Accuracy	218
Validity	218
Reliability	64
Foreign Countries	35
Correlation	33
College Students	29
Scores	29
Comparative Analysis	28
Statistical Analysis	28
Evaluation Methods	25
Measures (Individuals)	25
Models	24
Classification	23
Elementary School Students	21
Psychometrics	20
Identification	19
Teaching Methods	19
Questionnaires	18
Decision Making	17
Intervention	17
Measurement Techniques	17
Academic Achievement	16
Student Attitudes	16
Undergraduate Students	16
Computer Software	15

Publication Type

Journal Articles	177
Reports - Research	167
Reports - Evaluative	17
Reports - Descriptive	16
Dissertations/Theses -…	13
Tests/Questionnaires	13
Speeches/Meeting Papers	10
Information Analyses	8
Guides - Non-Classroom	3
Books	2
Numerical/Quantitative Data	2
Guides - General	1
Non-Print Media	1
Opinion Papers	1

Education Level

Higher Education	61
Postsecondary Education	55
Elementary Education	31
Secondary Education	27
High Schools	14
Middle Schools	14
Early Childhood Education	13
Elementary Secondary Education	12
Junior High Schools	12
Primary Education	10
Grade 4	8
Grade 8	8
Grade 7	6
Intermediate Grades	6
Grade 2	5
Grade 3	5
Kindergarten	5
Grade 12	3
Grade 5	3
Grade 6	3
Preschool Education	3
Grade 10	2
Grade 11	2
Grade 9	2
Adult Education	1

Audience

Practitioners	3
Policymakers	2
Researchers	2
Teachers	2

Location

Texas	5
Florida	4
Germany	4
Indiana	4
Tennessee	4
China	3
Turkey	3
Utah	3
California	2
Iran	2
Israel	2
Japan	2
Missouri	2
New York	2
Pennsylvania	2
Rhode Island	2
Thailand	2
United Kingdom (England)	2
Australia	1
Austria	1
Belgium	1
California (Los Angeles)	1
Canada	1
Connecticut	1
Delaware	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	1
Race to the Top	1

Showing 1 to 15 of 218 results

Validity Arguments for AI-Based Automated Scores: Essay Scoring as an Illustration

Peer reviewed

Ferrara, Steve; Qunbar, Saed – Journal of Educational Measurement, 2022

In this article, we argue that automated scoring engines should be transparent and construct relevant--that is, as much as is currently feasible. Many current automated scoring engines cannot achieve high degrees of scoring accuracy without allowing in some features that may not be easily explained and understood and may not be obviously and…

Descriptors: Artificial Intelligence, Scoring, Essays, Automation

Towards a More Nuanced Conceptualisation of Differential Examiner Stringency in OSCEs

Peer reviewed

Matt Homer – Advances in Health Sciences Education, 2024

Quantitative measures of systematic differences in OSCE scoring across examiners (often termed examiner stringency) can threaten the validity of examination outcomes. Such effects are usually conceptualised and operationalised based solely on checklist/domain scores in a station, and global grades are not often used in this type of analysis. In…

Descriptors: Examiners, Scoring, Validity, Cutting Scores

Raising the Roof: Situating Verbs in Symbolic and Embodied Language Processing

Peer reviewed

John Hollander; Andrew Olney – Cognitive Science, 2024

Recent investigations on how people derive meaning from language have focused on task-dependent shifts between two cognitive systems. The symbolic (amodal) system represents meaning as the statistical relationships between words. The embodied (modal) system represents meaning through neurocognitive simulation of perceptual or sensorimotor systems…

Descriptors: Verbs, Symbolic Language, Language Processing, Semantics

Is ChatGPT Reliable in Education?

Peer reviewed

Amal Abdullah Alibrahim – South African Journal of Education, 2024

After ChatGPT was released late in 2022, many arguments about its accuracy and use in education arose. In this article, I seek to provide evidence of the accuracy and validity of ChatGPT's responses to users' queries in education by applying a systematic review methodology to analyse publications in specific databases following PRISMA guidelines…

Descriptors: Artificial Intelligence, Technology Uses in Education, Reliability, Natural Language Processing

Dtametasa: An R Shiny Application for Meta-Analysis of Diagnostic Test Accuracy and Sensitivity Analysis of Publication Bias

Peer reviewed

Mizutani, Shosuke; Zhou, Yi; Tian, Yu-Shi; Takagi, Tatsuya; Ohkubo, Tadayasu; Hattori, Satoshi – Research Synthesis Methods, 2023

Meta-analysis of diagnostic test accuracy (DTA) is a powerful statistical method for synthesizing and evaluating the diagnostic capacity of medical tests and has been extensively used by clinical physicians and healthcare decision-makers. However, publication bias (PB) threatens the validity of meta-analysis of DTA. Some statistical methods have…

Descriptors: Meta Analysis, Diagnostic Tests, Accuracy, Publications

Expected Classification Accuracy for Categorical Growth Models

Peer reviewed

Daniel Murphy; Sarah Quesen; Matthew Brunetti; Quintin Love – Educational Measurement: Issues and Practice, 2024

Categorical growth models describe examinee growth in terms of performance-level category transitions, which implies that some percentage of examinees will be misclassified. This paper introduces a new procedure for estimating the classification accuracy of categorical growth models, based on Rudner's classification accuracy index for item…

Descriptors: Classification, Growth Models, Accuracy, Performance Based Assessment

Validation of Garmin and Polar Devices for Continuous Heart Rate Monitoring during Common Training Movements in Tactical Populations

Peer reviewed

Merrigan, Justin J.; Stovall, J. Hannah; Stone, Jason D.; Stephenson, Mark; Finomore, Victor S.; Hagen, Joshua A. – Measurement in Physical Education and Exercise Science, 2023

Heart rate samples (n = 4500-8000) from wearables were compared to electrocardiography during a steady-state ruck (Ruck-S), maximal effort ruck (Ruck-M), submaximal cycle (Cycle), and Tabata Circuit. One device was worn at each location (wrist: Polar Grit-X, Garmin Fenix 6; chest-straps: Polar H10, Garmin HRM-Pro; armband: Polar Verity).…

Descriptors: Measurement Equipment, Exercise Physiology, Training, Metabolism

Anchoring Validity Evidence for Automated Essay Scoring

Peer reviewed

Shermis, Mark D. – Journal of Educational Measurement, 2022

One of the challenges of discussing validity arguments for machine scoring of essays centers on the absence of a commonly held definition and theory of good writing. At best, the algorithms attempt to measure select attributes of writing and calibrate them against human ratings with the goal of accurate prediction of scores for new essays.…

Descriptors: Scoring, Essays, Validity, Writing Evaluation

Technological Affordances and Applications of Chatbots for Conversational Skill Interventions in Autism: A Scoping Review

Peer reviewed

Peidi Gu; Fang Xu; Lingwei Chen; Zijie Ma; Madian Zhang; Yi Zhang – Education and Information Technologies, 2025

Conversational skills, which are essential for effective social interactions and typically pose difficulties for individuals with autism spectrum disorder (ASD), include abilities such as initiating topics, engaging in back-and-forth dialog, and responding to conversational cues. Chatbots have been used in mental health fields, and the development…

Descriptors: Technology Uses in Education, Artificial Intelligence, Interpersonal Communication, Communication Skills

A Conceptual Approach to Validating Competence Frameworks

Child, Simon; Shaw, Stuart – Research Matters, 2023

This article provides a conceptual framework for considering both the theoretical and methodological factors that underpin the successful validation of a competency framework. Drawing on educational assessment literature, this article argues that a valid competency framework relates to an interpretive judgement of the credibility of the claims…

Descriptors: Competence, Validity, Accuracy, Models

Video-Based Facial Movement Analysis in the Assessment of Bulbar Amyotrophic Lateral Sclerosis: Clinical Validation

Peer reviewed

Guarin, Diego L.; Taati, Babak; Abrahao, Agessandro; Zinman, Lorne; Yunusova, Yana – Journal of Speech, Language, and Hearing Research, 2022

Purpose: Facial movement analysis during facial gestures and speech provides clinically useful information for assessing bulbar amyotrophic lateral sclerosis (ALS). However, current kinematic methods have limited clinical application due to the equipment costs. Recent advancements in consumer-grade hardware and machine/deep learning made it…

Descriptors: Video Technology, Nonverbal Communication, Diseases, Neurological Impairments

Iterative Item Selection of Neighborhood Clusters: A Nonparametric and Non-IRT Method for Generating Miniature Computer Adaptive Questionnaires

Peer reviewed

Yongze Xu – Educational and Psychological Measurement, 2024

The questionnaire method has always been an important research method in psychology. The increasing prevalence of multidimensional trait measures in psychological research has led researchers to use longer questionnaires. However, questionnaires that are too long will inevitably reduce the quality of the completed questionnaires and the efficiency…

Descriptors: Item Response Theory, Questionnaires, Generalization, Simulation

Foundational Competencies in Educational Measurement

Peer reviewed

Terry A. Ackerman; Deborah L. Bandalos; Derek C. Briggs; Howard T. Everson; Andrew D. Ho; Susan M. Lottridge; Matthew J. Madison; Sandip Sinharay; Michael C. Rodriguez; Michael Russell; Alina A. Davier; Stefanie A. Wind – Educational Measurement: Issues and Practice, 2024

This article presents the consensus of a National Council on Measurement in Education Presidential Task Force on Foundational Competencies in Educational Measurement. Foundational competencies are those that support future development of additional professional and disciplinary competencies. The authors develop a framework for foundational…

Descriptors: Educational Assessment, Competence, Skill Development, Communication Skills

Investigating Heterogeneity in Response Strategies: A Mixture Multidimensional IRTree Approach

Peer reviewed

Ö. Emre C. Alagöz; Thorsten Meiser – Educational and Psychological Measurement, 2024

To improve the validity of self-report measures, researchers should control for response style (RS) effects, which can be achieved with IRTree models. A traditional IRTree model considers a response as a combination of distinct decision-making processes, where the substantive trait affects the decision on response direction, while decisions about…

Descriptors: Item Response Theory, Validity, Self Evaluation (Individuals), Decision Making

Engagement as an Alternative to Noncompliance Measurement: Promoting Validity, Accuracy, and Student Outcomes

Peer reviewed

Elisabeth J. Malone; Jennifer A. Kurth; Kathleen N. Zimmerman – Beyond Behavior, 2024

While noncompliance is a concerning challenging behavior and commonly reported by educators, its measurement is likely to be invalid and inaccurate given the subjectivity of the operational definition. Engagement is offered as a more valid, accurate measurement that may provide data regarding the amount of instruction accessed by the student. In…

Descriptors: Student Behavior, Behavior Problems, Resistance (Psychology), Learner Engagement

Source

ProQuest LLC	13
Educational and Psychological…	6
Journal of Speech, Language,…	6
Measurement in Physical…	6
International Educational…	5
Grantee Submission	4
International Education…	4
Journal of Educational…	4
Advances in Health Sciences…	3
Assessment for Effective…	3
Education and Information…	3
Educational Assessment	3
Educational Measurement:…	3
Journal of Autism and…	3
Journal of Experimental…	3
School Psychology Quarterly	3
Applied Measurement in…	2
Assessment & Evaluation in…	2
Assessment and Accountability…	2
Bill & Melinda Gates…	2
Cognitive Research:…	2
Cognitive Science	2
Developmental Psychology	2
Educational Policy	2
International Association for…	2

Author

Dobbins, Ian G.	3
Matta, Michael	3
Amy Briesch	2
Bell, Courtney A.	2
Bloom, Howard	2
Brittany Melo	2
Burke, Mack D.	2
Candace Walkington	2
Cook, Thomas D.	2
Douglas, Karen H.	2
Goldschmidt, Pete	2
Heritage, Margaret	2
Herman, Joan L.	2
Jacob, Robin	2
Jacqueline M. Caemmerer	2
Jessica B. Koslouski	2
Jones, Nathan D.	2
Keller-Margulis, Milena A.	2
Kelsey Schenck	2
Kilgus, Stephen P.	2
Krosnick, Jon A.	2
Lee, Yuan-Hsuan	2
Mercer, Sterett H.	2
Min Wang	2
Mitchell J. Nathan	2

Assessments and Surveys

Beck Depression Inventory	2
Dynamic Indicators of Basic…	2
Strengths and Difficulties…	2
Test of English as a Foreign…	2
ACT Assessment	1
Advanced Placement…	1
Beck Anxiety Inventory	1
Center for Epidemiologic…	1
Child Behavior Checklist	1
Clinical Evaluation of…	1
Early Childhood Environment…	1
Flesch Kincaid Grade Level…	1
Graduate Record Examinations	1
Kaufman Brief Intelligence…	1
MacArthur Communicative…	1
National Longitudinal Survey…	1
National Longitudinal…	1
Preschool Language Scale	1
Preschool and Kindergarten…	1
Purdue Spatial Visualization…	1
SAT (College Admission Test)	1
State of Texas Assessments of…	1
Wechsler Individual…	1
Woodcock Johnson Tests of…	1