ERIC - Search Results

Publication Date

In 2025	1
Since 2024	4
Since 2021 (last 5 years)	14
Since 2016 (last 10 years)	39
Since 2006 (last 20 years)	75

Descriptor

Accuracy	75
Reliability	75
Validity	64
Foreign Countries	21
Scores	18
College Students	16
Measures (Individuals)	14
Academic Achievement	13
Psychometrics	13
Factor Analysis	11
Statistical Analysis	11
Evaluation Methods	10
Comparative Analysis	9
Correlation	9
Student Evaluation	9
Questionnaires	8
Teacher Evaluation	8
Classification	6
Construct Validity	6
Decision Making	6
Feedback (Response)	6
Identification	6
Models	6
Predictive Validity	6
Program Evaluation	6
More ▼

Publication Type

Journal Articles	59
Reports - Research	53
Reports - Evaluative	9
Reports - Descriptive	8
Tests/Questionnaires	7
Information Analyses	4
Dissertations/Theses -…	3
Speeches/Meeting Papers	3
Guides - Non-Classroom	2
Books	1
Non-Print Media	1
Numerical/Quantitative Data	1
Opinion Papers	1
More ▼

Education Level

Higher Education	27
Postsecondary Education	25
Secondary Education	11
Elementary Education	8
Elementary Secondary Education	8
High Schools	7
Junior High Schools	5
Middle Schools	5
Early Childhood Education	4
Grade 8	4
Primary Education	3
Grade 12	2
Grade 7	2
Grade 1	1
Grade 10	1
Grade 11	1
Grade 6	1
Grade 9	1
Kindergarten	1
Preschool Education	1
More ▼

Audience

Policymakers	2
Practitioners	2
Researchers	1

Location

Australia	2
Germany	2
Iran	2
Pennsylvania	2
Rhode Island	2
Tennessee	2
Turkey	2
Canada	1
Connecticut	1
Delaware	1
District of Columbia	1
Ecuador	1
Finland	1
Florida	1
Indiana	1
Indonesia	1
Japan	1
Kenya	1
Lesotho	1
Massachusetts	1
Massachusetts (Boston)	1
New Jersey	1
New York	1
Oman	1
Philippines	1
More ▼

Laws, Policies, & Programs

Race to the Top

Assessments and Surveys

Beck Depression Inventory	2
Test of English as a Foreign…	2
ACT Assessment	1
Beck Anxiety Inventory	1
Center for Epidemiologic…	1
Child Behavior Checklist	1
Dynamic Indicators of Basic…	1
Flesch Kincaid Grade Level…	1
Graduate Record Examinations	1
National Longitudinal…	1
SAT (College Admission Test)	1
Strengths and Difficulties…	1
Woodcock Johnson Tests of…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 75 results Save | Export

Is ChatGPT Reliable in Education?

Peer reviewed
PDF on ERIC

Download full text

Amal Abdullah Alibrahim – South African Journal of Education, 2024

After ChatGPT was released late in 2022, many arguments about its accuracy and use in education arose. In this article, I seek to provide evidence of the accuracy and validity of ChatGPT's responses to users' queries in education by applying a systematic review methodology to analyse publications in specific databases following PRISMA guidelines…

Descriptors: Artificial Intelligence, Technology Uses in Education, Reliability, Natural Language Processing

Peer Overmarking and Insufficient Diagnosticity: The Impact of the Rating Method for Peer Assessment

Peer reviewed

Direct link

Van Meenen, Florence; Coertjens, Liesje; Van Nes, Marie-Claire; Verschuren, Franck – Advances in Health Sciences Education, 2022

The present study explores two rating methods for peer assessment (analytical rating using criteria and comparative judgement) in light of concurrent validity, reliability and insufficient diagnosticity (i.e. the degree to which substandard work is recognised by the peer raters). During a second-year undergraduate course, students wrote a one-page…

Descriptors: Evaluation Methods, Peer Evaluation, Accuracy, Evaluation Criteria

The Scale of Sincerity Based on Kyai Haji Ahmad Dahlan's Version for Islamic Students: The Rasch Analysis

Peer reviewed
PDF on ERIC

Download full text

Wahyu Nanda Eka Saputra; Trikinasih Handayani; Prima Suci Rohmadheny; Rohmatus Naini; Dody Hartanto; Hardi Santosa; Dewi Afra Khairunnisa; Risma Risansyah; Hanan Riati; Faturrahman – Journal of Education and Learning (EduLearn), 2025

The students are urged to do something without expecting anything in return and only in the name of God. Every islamic student becomes something ideal if they can internalize and implement sincerity. Many people are willing to do something because of an ulterior motive. The importance of sincerity in humans is the background for developing a…

Descriptors: Islam, Interrater Reliability, Prosocial Behavior, Muslims

Ecological Validity of Don't Remember and Don't Know for Distinguishing Accessibility- Versus Availability-Based Retrieval Failures in Older and Younger Adults: Knowledge for News Events

Peer reviewed

Direct link

Umanath, Sharda; Coane, Jennifer H.; Huff, Mark J.; Cimenian, Tamar; Chang, Kai – Cognitive Research: Principles and Implications, 2023

With pursuit of incremental progress and generalizability of findings in mind, we examined a possible boundary for older and younger adults' metacognitive distinction between what is not stored in memory versus merely inaccessible with materials that are not process pure to knowledge or events: information regarding news events. Participants were…

Descriptors: Older Adults, Young Adults, Recall (Psychology), Memory

Validity and Judgment Bias in Visual Analysis of Single-Case Data

Peer reviewed

Direct link

Wilbert, Jürgen; Bosch, Jannis; Lüke, Timo – International Journal for Research in Learning Disabilities, 2021

Analysis of data from single-case intervention studies commonly involves visual analysis. Previous research indicates that visual analysis may suffer from low reliability and unpromising error rates. We investigated the reliability and validity of visual analysis and explored to what extent data trends affect judgments. We administered a…

Descriptors: Data Analysis, Reliability, Validity, Visual Aids

Design Thinking Scale Development: Assessing Reliability and Validity

Peer reviewed
PDF on ERIC

Download full text

Atilla Ergin; Yelkin Diker Coskun – International Journal on Social and Education Sciences, 2024

This study aims to develop a scale to measure the design thinking process and to evaluate the reliability and validity of this scale. It fills this gap by introducing a 36-item scale specifically designed to measure design thinking abilities across the five key stages of the design thinking process: empathize, define, ideate, prototype, and test,…

Descriptors: Design, Thinking Skills, Likert Scales, Empathy

Swahili Version of the Dimensions of Mastery Questionnaire: Adaptation and Psychometric Properties

Peer reviewed

Direct link

Amukune, Stephen; Calchei, Marcela; Józsa, Krisztián – Electronic Journal of Research in Educational Psychology, 2021

Introduction: The current focus of empirical studies demonstrates the significance of mastery motivation in child development, academic achievement and school success. Consequently, it is critical to have reliable and valid tools to measure this important variable accurately. The Preschool version of the Dimensions of Mastery Questionnaire (DMQ)…

Descriptors: Mastery Learning, Child Development, Academic Achievement, Accuracy

An Information-Based Approach to Identifying Rapid-Guessing Thresholds

Peer reviewed

Direct link

Wise, Steven L. – Applied Measurement in Education, 2019

The identification of rapid guessing is important to promote the validity of achievement test scores, particularly with low-stakes tests. Effective methods for identifying rapid guesses require reliable threshold methods that are also aligned with test taker behavior. Although several common threshold methods are based on rapid guessing response…

Descriptors: Guessing (Tests), Identification, Reaction Time, Reliability

Using Classroom Observations in the Evaluation of Special Education Teachers

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jones, Nathan D.; Bell, Courtney A.; Brownell, Mary; Qi, Yi; Peyton, David; Pua, Daisy; Fowler, Melissa; Holtzman, Steven – Educational Evaluation and Policy Analysis, 2022

We examine whether one of the most popular observation systems in teacher evaluation--the Framework for Teaching (FFT)--captures the range of instructional skills teachers need to be effective. We focus on the case of special educators, who are likely to use instructional approaches that, although supported by research, are de-emphasized in common…

Descriptors: Classroom Observation Techniques, Teacher Evaluation, Special Education Teachers, Teaching Skills

Changes in the Speed-Ability Relation through Different Treatments of Rapid Guessing

Peer reviewed

Direct link

Deribo, Tobias; Goldhammer, Frank; Kroehne, Ulf – Educational and Psychological Measurement, 2023

As researchers in the social sciences, we are often interested in studying not directly observable constructs through assessments and questionnaires. But even in a well-designed and well-implemented study, rapid-guessing behavior may occur. Under rapid-guessing behavior, a task is skimmed shortly but not read and engaged with in-depth. Hence, a…

Descriptors: Reaction Time, Guessing (Tests), Behavior Patterns, Bias

Generative Artificial Intelligence and Assessment Task Design: Getting Back to Basics through the Lens of the AARDVARC Model

Peer reviewed

Direct link

Elaine Chapman; Jian Zhao; Peyman G. P. Sabet – Education Research and Perspectives, 2024

Effective assessments guide student learning, refine teaching practices, ensure curriculum alignment, and foster workforce readiness. However, the emergence of generative artificial intelligence (GenAI) tools, such as ChatGPT, has significantly disrupted traditional assessment processes, raising concerns about academic integrity and necessitating…

Descriptors: Artificial Intelligence, Evaluation Methods, Influence of Technology, Integrity

Peer Assessment Using Soft Computing Techniques

Peer reviewed

Direct link

Pinargote-Ortega, Maricela; Bowen-Mendoza, Lorena; Meza, Jaime; Ventura, Sebastián – Journal of Computing in Higher Education, 2021

In this paper, we applied a peer assessment scenario at the Technical University of Manabí (Ecuador). Students and professors evaluated some works through rubrics, assigned a numerical score, and provided textual feedback grounding why such a numerical score was determined, to detect inaccuracy between both assessments. The proposed model uses…

Descriptors: Foreign Countries, College Students, Peer Evaluation, Scoring Rubrics

Steering Clear from 'Lost in Translation': Cross-Cultural Translation, Adaptation, and Validation of Critical Thinking Mindset Self-Rating Form to University Students

Peer reviewed

Direct link

Franco, Amanda R.; Vieira, Rui Marques; Riegel, Fernando; Crossetti, Maria da Graça Oliveira – Studies in Higher Education, 2021

Critical Thinking is a transversal skill needed to face current and future challenges, longed for in school, work, and life. Paradoxically, such relevance does not always translate into tangible efforts to measure and promote it. We present the cross-cultural translation, adaptation, and validation process of Critical Thinking Mindset Self-Rating…

Descriptors: Critical Thinking, Translation, Measures (Individuals), Portuguese

Effects of In-Service and Coaching to Increase Teachers' Use of Research-Based Strategies in Beginning Reading

Peer reviewed

Direct link

Goodnight, Crystalyn I.; Wood, Charles L.; Thompson, Julie L. – Preventing School Failure, 2020

In-service and coaching can increase teachers' use of research-based practices. This study examined the effects of in-service training plus coaching that included preconference, side-by-side coaching, and feedback on kindergarten teachers' use of research-based strategies during beginning reading instruction. Teachers were trained to enhance…

Descriptors: Reading Strategies, Reading Instruction, Coaching (Performance), Evidence Based Practice

Validation of an Automated Procedure for Calculating Core Lexicon from Transcripts

Peer reviewed

Direct link

Dalton, Sarah Grace; Stark, Brielle C.; Fromm, Davida; Apple, Kristen; MacWhinney, Brian; Rensch, Amanda; Rowedder, Madyson – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The aim of this study was to advance the use of structured, monologic discourse analysis by validating an automated scoring procedure for core lexicon (CoreLex) using transcripts. Method: Forty-nine transcripts from persons with aphasia and 48 transcripts from persons with no brain injury were retrieved from the AphasiaBank database. Five…

Descriptors: Validity, Discourse Analysis, Databases, Scoring

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

ProQuest LLC	3
Applied Measurement in…	2
Assessment and Accountability…	2
Bill & Melinda Gates…	2
Educational and Psychological…	2
Journal of Psychoeducational…	2
Measurement and Evaluation in…	2
Preventing School Failure	2
Advances in Health Sciences…	1
Afterschool Matters	1
Annals of Dyslexia	1
Assessment & Evaluation in…	1
Assessment for Effective…	1
Assessment in Education:…	1
Child & Youth Care Forum	1
Cognitive Research:…	1
ConnCAN	1
Discourse Processes: A…	1
ETS Research Report Series	1
Education Policy Analysis…	1
Education Research and…	1
Education and Information…	1
Education and the Public…	1
Educational Evaluation and…	1
Electronic Journal of…	1
More ▼

Goldschmidt, Pete	2
Heritage, Margaret	2
Herman, Joan L.	2
Adjei, Seth	1
Alsinani, Maryam	1
Amal Abdullah Alibrahim	1
Amrein-Beardsley, Audrey	1
Amukune, Stephen	1
Andrade, Heidi L.	1
Andre, Sherry	1
Angus, Megan Hague	1
Apple, Kristen	1
Atilla Ergin	1
Babcock, Ben	1
Baratchian, Taher	1
Bardoshi, Gerta	1
Barth, Amy E.	1
Bell, Courtney A.	1
Bell, Priscilla	1
Bosch, Jannis	1
Bowen-Mendoza, Lorena	1
Bramley, Tom	1
Brown, Gavin T. L.	1
Brownell, Mary	1
Calchei, Marcela	1
More ▼