Publication Date
In 2025 | 1 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 14 |
Since 2016 (last 10 years) | 39 |
Since 2006 (last 20 years) | 75 |
Descriptor
Accuracy | 75 |
Reliability | 75 |
Validity | 64 |
Foreign Countries | 21 |
Scores | 18 |
College Students | 16 |
Measures (Individuals) | 14 |
Academic Achievement | 13 |
Psychometrics | 13 |
Factor Analysis | 11 |
Statistical Analysis | 11 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Policymakers | 2 |
Practitioners | 2 |
Researchers | 1 |
Location
Australia | 2 |
Germany | 2 |
Iran | 2 |
Pennsylvania | 2 |
Rhode Island | 2 |
Tennessee | 2 |
Turkey | 2 |
Canada | 1 |
Connecticut | 1 |
Delaware | 1 |
District of Columbia | 1 |
More ▼ |
Laws, Policies, & Programs
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Amal Abdullah Alibrahim – South African Journal of Education, 2024
After ChatGPT was released late in 2022, many arguments about its accuracy and use in education arose. In this article, I seek to provide evidence of the accuracy and validity of ChatGPT's responses to users' queries in education by applying a systematic review methodology to analyse publications in specific databases following PRISMA guidelines…
Descriptors: Artificial Intelligence, Technology Uses in Education, Reliability, Natural Language Processing
Peer Overmarking and Insufficient Diagnosticity: The Impact of the Rating Method for Peer Assessment
Van Meenen, Florence; Coertjens, Liesje; Van Nes, Marie-Claire; Verschuren, Franck – Advances in Health Sciences Education, 2022
The present study explores two rating methods for peer assessment (analytical rating using criteria and comparative judgement) in light of concurrent validity, reliability and insufficient diagnosticity (i.e. the degree to which substandard work is recognised by the peer raters). During a second-year undergraduate course, students wrote a one-page…
Descriptors: Evaluation Methods, Peer Evaluation, Accuracy, Evaluation Criteria
Wahyu Nanda Eka Saputra; Trikinasih Handayani; Prima Suci Rohmadheny; Rohmatus Naini; Dody Hartanto; Hardi Santosa; Dewi Afra Khairunnisa; Risma Risansyah; Hanan Riati; Faturrahman – Journal of Education and Learning (EduLearn), 2025
The students are urged to do something without expecting anything in return and only in the name of God. Every islamic student becomes something ideal if they can internalize and implement sincerity. Many people are willing to do something because of an ulterior motive. The importance of sincerity in humans is the background for developing a…
Descriptors: Islam, Interrater Reliability, Prosocial Behavior, Muslims
Umanath, Sharda; Coane, Jennifer H.; Huff, Mark J.; Cimenian, Tamar; Chang, Kai – Cognitive Research: Principles and Implications, 2023
With pursuit of incremental progress and generalizability of findings in mind, we examined a possible boundary for older and younger adults' metacognitive distinction between what is not stored in memory versus merely inaccessible with materials that are not process pure to knowledge or events: information regarding news events. Participants were…
Descriptors: Older Adults, Young Adults, Recall (Psychology), Memory
Wilbert, Jürgen; Bosch, Jannis; Lüke, Timo – International Journal for Research in Learning Disabilities, 2021
Analysis of data from single-case intervention studies commonly involves visual analysis. Previous research indicates that visual analysis may suffer from low reliability and unpromising error rates. We investigated the reliability and validity of visual analysis and explored to what extent data trends affect judgments. We administered a…
Descriptors: Data Analysis, Reliability, Validity, Visual Aids
Atilla Ergin; Yelkin Diker Coskun – International Journal on Social and Education Sciences, 2024
This study aims to develop a scale to measure the design thinking process and to evaluate the reliability and validity of this scale. It fills this gap by introducing a 36-item scale specifically designed to measure design thinking abilities across the five key stages of the design thinking process: empathize, define, ideate, prototype, and test,…
Descriptors: Design, Thinking Skills, Likert Scales, Empathy
Amukune, Stephen; Calchei, Marcela; Józsa, Krisztián – Electronic Journal of Research in Educational Psychology, 2021
Introduction: The current focus of empirical studies demonstrates the significance of mastery motivation in child development, academic achievement and school success. Consequently, it is critical to have reliable and valid tools to measure this important variable accurately. The Preschool version of the Dimensions of Mastery Questionnaire (DMQ)…
Descriptors: Mastery Learning, Child Development, Academic Achievement, Accuracy
Wise, Steven L. – Applied Measurement in Education, 2019
The identification of rapid guessing is important to promote the validity of achievement test scores, particularly with low-stakes tests. Effective methods for identifying rapid guesses require reliable threshold methods that are also aligned with test taker behavior. Although several common threshold methods are based on rapid guessing response…
Descriptors: Guessing (Tests), Identification, Reaction Time, Reliability
Jones, Nathan D.; Bell, Courtney A.; Brownell, Mary; Qi, Yi; Peyton, David; Pua, Daisy; Fowler, Melissa; Holtzman, Steven – Educational Evaluation and Policy Analysis, 2022
We examine whether one of the most popular observation systems in teacher evaluation--the Framework for Teaching (FFT)--captures the range of instructional skills teachers need to be effective. We focus on the case of special educators, who are likely to use instructional approaches that, although supported by research, are de-emphasized in common…
Descriptors: Classroom Observation Techniques, Teacher Evaluation, Special Education Teachers, Teaching Skills
Deribo, Tobias; Goldhammer, Frank; Kroehne, Ulf – Educational and Psychological Measurement, 2023
As researchers in the social sciences, we are often interested in studying not directly observable constructs through assessments and questionnaires. But even in a well-designed and well-implemented study, rapid-guessing behavior may occur. Under rapid-guessing behavior, a task is skimmed shortly but not read and engaged with in-depth. Hence, a…
Descriptors: Reaction Time, Guessing (Tests), Behavior Patterns, Bias
Elaine Chapman; Jian Zhao; Peyman G. P. Sabet – Education Research and Perspectives, 2024
Effective assessments guide student learning, refine teaching practices, ensure curriculum alignment, and foster workforce readiness. However, the emergence of generative artificial intelligence (GenAI) tools, such as ChatGPT, has significantly disrupted traditional assessment processes, raising concerns about academic integrity and necessitating…
Descriptors: Artificial Intelligence, Evaluation Methods, Influence of Technology, Integrity
Pinargote-Ortega, Maricela; Bowen-Mendoza, Lorena; Meza, Jaime; Ventura, Sebastián – Journal of Computing in Higher Education, 2021
In this paper, we applied a peer assessment scenario at the Technical University of Manabí (Ecuador). Students and professors evaluated some works through rubrics, assigned a numerical score, and provided textual feedback grounding why such a numerical score was determined, to detect inaccuracy between both assessments. The proposed model uses…
Descriptors: Foreign Countries, College Students, Peer Evaluation, Scoring Rubrics
Franco, Amanda R.; Vieira, Rui Marques; Riegel, Fernando; Crossetti, Maria da Graça Oliveira – Studies in Higher Education, 2021
Critical Thinking is a transversal skill needed to face current and future challenges, longed for in school, work, and life. Paradoxically, such relevance does not always translate into tangible efforts to measure and promote it. We present the cross-cultural translation, adaptation, and validation process of Critical Thinking Mindset Self-Rating…
Descriptors: Critical Thinking, Translation, Measures (Individuals), Portuguese
Goodnight, Crystalyn I.; Wood, Charles L.; Thompson, Julie L. – Preventing School Failure, 2020
In-service and coaching can increase teachers' use of research-based practices. This study examined the effects of in-service training plus coaching that included preconference, side-by-side coaching, and feedback on kindergarten teachers' use of research-based strategies during beginning reading instruction. Teachers were trained to enhance…
Descriptors: Reading Strategies, Reading Instruction, Coaching (Performance), Evidence Based Practice
Dalton, Sarah Grace; Stark, Brielle C.; Fromm, Davida; Apple, Kristen; MacWhinney, Brian; Rensch, Amanda; Rowedder, Madyson – Journal of Speech, Language, and Hearing Research, 2022
Purpose: The aim of this study was to advance the use of structured, monologic discourse analysis by validating an automated scoring procedure for core lexicon (CoreLex) using transcripts. Method: Forty-nine transcripts from persons with aphasia and 48 transcripts from persons with no brain injury were retrieved from the AphasiaBank database. Five…
Descriptors: Validity, Discourse Analysis, Databases, Scoring