Showing 76 to 90 of 26,686 results
Peer reviewed
Sas, Marlies; Snaphaan, Thom; Pauwels, Lieven J. R.; Ponnet, Koen; Hardyns, Wim – Field Methods, 2023
This study focuses on the use of systematic social observations (SSO) to measure crime prevention through environmental design (CPTED) and disorder. To improve knowledge about measurement issues in small area research, SSO is conducted by means of three different methods: in-situ, photographs, and Google Street View (GSV) imagery. By evaluating…
Descriptors: Crime Prevention, Measurement Techniques, Photography, Observation
Peer reviewed
Zhou, Shuqi; Merzdorf, Hillary E.; Douglas, Kerrie A.; Moore, Tamara J. – Journal of Pre-College Engineering Education Research, 2023
This study aimed to develop a K-12 classroom observation protocol to assess K-12 teachers' implementation of science, technology, engineering, and mathematics (STEM) integration. The intended purpose of the observation protocol is for researchers to examine how K-12 teachers implement the STEM integrated curriculum. Based on research on STEM…
Descriptors: Test Construction, Test Validity, STEM Education, Classroom Observation Techniques
McCluskey, Sydne – ProQuest LLC, 2023
Rater comparison analysis is commonly necessary in the social sciences. Conventional approaches to the problem generally focus on calculation of agreement statistics, which provide useful but incomplete information about rater agreement. Importantly, one-number agreement statistics give no indication regarding the nature of disagreements, nor do…
Descriptors: Bayesian Statistics, Structural Equation Models, Interrater Reliability, Beliefs
Peer reviewed
Luu, Kimberly; Sidhu, Ravi; Chadha, Neil K.; Eva, Kevin W. – Advances in Health Sciences Education, 2023
Clinical supervisors are known to assess trainee performance idiosyncratically, causing concern about the validity of their ratings. The literature on this issue relies heavily on retrospective collection of decisions, resulting in the risk of inaccurate information regarding what actually drives raters' perceptions. Capturing in-the-moment…
Descriptors: Clinical Experience, Practicum Supervision, Student Evaluation, Evaluation Methods
Peer reviewed
Egmose, Ida; Skou, Mia; Madsen, Eva Back; Stuart, Anne Christine; Krogh, Marianne Thode; Haase, Tina Wahl; Vaever, Mette Skovgaard – European Journal of Developmental Psychology, 2023
Mind-mindedness (MM) refers to the parent's ability to treat the child as an individual with a mind of his or her own. Studies have found representational and interactional MM to predict child development, but more research is needed on the validity of representational MM in parents of infants. Therefore, we examine the reliability and validity of…
Descriptors: Individualism, Mothers, Infants, Foreign Countries
Feldberg, Zachary R. – ProQuest LLC, 2023
Cognitive diagnostic models (CDMs) provide pedagogically relevant information in the form of a student profile of multiple binary categorizations of students into mastery or nonmastery statuses on latent traits called attributes. Federal educational accountability requires accountability measures to designate students into one of at least three…
Descriptors: Accountability, Standards, Cutting Scores, Models
Peer reviewed
Tavares, Walter; Kinnear, Benjamin; Schumacher, Daniel J.; Forte, Milena – Advances in Health Sciences Education, 2023
In this perspective, the authors critically examine "rater training" as it has been conceptualized and used in medical education. By "rater training," they mean the educational events intended to "improve" rater performance and contributions during assessment events. Historically, rater training programs have focused…
Descriptors: Medical Education, Interrater Reliability, Evaluation Methods, Training
Peer reviewed
Pereira, Valerie J.; Tuomainen, Jyrki; Lee, Kathy Y. S.; Tong, Michael C. F.; Sell, Debbie A. – International Journal of Language & Communication Disorders, 2021
Background: The status of the velopharyngeal mechanism can be inferred from perceptual ratings of specified speech parameters. Several studies have proposed the measure of an overall velopharyngeal composite score based on these perceptual ratings and have reported good validity. The Cleft Audit Protocol for Speech--Augmented (CAPS-A) is a…
Descriptors: Congenital Impairments, Speech Tests, Outcome Measures, Test Validity
Peer reviewed
Brogan L. Barr; Virginia V. W. McIntosh; Eileen F. Britt; Jennifer Jordan; Janet D. Carter – Measurement: Interdisciplinary Research and Perspectives, 2024
Even when raters demonstrate agreement in the use of a measure, limited score variability or violation of often-ignored statistical assumptions can result in lower reliability estimates than intuitively expected. This article uses data drawn from two randomized controlled trials of schema therapy and cognitive behavioral therapy for the treatment…
Descriptors: Evaluators, Interrater Reliability, Reliability, Measurement Techniques
Peer reviewed
Erin Johnson; Samantha Barstack; Yikai Xu; Hannah Wise; Bradley T. Erford; Catharina Chang; David Delmonico – Measurement and Evaluation in Counseling and Development, 2025
Problem Statement: Among individuals aged 12 years or older, 14.3% (40.0 million) reported the use of an illicit drug in the previous year. Given the prevalence of drug abuse, it is increasingly important to determine effective screening practices, treatment procedures, and best practices among various subpopulations to identify drug use-related…
Descriptors: Drug Abuse, Screening Tests, Psychometrics, Synthesis
Peer reviewed
Mirjam Sophia Glessmer; Rachel Forsyth – Teaching & Learning Inquiry, 2025
Generative AI tools (GenAI) are increasingly used for academic tasks, including qualitative data analysis for the Scholarship of Teaching and Learning (SoTL). In our practice as academic developers, we are frequently asked for advice on whether this use of GenAI is reliable, valid, and ethical. Since this is a new field, we have not been able to…
Descriptors: Artificial Intelligence, Research Methodology, Data Analysis, Scholarship
Peer reviewed
Yangmeng Xu; Stefanie A. Wind – Educational Measurement: Issues and Practice, 2025
Double-scoring constructed-response items is a common but costly practice in mixed-format assessments. This study explored the impacts of Targeted Double-Scoring (TDS) and random double-scoring procedures on the quality of psychometric outcomes, including student achievement estimates, person fit, and student classifications under various…
Descriptors: Academic Achievement, Psychometrics, Scoring, Evaluation Methods
Peer reviewed
Juliana Reyes-Martin; David Simó-Pinatella; Ana Andrés – Journal of Applied Research in Intellectual Disabilities, 2025
Background: Behavioural problems in individuals with intellectual disabilities have a negative impact on them. Limited assessment measures exist in Spain. This study aimed to validate the Behavior Problems Inventory--Short Form (BPI-S) in the Spanish population by examining its psychometric properties and factorial structures. Method: This study…
Descriptors: Foreign Countries, Behavior Problems, Students with Disabilities, Intellectual Disability
Peer reviewed
Alberto Gandolfi – International Journal of Artificial Intelligence in Education, 2025
In this paper, we initially investigate the capabilities of GPT-3.5 and GPT-4 in solving college-level calculus problems, an essential segment of mathematics that remains under-explored so far. Although improving upon earlier versions, GPT-4 attains approximately 65% accuracy for standard problems and decreases to 20% for competition-like…
Descriptors: Artificial Intelligence, Reliability, Problem Solving, Mathematics Skills
Peer reviewed
Abdullah Faruk Kiliç; Meltem Acar Güvendir; Gül Güler; Tugay Kaçak – Measurement: Interdisciplinary Research and Perspectives, 2025
In this study, the extent to which wording effects impact structure and factor loadings, internal consistency and measurement invariance was outlined. The modified form, which includes items that are semantically reversed, explains 21.5% more variance than the original form. Also, reversed items' factor loadings are higher. As a result of CFA, indexes…
Descriptors: Test Items, Factor Structure, Test Reliability, Semantics