NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 256 to 270 of 27,052 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Rajeshwari Panigrahi; Khaliq Lubza Nihar; Neha Singh – Higher Learning Research Communications, 2024
Objective: This study aimed to develop and test a scale for measuring the quality of blended learning models in higher education. Methods: This research adopts a sequential mixed-method approach to construct a new measurement scale. The first phase consisted of the inductive approach to identify the items, followed by exploratory factor analysis.…
Descriptors: Blended Learning, Educational Quality, Higher Education, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Benjamin R. Shear; Derek C. Briggs – Asia Pacific Education Review, 2024
Research in the social and behavioral sciences relies on a wide range of experimental and quasi-experimental designs to estimate the causal effects of specific programs, policies, and events. In this paper we highlight measurement issues relevant to evaluating the validity of causal estimation and generalization. These issues impact all four…
Descriptors: Measurement Techniques, Inferences, COVID-19, Pandemics
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Qinjin Jia; Jialin Cui; Ruijie Xi; Chengyuan Liu; Parvez Rashid; Ruochi Li; Edward Gehringer – International Educational Data Mining Society, 2024
Feedback on student assignments plays a crucial role in steering students toward academic success. To provide feedback more promptly and efficiently, researchers are actively exploring the use of large language models (LLMs) to automatically generate feedback on student artifacts. Although the generated feedback is highly fluent, coherent, and…
Descriptors: Feedback (Response), Assignments, Artificial Intelligence, Accuracy
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Jones, Nathan; Bell, Courtney; Qi, Yi; Lewis, Jennifer; Kirui, David; Stickler, Leslie; Redash, Amanda – ETS Research Report Series, 2021
The observation systems being used in all 50 states require administrators to learn to accurately and reliably score their teachers' instruction using standardized observation systems. Although the literature on observation systems is growing, relatively few studies have examined the outcomes of trainings focused on developing administrators'…
Descriptors: Observation, Standardized Tests, Teacher Evaluation, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023
As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…
Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy
Peer reviewed Peer reviewed
Direct linkDirect link
Zigler, Christina K.; Lin, Li; McFatrich, Molly; Lucas, Nicole; Gordon, Kelly L.; Jones, Harrison N.; Berent, Allyson; Panagoulias, Jennifer; Evans, Paula; Reeve, Bryce B. – American Journal on Intellectual and Developmental Disabilities, 2023
There is a critical need for high-quality clinical outcome assessments to capture the important aspects of communication ability of individuals with Angelman syndrome (AS). To center the perspective of caregivers, our team developed the novel Observer-Reported Communication Ability (ORCA) measure using best practice guidelines, with the goal of…
Descriptors: Genetic Disorders, Test Validity, Observation, Communication Skills
Peer reviewed Peer reviewed
Direct linkDirect link
Ates, Esin; Konal Korkmaz, Ebru; Temel, Ayla Baylk – Journal of School Health, 2023
Background: Appropriate diagnosis of sleep problems is crucial, given the importance of sleep in childhood development. The Sleep Self-Report Scale (SSRS) is used to assess children's sleep problems in the United States and Spain, and this study aimed to expand the usability of this instrument by evaluating its validity and reliability in Turkish…
Descriptors: Foreign Countries, Sleep, Child Health, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Weingarden, Merav; Heyd-Metzuyanim, Einat – Journal of Mathematics Teacher Education, 2023
In this study, we examine "what went wrong" in our professional development program for encouraging cognitively demanding instruction, focusing on the difficulties we encountered in using an observational tool for evaluating this type of instruction and reaching inter-rater reliability. We do so through the lens of a discursive theory of…
Descriptors: Mathematics Instruction, Interrater Reliability, Cognitive Processes, Difficulty Level
Peer reviewed Peer reviewed
Direct linkDirect link
Cheng, Yao-Chung – Asia-Pacific Education Researcher, 2023
A principals' school management imaginative capability is the cornerstone of visionary leadership. Based on the imagination theory, this study constructed a Principal's School Management Imaginative Capability Scale (PSMICS). Questionnaires were conducted through stratified random sampling. Thirteen hundred and two valid samples were obtained.…
Descriptors: Principals, Leadership, Imagination, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Chamba-Eras, Luis; Arruarte, Ana; Elorriaga, Jon A. – IEEE Transactions on Learning Technologies, 2023
In the context of virtual learning communities (VLCs), where the participants may not know each other, it is necessary to have a mechanism to help when deciding who to work with and what reliable contents and information sources are. This study aims to design a generic trust model, named T-VLC, applicable to VLCs, which can be adapted to different…
Descriptors: Communities of Practice, Electronic Learning, Trust (Psychology), Models
Peer reviewed Peer reviewed
Direct linkDirect link
Kunar, Melina A.; Watson, Derrick G. – Cognitive Research: Principles and Implications, 2023
Computer-Aided Detection (CAD) has been proposed to help operators search for cancers in mammograms. Previous studies have found that although accurate CAD leads to an improvement in cancer detection, inaccurate CAD leads to an increase in both missed cancers and false alarms. This is known as the over-reliance effect. We investigated whether…
Descriptors: Assistive Technology, Computer Use, Clinical Diagnosis, Screening Tests
Bryce D. McLeod; Nicole Porter; Aaron Hogue; Emily M. Becker-Haimes; Amanda Jensen-Doss – Grantee Submission, 2023
Objective: The precise measurement of treatment fidelity (quantity and quality in the delivery of treatment strategies in an intervention) is essential for intervention development, evaluation, and implementation. Various informants are used in fidelity assessment (e.g., observers, practitioners [clinicians, teachers], clients), but these…
Descriptors: Measurement, Fidelity, Educational Research, Evidence Based Practice
Peer reviewed Peer reviewed
Direct linkDirect link
Orhan, Ali – Journal of Psychoeducational Assessment, 2022
The aims of this reliability generalization study were to provide the overall alpha values of the California critical thinking disposition inventory (CCTDI) total score and subscales scores and investigate the characteristics of the studies that may be associated with the variability in the reliability values of the CCTDI total score and subscales…
Descriptors: Critical Thinking, Measures (Individuals), Test Reliability, Generalization
Peer reviewed Peer reviewed
Direct linkDirect link
Bonett, Douglas G. – Journal of Educational and Behavioral Statistics, 2022
The limitations of Cohen's ? are reviewed and an alternative G-index is recommended for assessing nominal-scale agreement. Maximum likelihood estimates, standard errors, and confidence intervals for a two-rater G-index are derived for one-group and two-group designs. A new G-index of agreement for multirater designs is proposed. Statistical…
Descriptors: Statistical Inference, Statistical Data, Interrater Reliability, Design
Peer reviewed Peer reviewed
Direct linkDirect link
Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020
The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…
Descriptors: Test Bias, Interrater Reliability, Responses, Correlation
Pages: 1  |  ...  |  14  |  15  |  16  |  17  |  18  |  19  |  20  |  21  |  22  |  ...  |  1804