NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 46 to 60 of 734 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Journal of Experimental Education, 2022
In this study, we examined the scoring and generalizability assumptions of an explicit instruction (EI) special education teacher observation protocol using many-faceted Rasch measurement (MFRM). Video observations of classroom instruction from 48 special education teachers across four states were collected. External raters (n = 20) were trained…
Descriptors: Direct Instruction, Teacher Education, Classroom Observation Techniques, Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Bimpeh, Yaw; Pointer, William; Smith, Ben Alexander; Harrison, Liz – Applied Measurement in Education, 2020
Many high-stakes examinations in the United Kingdom (UK) use both constructed-response items and selected-response items. We need to evaluate the inter-rater reliability for constructed-response items that are scored by humans. While there are a variety of methods for evaluating rater consistency across ratings in the psychometric literature, we…
Descriptors: Scoring, Generalizability Theory, Interrater Reliability, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Carbonneau, Kira J.; Van Orman, Dustin S. J.; Lemberger-Truelove, Matthew E.; Atencio, David J. – Early Education and Development, 2020
Research Findings: Given the variable nature of early childhood settings, practitioners and researchers need better guidance on what conditions influence observations conducted within early childhood settings (National Research Council, 2008). Using 230 observations from 23 three- and four-year-old children, we conducted a Generalizability study…
Descriptors: Classroom Environment, Observation, Preschool Children, Influences
Peer reviewed Peer reviewed
Direct linkDirect link
Weston, Timothy J.; Hayward, Charles N.; Laursen, Sandra L. – American Journal of Evaluation, 2021
Observations are widely used in research and evaluation to characterize teaching and learning activities. Because conducting observations is typically resource intensive, it is important that inferences from observation data are made confidently. While attention focuses on interrater reliability, the reliability of a single-class measure over the…
Descriptors: Generalizability Theory, Observation, Inferences, Social Science Research
Peer reviewed Peer reviewed
Direct linkDirect link
Andrea L. B. Ford; Marianne Elmquist; LeAnne D. Johnson; Jon Tapp – Journal of Speech, Language, and Hearing Research, 2025
Purpose: Estimating the sequential associations between educators' and children's talk during language learning interactions requires careful consideration of factors that may impact measurement stability and resultant inferences. This research note will describe a preliminary study that used generalizability theory to understand the contribution…
Descriptors: Preschool Children, Preschool Curriculum, Preschool Education, Preschool Teachers
Peer reviewed Peer reviewed
Direct linkDirect link
Erickson, Ainsley T. – History of Education Quarterly, 2020
Carl Kaestle defines a generalization as "how we know when we know." Kaestle sketches a model of increasing certainty in historical claims as they are developed and refined at increasing scales of research, from local to international. A historical claim might originate in the study of a particular place or case, but to know that the…
Descriptors: Generalization, Generalizability Theory, Historical Interpretation, Archives
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Deniz, Kaan Zulfikar; Ilican, Emel – International Journal of Assessment Tools in Education, 2021
This study aims to compare the G and Phi coefficients as estimated by D studies for a measurement tool with the G and Phi coefficients obtained from real cases in which items of differing difficulty levels were added and also to determine the conditions under which the D studies estimated reliability coefficients closer to reality. The study group…
Descriptors: Generalizability Theory, Test Items, Difficulty Level, Test Reliability
Zhun Deng – ProQuest LLC, 2021
Machine learning has achieved state-of-the-art performance in many areas, including image recognition and natural language processing. However, there are still many challenges and mysteries attracting numerous researchers. This dissertation comprises a series of works concerning problems at the intersection of computer science theory, adversarial…
Descriptors: Learning Analytics, Instructional Design, Artificial Intelligence, Computer Science
Peer reviewed Peer reviewed
Direct linkDirect link
Leher Singh – Journal of Cognition and Development, 2024
This article serves as an introduction to the Special Issue on "Decolonizing and Diversifying Research in Cognitive Development." The Special Issue comprises six articles: two articles are empirical articles that focus on executive function development in under-represented environments, two articles address barriers pathways toward…
Descriptors: Decolonization, Cognitive Development, Theory Practice Relationship, Research and Development
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Yoon Jeon; Knowles, Mariah A.; Scianna, Jennifer; Lin, Grace; Ruipérez-Valiente, José A. – British Journal of Educational Technology, 2023
Game-based assessment (GBA), a specific application of games for learning, has been recognized as an alternative form of assessment. While there is a substantive body of literature that supports the educational benefits of GBA, limited work investigates the validity and generalizability of such systems. In this paper, we describe applications of…
Descriptors: Learning Analytics, Validity, Generalizability Theory, Game Based Learning
Peer reviewed Peer reviewed
Direct linkDirect link
Jeffrey Shero; Jessica Logan – Society for Research on Educational Effectiveness, 2024
Background/Context: Previous research in educational assessment has consistently emphasized the importance of reliability as a cornerstone of test quality. Traditional measures of reliability, such as test-retest and split-half reliability, offer a broad view of how internally consistent a measure is but overlook the variability in this internal…
Descriptors: Educational Assessment, Special Education, Students with Disabilities, Learning Disabilities
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021
Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…
Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Simsek, Ahmet Salih – International Journal of Assessment Tools in Education, 2023
Likert-type item is the most popular response format for collecting data in social, educational, and psychological studies through scales or questionnaires. However, there is no consensus on whether parametric or non-parametric tests should be preferred when analyzing Likert-type data. This study examined the statistical power of parametric and…
Descriptors: Error of Measurement, Likert Scales, Nonparametric Statistics, Statistical Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Jane E. Miller – Numeracy, 2023
Students often believe that statistical significance is the only determinant of whether a quantitative result is "important." In this paper, I review traditional null hypothesis statistical testing to identify what questions inferential statistics can and cannot answer, including statistical significance, effect size and direction,…
Descriptors: Statistical Significance, Holistic Approach, Statistical Inference, Effect Size
Peer reviewed Peer reviewed
Direct linkDirect link
Wind, Stefanie A.; Jones, Eli – Educational Researcher, 2019
Teacher evaluation systems often include classroom observations in which raters use rating scales to evaluate teachers' effectiveness. Recently, researchers have promoted the use of multifaceted approaches to investigating reliability using Generalizability theory, instead of rater reliability statistics. Generalizability theory allows analysts to…
Descriptors: Teacher Evaluation, Observation, Generalizability Theory, Item Response Theory
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  49