Showing 1 to 15 of 27 results
Peer reviewed
Carly Oddleifson; Stephen Kilgus; David A. Klingbeil; Alexander D. Latham; Jessica S. Kim; Ishan N. Vengurlekar – Grantee Submission, 2025
The purpose of this study was to conduct a conceptual replication of Pendergast et al.'s (2018) study that examined the diagnostic accuracy of a nomogram procedure, also known as a naive Bayesian approach. The specific naive Bayesian approach combined academic and social-emotional and behavioral (SEB) screening data to predict student performance…
Descriptors: Bayesian Statistics, Accuracy, Social Emotional Learning, Diagnostic Tests
Aytürk, Ezgi; Cham, Heining; Jennings, Patricia A.; Brown, Joshua L. – Educational and Psychological Measurement, 2020
Methods to handle ordered-categorical indicators in latent variable interactions have been developed, yet they have not been widely applied. This article compares the performance of two popular latent variable interaction modeling approaches in handling ordered-categorical indicators: unconstrained product indicator (UPI) and latent moderated…
Descriptors: Evaluation Methods, Grade 3, Grade 4, Grade 5
Daniel Rodriguez-Segura; Beth E. Schueler – Annenberg Institute for School Reform at Brown University, 2022
School closures induced by COVID-19 placed heightened emphasis on alternative ways to measure student learning besides in-person exams. We leverage the administration of phone-based assessments (PBAs) measuring numeracy and literacy for primary school children in Kenya, along with in-person standardized tests administered to the same students…
Descriptors: Foreign Countries, School Closing, COVID-19, Pandemics
Peer reviewed
Herrmann-Abell, Cari F.; Hardcastle, Joseph; DeBoer, George E. – Grantee Submission, 2018
We compared students' performance on a paper-based test (PBT) and three computer-based tests (CBTs). The three computer-based tests used different test navigation and answer selection features, allowing us to examine how these features affect student performance. The study sample consisted of 9,698 fourth through twelfth grade students from across…
Descriptors: Evaluation Methods, Tests, Computer Assisted Testing, Scores
Peer reviewed
Yoo, Hanwook; Wolf, Mikyung Kim; Ballard, Laura D. – Practical Assessment, Research & Evaluation, 2023
As the theme of the 2022 annual meeting of the American Education Research Association, cultivating equitable education systems has gained renewed attention amid an increasingly diverse society. However, systemic inequalities persist for traditionally underserved student populations. As a way to better address diverse students' needs, it is of…
Descriptors: Comparative Analysis, Native Language, English Language Learners, Multilingualism
Peer reviewed
Westine, Carl D. – American Journal of Evaluation, 2016
Little is known empirically about intraclass correlations (ICCs) for multisite cluster randomized trial (MSCRT) designs, particularly in science education. In this study, ICCs suitable for science achievement studies using a three-level (students in schools in districts) MSCRT design that block on district are estimated and examined. Estimates of…
Descriptors: Efficiency, Evaluation Methods, Science Achievement, Correlation
Peer reviewed
Anderson, Daniel; Farley, Dan; Tindal, Gerald – Journal of Special Education, 2015
Students with significant cognitive disabilities present an assessment dilemma that centers on access and validity in large-scale testing programs. Typically, access is improved by eliminating construct-irrelevant barriers, while validity is improved, in part, through test standardization. In this article, one state's alternate assessment data…
Descriptors: Mental Retardation, Evaluation Methods, Student Evaluation, Standardized Tests
Smith, Leigh – ProQuest LLC, 2015
This applied dissertation was designed to provide perceptual teacher data as well as summative testing data to educational leaders concerning the effects of implementing Investigations in Number, Data, and Space® (Investigations) in three elementary school settings: two Title I schools and one non-Title I school. Data collected during…
Descriptors: Program Evaluation, Program Implementation, Investigations, Elementary School Teachers
Peer reviewed
Ready, Douglas David – Educational Policy, 2013
Accountability systems that measure student learning rather than student achievement have the potential to more accurately evaluate school quality. However, one methodological concern has remained surprisingly absent from discussions of value-added modeling. Standardized assessments that exhibit either positive or negative correlations between…
Descriptors: Academic Achievement, School Effectiveness, Accountability, Achievement Gains
Peer reviewed
Garrett, Rachel; Steinberg, Matthew P. – Educational Evaluation and Policy Analysis, 2015
Despite policy efforts to encourage multiple measures of performance in newly developing teacher evaluation systems, practical constraints often result in evaluations based predominantly on formal classroom observations. Yet there is limited knowledge of how these observational measures relate to student achievement. This article leverages the…
Descriptors: Teacher Effectiveness, Classroom Observation Techniques, Evidence, Teacher Evaluation
Peer reviewed
Briggs, Derek C.; Domingue, Ben – Journal of Educational and Behavioral Statistics, 2013
It is often assumed that a vertical scale is necessary when value-added models depend upon the gain scores of students across two or more points in time. This article examines the conditions under which the scale transformations associated with the vertical scaling process would be expected to have a significant impact on normative interpretations…
Descriptors: Evaluation Methods, Scaling, Scores, Achievement Tests
Peer reviewed
Frost, Jørgen; Ottem, Ernst; Snow, Catherine E.; Hagtvet, Bente E.; Lyster, Solveig Alma Helaas; White, Claire – Scandinavian Journal of Educational Research, 2014
Two ways of measuring change are presented and compared: A conventional "change score", defined as the difference between scores before and after an interim period, and a process-oriented approach focusing on detailed analysis of conceptually defined response patterns. The validity of the two approaches was investigated. Vocabulary…
Descriptors: Vocabulary, Scores, Knowledge Level, Vocabulary Development
Peer reviewed
Ruiz-Primo, Maria Araceli; Li, Min; Wills, Kellie; Giamellaro, Michael; Lan, Ming-Chih; Mason, Hillary; Sands, Deanna – Journal of Research in Science Teaching, 2012
The purpose of this article is to address a major gap in the instructional sensitivity literature on how to develop instructionally sensitive assessments. We propose an approach to developing and evaluating instructionally sensitive assessments in science and test this approach with one elementary life-science module. The assessment we developed…
Descriptors: Effect Size, Inferences, Student Centered Curriculum, Test Construction
Peer reviewed
Kersting, Nicole B.; Chen, Mei-kuang; Stigler, James W. – Education Policy Analysis Archives, 2013
If teacher value-added estimates (VAEs) are to be used as indicators of individual teacher performance in teacher evaluation and accountability systems, it is important to understand how much VAEs are affected by the data and model specifications used to estimate them. In this study we explored the effects of three conditions on the stability of…
Descriptors: Teacher Effectiveness, Teacher Competencies, Accountability, Teacher Evaluation
Peer reviewed
Flowers, Claudia; Kim, Do-Hong; Lewis, Preston; Davis, Violeta Carmen – Journal of Special Education Technology, 2011
This study examined the academic performance and preference of students with disabilities for two types of test administration conditions, computer-based testing (CBT) and pencil-and-paper testing (PPT). Data from a large-scale assessment program were used to examine differences between CBT and PPT academic performance for third to eleventh grade…
Descriptors: Testing, Test Items, Effect Size, Computer Assisted Testing