Showing 1 to 15 of 55 results
Peer reviewed
Gregory Chernov – Evaluation Review, 2025
Most existing solutions to the current replication crisis in science address only the factors stemming from specific poor research practices. We introduce a novel mechanism that leverages the experts' predictive abilities to analyze the root causes of replication failures. It is backed by the principle that the most accurate predictor is the most…
Descriptors: Replication (Evaluation), Prediction, Scientific Research, Failure
Peer reviewed
Jennifer Randall; Mya Poe; Maria Elena Oliveri; David Slomp – Educational Assessment, 2024
Traditional validation approaches fail to account for the ways oppressive systems (e.g., racism, radical nationalism) impact the test design and development process. To disrupt this legacy of white supremacy, we illustrate how a justice-oriented, antiracist validation (JAV) framework can be applied to construct articulation and validation, data…
Descriptors: Social Justice, Racism, Educational Assessment, Models
Peer reviewed
Curran, Patrick J.; Georgeson, A. R.; Bauer, Daniel J.; Hussong, Andrea M. – International Journal of Behavioral Development, 2021
Conducting valid and reliable empirical research in the prevention sciences is an inherently difficult and challenging task. Chief among these is the need to obtain numerical scores of underlying theoretical constructs for use in subsequent analysis. This challenge is further exacerbated by the increasingly common need to consider multiple…
Descriptors: Psychometrics, Scoring, Prevention, Scores
Peer reviewed
Maryam Atai-Tabar; Gholamreza Zareian; Seyyed Mohammad Reza Amirian; Seyyed Mohammad Reza Adel – Journal of Applied Research in Higher Education, 2024
Purpose: The purpose of this study was to ascertain the relationship between EFL teachers' perception of the intended and unintended consequences of formative assessment (FA) decisions and their sense of self-efficacy and anxiety toward data-driven decision-making (DDDM). Design/methodology/approach: A correlational research design and…
Descriptors: Formative Evaluation, Teacher Attitudes, English (Second Language), Second Language Learning
Peer reviewed
Mansolf, Maxwell; Vreeker, Annabel; Reise, Steven P.; Freimer, Nelson B.; Glahn, David C.; Gur, Raquel E.; Moore, Tyler M.; Pato, Carlos N.; Pato, Michele T.; Palotie, Aarno; Holm, Minna; Suvisaari, Jaana; Partonen, Timo; Kieseppä, Tuula; Paunio, Tiina; Boks, Marco; Kahn, René; Ophoff, Roel A.; Bearden, Carrie E.; Loohuis, Loes Olde; Teshiba, Terri; deGeorge, Daniella; Bilder, Robert M. – Educational and Psychological Measurement, 2020
Large-scale studies spanning diverse project sites, populations, languages, and measurements are increasingly important to relate psychological to biological variables. National and international consortia already are collecting and executing mega-analyses on aggregated data from individuals, with different measures on each person. In this…
Descriptors: Item Response Theory, Data Analysis, Measurement, Validity
Peer reviewed
Yan, Xun; Staples, Shelley – Language Testing, 2020
The argument-based approach to validity (Kane, 2013) focuses on two steps: (1) making claims about the proposed interpretation and use of test scores as a coherent, interpretive argument; and (2) evaluating those claims based on theoretical and empirical evidence related to test performances and scores. This paper discusses the role of…
Descriptors: Writing Tests, Language Tests, Language Proficiency, Test Validity
Peer reviewed
Mihyun Son; Minsu Ha – Education and Information Technologies, 2025
Digital literacy is essential for scientific literacy in a digital world. Although the NGSS Practices include many activities that require digital literacy, most studies have examined digital literacy from a generic perspective rather than a curricular context. This study aimed to develop a self-report tool to measure elements of digital literacy…
Descriptors: Test Construction, Measures (Individuals), Digital Literacy, Scientific Literacy
Peer reviewed
Klingbeil, David A.; Van Norman, Ethan R.; Nelson, Peter M. – Journal of Behavioral Education, 2017
Single-case designs provide an established technology for evaluating the effects of academic interventions. Researchers interested in studying the long-term effects of reading interventions often use curriculum-based measures of reading (CBM-R) as they possess many of the desirable characteristics for use in a time-series design. The reliability…
Descriptors: Curriculum Based Assessment, Accuracy, Scores, Reading Skills
Peer reviewed
Kuhfeld, Megan; Domina, Thurston; Hanselman, Paul – AERA Open, 2019
The Stanford Educational Data Archive (SEDA) is the first data set to allow comparisons of district academic achievement and growth from Grades 3 to 8 across the United States, shining a light on the distribution of educational opportunities. This study describes a convergent validity analysis of the SEDA growth estimates in mathematics and…
Descriptors: Educational Research, Educational Assessment, Data Analysis, Archives
Peer reviewed
McCoy, Jan D.; Braun-Monegan, Jenelle; Bettesworth, Leanne; Tindal, Gerald – Journal of Education and Practice, 2015
While problem solving as an instructional technique is widely advocated, educators are often challenged in effectively assessing student skill in this area. Students failing to solve a problem might fail in any of several aspects of the effort. The purpose of this research was to validate a scaffolded technique for assessing problem solving in…
Descriptors: Middle School Students, Scaffolding (Teaching Technique), Problem Solving, Science Education
Peer reviewed
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Peer reviewed
Algozzine, Bob; Horner, Robert H.; Todd, Anne W.; Newton, J. Stephen; Algozzine, Kate; Cusumano, Dale – Journal of Psychoeducational Assessment, 2016
Although there is a strong legislative base and perceived efficacy for multidisciplinary team decision making, limited evidence supports its effectiveness or consistency of implementation in practice. In recent research, we used the Decision Observation, Recording, and Analysis (DORA) tool to document activities and adult behaviors during positive…
Descriptors: Problem Solving, Participative Decision Making, Positive Behavior Supports, Meetings
Peer reviewed
Castillo, Jose M.; Dedrick, Robert F.; Stockslager, Kevin M.; March, Amanda L.; Hines, Constance V.; Tan, Sim Yin – Journal of Applied School Psychology, 2015
This article presents information on the development and initial validation of the 16-item Response to Intervention (RTI) Beliefs Scale. The scale is designed to measure the extent to which educators working in schools hold beliefs consistent with the tenets of RTI. The authors administered the instrument to 2,430 educators in 62 elementary…
Descriptors: Response to Intervention, Teacher Attitudes, Test Construction, Test Validity
Peer reviewed
Tengberg, Michael – Language Assessment Quarterly, 2018
Reading comprehension is often treated as a multidimensional construct. In many reading tests, items are distributed over reading process categories to represent the subskills expected to constitute comprehension. This study explores (a) the extent to which specified subskills of reading comprehension tests are conceptually conceivable to…
Descriptors: Reading Tests, Reading Comprehension, Scores, Test Results
Peer reviewed
Long, Avizia Y.; Shin, Sun-Young; Geeslin, Kimberly; Willis, Erik W. – Language Learning & Technology, 2018
In response to the need for examples of test validation from which everyday language programs can benefit, this paper reports on a study that used Bachman's (2005) assessment use argument (AUA) framework to examine evidence to support claims made about the intended interpretations and uses of scores based on a new web-based Spanish language…
Descriptors: Second Language Instruction, Second Language Learning, Spanish, Computer Assisted Testing