Gregory Chernov – Evaluation Review, 2025
Most existing solutions to the current replication crisis in science address only the factors stemming from specific poor research practices. We introduce a novel mechanism that leverages the experts' predictive abilities to analyze the root causes of replication failures. It is backed by the principle that the most accurate predictor is the most…
Descriptors: Replication (Evaluation), Prediction, Scientific Research, Failure
Jennifer Randall; Mya Poe; Maria Elena Oliveri; David Slomp – Educational Assessment, 2024
Traditional validation approaches fail to account for the ways oppressive systems (e.g., racism, radical nationalism) impact the test design and development process. To disrupt this legacy of white supremacy, we illustrate how a justice-oriented, antiracist validation (JAV) framework can be applied to construct articulation and validation, data…
Descriptors: Social Justice, Racism, Educational Assessment, Models
Curran, Patrick J.; Georgeson, A. R.; Bauer, Daniel J.; Hussong, Andrea M. – International Journal of Behavioral Development, 2021
Conducting valid and reliable empirical research in the prevention sciences is an inherently difficult and challenging task. Chief among the challenges is the need to obtain numerical scores of underlying theoretical constructs for use in subsequent analysis. This challenge is further exacerbated by the increasingly common need to consider multiple…
Descriptors: Psychometrics, Scoring, Prevention, Scores
Maryam Atai-Tabar; Gholamreza Zareian; Seyyed Mohammad Reza Amirian; Seyyed Mohammad Reza Adel – Journal of Applied Research in Higher Education, 2024
Purpose: The purpose of this study was to ascertain the relationship between EFL teachers' perception of the intended and unintended consequences of formative assessment (FA) decisions and their sense of self-efficacy and anxiety toward data-driven decision-making (DDDM). Design/methodology/approach: A correlational research design and…
Descriptors: Formative Evaluation, Teacher Attitudes, English (Second Language), Second Language Learning
Mansolf, Maxwell; Vreeker, Annabel; Reise, Steven P.; Freimer, Nelson B.; Glahn, David C.; Gur, Raquel E.; Moore, Tyler M.; Pato, Carlos N.; Pato, Michele T.; Palotie, Aarno; Holm, Minna; Suvisaari, Jaana; Partonen, Timo; Kieseppä, Tuula; Paunio, Tiina; Boks, Marco; Kahn, René; Ophoff, Roel A.; Bearden, Carrie E.; Loohuis, Loes Olde; Teshiba, Terri; deGeorge, Daniella; Bilder, Robert M. – Educational and Psychological Measurement, 2020
Large-scale studies spanning diverse project sites, populations, languages, and measurements are increasingly important to relate psychological to biological variables. National and international consortia already are collecting and executing mega-analyses on aggregated data from individuals, with different measures on each person. In this…
Descriptors: Item Response Theory, Data Analysis, Measurement, Validity
Yan, Xun; Staples, Shelley – Language Testing, 2020
The argument-based approach to validity (Kane, 2013) focuses on two steps: (1) making claims about the proposed interpretation and use of test scores as a coherent, interpretive argument; and (2) evaluating those claims based on theoretical and empirical evidence related to test performances and scores. This paper discusses the role of…
Descriptors: Writing Tests, Language Tests, Language Proficiency, Test Validity
Mihyun Son; Minsu Ha – Education and Information Technologies, 2025
Digital literacy is essential for scientific literacy in a digital world. Although the NGSS Practices include many activities that require digital literacy, most studies have examined digital literacy from a generic perspective rather than a curricular context. This study aimed to develop a self-report tool to measure elements of digital literacy…
Descriptors: Test Construction, Measures (Individuals), Digital Literacy, Scientific Literacy
Precision of Curriculum-Based Measurement Reading Data: Considerations for Multiple-Baseline Designs
Klingbeil, David A.; Van Norman, Ethan R.; Nelson, Peter M. – Journal of Behavioral Education, 2017
Single-case designs provide an established technology for evaluating the effects of academic interventions. Researchers interested in studying the long-term effects of reading interventions often use curriculum-based measures of reading (CBM-R) as they possess many of the desirable characteristics for use in a time-series design. The reliability…
Descriptors: Curriculum Based Assessment, Accuracy, Scores, Reading Skills
Kuhfeld, Megan; Domina, Thurston; Hanselman, Paul – AERA Open, 2019
The Stanford Educational Data Archive (SEDA) is the first data set to allow comparisons of district academic achievement and growth from Grades 3 to 8 across the United States, shining a light on the distribution of educational opportunities. This study describes a convergent validity analysis of the SEDA growth estimates in mathematics and…
Descriptors: Educational Research, Educational Assessment, Data Analysis, Archives
McCoy, Jan D.; Braun-Monegan, Jenelle; Bettesworth, Leanne; Tindal, Gerald – Journal of Education and Practice, 2015
While problem solving as an instructional technique is widely advocated, educators are often challenged in effectively assessing student skill in this area. Students failing to solve a problem might fail in any of several aspects of the effort. The purpose of this research was to validate a scaffolded technique for assessing problem solving in…
Descriptors: Middle School Students, Scaffolding (Teaching Technique), Problem Solving, Science Education
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Algozzine, Bob; Horner, Robert H.; Todd, Anne W.; Newton, J. Stephen; Algozzine, Kate; Cusumano, Dale – Journal of Psychoeducational Assessment, 2016
Although there is a strong legislative base and perceived efficacy for multidisciplinary team decision making, limited evidence supports its effectiveness or consistency of implementation in practice. In recent research, we used the Decision Observation, Recording, and Analysis (DORA) tool to document activities and adult behaviors during positive…
Descriptors: Problem Solving, Participative Decision Making, Positive Behavior Supports, Meetings
Castillo, Jose M.; Dedrick, Robert F.; Stockslager, Kevin M.; March, Amanda L.; Hines, Constance V.; Tan, Sim Yin – Journal of Applied School Psychology, 2015
This article presents information on the development and initial validation of the 16-item Response to Intervention (RTI) Beliefs Scale. The scale is designed to measure the extent to which educators working in schools hold beliefs consistent with the tenets of RTI. The authors administered the instrument to 2,430 educators in 62 elementary…
Descriptors: Response to Intervention, Teacher Attitudes, Test Construction, Test Validity
Tengberg, Michael – Language Assessment Quarterly, 2018
Reading comprehension is often treated as a multidimensional construct. In many reading tests, items are distributed over reading process categories to represent the subskills expected to constitute comprehension. This study explores (a) the extent to which specified subskills of reading comprehension tests are conceptually conceivable to…
Descriptors: Reading Tests, Reading Comprehension, Scores, Test Results
Long, Avizia Y.; Shin, Sun-Young; Geeslin, Kimberly; Willis, Erik W. – Language Learning & Technology, 2018
In response to the need for examples of test validation from which everyday language programs can benefit, this paper reports on a study that used Bachman's (2005) assessment use argument (AUA) framework to examine evidence to support claims made about the intended interpretations and uses of scores based on a new web-based Spanish language…
Descriptors: Second Language Instruction, Second Language Learning, Spanish, Computer Assisted Testing