Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 22 |
Descriptor
Source
Author
Agboh, Darren | 1 |
Akaeze, Hope O. | 1 |
Alkhuwaiter, Munirah | 1 |
Allegra, Laurie | 1 |
Badjadi, Nour El Imane | 1 |
Barbera, Jack | 1 |
Blackman, Horatio | 1 |
Brendan A. Schuetze | 1 |
Brockx, Bert | 1 |
Bunce, Diane M. | 1 |
Capie, William | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 7 |
Elementary Education | 3 |
Postsecondary Education | 3 |
Secondary Education | 2 |
Early Childhood Education | 1 |
Grade 10 | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Two Year Colleges | 1 |
More ▼ |
Audience
Researchers | 4 |
Policymakers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Cognitive Abilities Test | 1 |
Group Assessment of Logical… | 1 |
National Survey of Student… | 1 |
SAT (College Admission Test) | 1 |
Stages of Concern… | 1 |
Strengths and Difficulties… | 1 |
Teacher Performance… | 1 |
What Works Clearinghouse Rating
Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025
Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…
Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences
Petscher, Y.; Pentimonti, J.; Stanley, C. – National Center on Improving Literacy, 2019
Validity is broadly defined as how well something measures what it's supposed to measure. The reliability and validity of scores from assessments are two concepts that are closely knit together and feed into each other.
Descriptors: Screening Tests, Scores, Test Validity, Test Reliability
Clain, Alex E.; Alkhuwaiter, Munirah; Davidson, Kate; Martin-Harris, Bonnie – Journal of Speech, Language, and Hearing Research, 2022
Purpose: The purpose of this study was to extend the assessment of the psychometric properties of the Modified Barium Swallow Impairment Profile (MBSImP). Here, we re-examined structural validity and internal consistency using a large clinical-registry data set and formally examined rater reliability in a smaller data set. Method: This study…
Descriptors: Diagnostic Tests, Disability Identification, Physical Disabilities, Eating Disorders
Akaeze, Hope O.; Wu, Jamie Heng-Chieh; Lawrence, Frank R.; Weber, Everett P. – Journal of Psychoeducational Assessment, 2023
This paper reports an investigation into the psychometric properties of the COR-Advantage1.5 (COR-Adv1.5) assessment tool, a criterion-referenced observation-based instrument designed to assess the developmental abilities of children from birth through kindergarten. Using data from 8534 children participating in a state-funded preschool program…
Descriptors: Criterion Referenced Tests, Evaluation Methods, Measures (Individuals), Measurement Techniques
May, Henry; Blackman, Horatio; Van Horne, Sam; Tilley, Katherine; Farley-Ripple, Elizabeth N.; Shewchuk, Samantha; Agboh, Darren; Micklos, Deborah Amsden – Center for Research Use in Education, 2022
In this technical report, the Center for Research Use in Education (CRUE) presents the methodological design of a large-scale quantitative investigation of research use by school-based practitioners through the "Survey of Evidence in Education for Schools (SEE-S)." It documents the major technical aspects of the development of SEE-S,…
Descriptors: Surveys, Schools, Educational Research, Research Utilization
Fosnacht, Kevin; Gonyea, Robert M. – Research & Practice in Assessment, 2018
This study utilized generalizability theory to assess the context where the National Survey of Student Engagement's (NSSE) summary measures, the Engagement Indicators, produce dependable group-level means. The dependability of NSSE group means is an important topic for the higher education assessment community given its wide utilization and usage…
Descriptors: College Freshmen, College Seniors, Learner Engagement, National Surveys
Irby, Sarah M.; Floyd, Randy G. – Psychology in the Schools, 2017
This study examined the exchangeability of total scores (i.e., intelligent quotients [IQs]) from three brief intelligence tests. Tests were administered to 36 children with intellectual giftedness, scored live by one set of primary examiners and later scored by a secondary examiner. For each student, six IQs were calculated, and all 216 values…
Descriptors: Intelligence Tests, Gifted, Error of Measurement, Scores
Charalambous, Charalambos Y.; Kyriakides, Ermis; Tsangaridou, Niki; Kyriakides, Leonidas – School Effectiveness and School Improvement, 2017
Heightened accountability pressures and an increased emphasis on teaching quality have directed scholarly attention to scrutinizing instruction, particularly with respect to issues of validity and reliability. However, these attempts have largely been directed toward "core" content areas and investigated generic or content-specific…
Descriptors: Physical Education, Instructional Effectiveness, Lesson Plans, Interrater Reliability
Oh, Yoonkyung; Osgood, D. Wayne; Smith, Emilie P. – Journal of Early Adolescence, 2015
The importance of afterschool hours for youth development is widely acknowledged, and afterschool settings have recently received increasing attention as an important venue for youth interventions, bringing a growing need for reliable and valid measures of afterschool quality. This study examined the extent to which the two observational tools,…
Descriptors: After School Programs, Program Effectiveness, Observation, Rating Scales
Crawford, Angela R.; Johnson, Evelyn S.; Moylan, Laura A.; Zheng, Yuzhu – Grantee Submission, 2018
This study describes the development and initial psychometric evaluation of a Recognizing Effective Special Education Teachers (RESET) teacher observation instrument. Specifically, the study uses generalizability theory to compare two versions of a rubric, one with general descriptors of performance levels and one with item-specific descriptors of…
Descriptors: Special Education Teachers, Direct Instruction, Observation, Teaching Methods
Badjadi, Nour El Imane – Online Submission, 2013
The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…
Descriptors: Essay Tests, Writing Evaluation, Test Validity, Test Reliability
Lane, Kathleen Lynne; Oakes, Wendy Peia; Menzies, Holly Mariah; Major, Rebecca; Allegra, Laurie; Powers, Lisa; Schatschneider, Chris – Topics in Early Childhood Special Education, 2015
We report findings of two exploratory validation studies of a revised instrument: the "Student Risk Screening Scale for Early Childhood" version (SRSS-EC). The SRSS-EC was modified to reflect characteristics of externalizing and internalizing behaviors manifested by preschool-age children. In Study 1, we explored the reliability of…
Descriptors: Screening Tests, At Risk Students, Early Childhood Education, Rating Scales
Wren, David; Barbera, Jack – Chemistry Education Research and Practice, 2014
Assessing conceptual understanding of foundational topics before instruction on higher-order concepts can provide chemical educators with information to aid instructional design. This study provides an instrument that can be used to identify students' alternative conceptions regarding thermochemistry concepts. The Thermochemistry Concept Inventory…
Descriptors: Psychometrics, Thermodynamics, Chemistry, Item Response Theory
Bunce, Diane M.; VandenPlas, Jessica R.; Neiles, Kelly Y.; Flens, Elizabeth A. – Journal of College Science Teaching, 2010
Development of a research instrument to measure student achievement requires planning and reliability and validity testing before the instrument is used to collect data. These steps are often overlooked in research studies, but when the instrument is to be used across a wider population, the inclusion of these steps is vital to address the…
Descriptors: Academic Achievement, Measures (Individuals), Science Process Skills, Test Reliability
Lakin, Joni M.; Lai, Emily R. – Educational and Psychological Measurement, 2012
For educators seeking to differentiate instruction, cognitive ability tests sampling multiple content domains, including verbal, quantitative, and nonverbal reasoning, provide superior information about student strengths and weaknesses compared with unidimensional reasoning measures. However, these ability tests have not been fully evaluated with…
Descriptors: Aptitude Tests, Nonverbal Ability, Cognitive Ability, Verbal Ability