ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	22

Descriptor

Generalizability Theory	37
Test Reliability	37
Test Validity	37
Evaluation Methods	10
Psychometrics	10
Interrater Reliability	9
Performance Based Assessment	7
Statistical Analysis	7
Test Construction	7
Academic Achievement	5
Factor Analysis	5
Item Response Theory	5
Scores	5
Student Evaluation	5
Teacher Evaluation	5
Test Use	5
Error of Measurement	4
Factor Structure	4
Higher Education	4
Item Analysis	4
Measurement Techniques	4
Measures (Individuals)	4
Observation	4
Data Collection	3
Decision Making	3
More ▼

Publication Type

Reports - Research	25
Journal Articles	17
Speeches/Meeting Papers	7
Reports - Evaluative	6
Information Analyses	2
Reports - Descriptive	2
Books	1
Collected Works - General	1
Dissertations/Theses -…	1
Non-Print Media	1
Numerical/Quantitative Data	1
Opinion Papers	1
Reference Materials - General	1
More ▼

Education Level

Higher Education	7
Elementary Education	3
Postsecondary Education	3
Secondary Education	2
Early Childhood Education	1
Grade 10	1
Grade 3	1
Grade 4	1
Junior High Schools	1
Middle Schools	1
Two Year Colleges	1
More ▼

Audience

Researchers	4
Policymakers	1

Location

California	1
Colorado	1
Cyprus	1
Georgia	1
Michigan	1
Norway	1
Pennsylvania	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

Cognitive Abilities Test	1
Group Assessment of Logical…	1
National Survey of Student…	1
SAT (College Admission Test)	1
Stages of Concern…	1
Strengths and Difficulties…	1
Teacher Performance…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 37 results Save | Export

How Not to Fool Ourselves about Heterogeneity of Treatment Effects. EdWorkingPaper No. 25-1116

Download full text

Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025

Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…

Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences

Validity. Improving Literacy Brief: Understanding Screening

Direct link

Petscher, Y.; Pentimonti, J.; Stanley, C. – National Center on Improving Literacy, 2019

Validity is broadly defined as how well something measures what it's supposed to measure. The reliability and validity of scores from assessments are two concepts that are closely knit together and feed into each other.

Descriptors: Screening Tests, Scores, Test Validity, Test Reliability

Structural Validity, Internal Consistency, and Rater Reliability of the Modified Barium Swallow Impairment Profile: Breaking Ground on a 52,726-Patient, Clinical Data Set

Peer reviewed

Direct link

Clain, Alex E.; Alkhuwaiter, Munirah; Davidson, Kate; Martin-Harris, Bonnie – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The purpose of this study was to extend the assessment of the psychometric properties of the Modified Barium Swallow Impairment Profile (MBSImP). Here, we re-examined structural validity and internal consistency using a large clinical-registry data set and formally examined rater reliability in a smaller data set. Method: This study…

Descriptors: Diagnostic Tests, Disability Identification, Physical Disabilities, Eating Disorders

Validation of the Child Observation Record Advantage 1.5 Assessment Tool for Preschool Children: A Multilevel Bifactor Modeling Approach

Peer reviewed

Direct link

Akaeze, Hope O.; Wu, Jamie Heng-Chieh; Lawrence, Frank R.; Weber, Everett P. – Journal of Psychoeducational Assessment, 2023

This paper reports an investigation into the psychometric properties of the COR-Advantage1.5 (COR-Adv1.5) assessment tool, a criterion-referenced observation-based instrument designed to assess the developmental abilities of children from birth through kindergarten. Using data from 8534 children participating in a state-funded preschool program…

Descriptors: Criterion Referenced Tests, Evaluation Methods, Measures (Individuals), Measurement Techniques

Survey of Evidence in Education for Schools (SEE-S) Technical Report

Download full text

May, Henry; Blackman, Horatio; Van Horne, Sam; Tilley, Katherine; Farley-Ripple, Elizabeth N.; Shewchuk, Samantha; Agboh, Darren; Micklos, Deborah Amsden – Center for Research Use in Education, 2022

In this technical report, the Center for Research Use in Education (CRUE) presents the methodological design of a large-scale quantitative investigation of research use by school-based practitioners through the "Survey of Evidence in Education for Schools (SEE-S)." It documents the major technical aspects of the development of SEE-S,…

Descriptors: Surveys, Schools, Educational Research, Research Utilization

The Dependability of the Updated NSSE: A Generalizability Study

Peer reviewed
PDF on ERIC

Download full text

Fosnacht, Kevin; Gonyea, Robert M. – Research & Practice in Assessment, 2018

This study utilized generalizability theory to assess the context where the National Survey of Student Engagement's (NSSE) summary measures, the Engagement Indicators, produce dependable group-level means. The dependability of NSSE group means is an important topic for the higher education assessment community given its wide utilization and usage…

Descriptors: College Freshmen, College Seniors, Learner Engagement, National Surveys

The Exchangeability of Brief Intelligence Tests for Children with Intellectual Giftedness: Illuminating Error Variance Components' Influence on IQs

Peer reviewed

Direct link

Irby, Sarah M.; Floyd, Randy G. – Psychology in the Schools, 2017

This study examined the exchangeability of total scores (i.e., intelligent quotients [IQs]) from three brief intelligence tests. Tests were administered to 36 children with intellectual giftedness, scored live by one set of primary examiners and later scored by a secondary examiner. For each student, six IQs were calculated, and all 216 values…

Descriptors: Intelligence Tests, Gifted, Error of Measurement, Scores

Exploring the Reliability of Generic and Content-Specific Instructional Aspects in Physical Education Lessons

Peer reviewed

Direct link

Charalambous, Charalambos Y.; Kyriakides, Ermis; Tsangaridou, Niki; Kyriakides, Leonidas – School Effectiveness and School Improvement, 2017

Heightened accountability pressures and an increased emphasis on teaching quality have directed scholarly attention to scrutinizing instruction, particularly with respect to issues of validity and reliability. However, these attempts have largely been directed toward "core" content areas and investigated generic or content-specific…

Descriptors: Physical Education, Instructional Effectiveness, Lesson Plans, Interrater Reliability

Measuring Afterschool Program Quality Using Setting-Level Observational Approaches

Peer reviewed

Direct link

Oh, Yoonkyung; Osgood, D. Wayne; Smith, Emilie P. – Journal of Early Adolescence, 2015

The importance of afterschool hours for youth development is widely acknowledged, and afterschool settings have recently received increasing attention as an important venue for youth interventions, bringing a growing need for reliable and valid measures of afterschool quality. This study examined the extent to which the two observational tools,…

Descriptors: After School Programs, Program Effectiveness, Observation, Rating Scales

Sources of Variance in Special Educator Observation Rubrics

Peer reviewed
PDF on ERIC

Download full text

Crawford, Angela R.; Johnson, Evelyn S.; Moylan, Laura A.; Zheng, Yuzhu – Grantee Submission, 2018

This study describes the development and initial psychometric evaluation of a Recognizing Effective Special Education Teachers (RESET) teacher observation instrument. Specifically, the study uses generalizability theory to compare two versions of a rubric, one with general descriptors of performance levels and one with item-specific descriptors of…

Descriptors: Special Education Teachers, Direct Instruction, Observation, Teaching Methods

Conceptualizing Essay Tests' Reliability and Validity: From Research to Theory

Download full text

Badjadi, Nour El Imane – Online Submission, 2013

The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…

Descriptors: Essay Tests, Writing Evaluation, Test Validity, Test Reliability

The Student Risk Screening Scale for Early Childhood: An Initial Validation Study

Peer reviewed

Direct link

Lane, Kathleen Lynne; Oakes, Wendy Peia; Menzies, Holly Mariah; Major, Rebecca; Allegra, Laurie; Powers, Lisa; Schatschneider, Chris – Topics in Early Childhood Special Education, 2015

We report findings of two exploratory validation studies of a revised instrument: the "Student Risk Screening Scale for Early Childhood" version (SRSS-EC). The SRSS-EC was modified to reflect characteristics of externalizing and internalizing behaviors manifested by preschool-age children. In Study 1, we explored the reliability of…

Descriptors: Screening Tests, At Risk Students, Early Childhood Education, Rating Scales

Psychometric Analysis of the Thermochemistry Concept Inventory

Peer reviewed

Direct link

Wren, David; Barbera, Jack – Chemistry Education Research and Practice, 2014

Assessing conceptual understanding of foundational topics before instruction on higher-order concepts can provide chemical educators with information to aid instructional design. This study provides an instrument that can be used to identify students' alternative conceptions regarding thermochemistry concepts. The Thermochemistry Concept Inventory…

Descriptors: Psychometrics, Thermodynamics, Chemistry, Item Response Theory

Development of a Valid and Reliable Student-Achievement and Process-Skills Instrument

Peer reviewed

Direct link

Bunce, Diane M.; VandenPlas, Jessica R.; Neiles, Kelly Y.; Flens, Elizabeth A. – Journal of College Science Teaching, 2010

Development of a research instrument to measure student achievement requires planning and reliability and validity testing before the instrument is used to collect data. These steps are often overlooked in research studies, but when the instrument is to be used across a wider population, the inclusion of these steps is vital to address the…

Descriptors: Academic Achievement, Measures (Individuals), Science Process Skills, Test Reliability

Multigroup Generalizability Analysis of Verbal, Quantitative, and Nonverbal Ability Tests for Culturally and Linguistically Diverse Students

Peer reviewed

Direct link

Lakin, Joni M.; Lai, Emily R. – Educational and Psychological Measurement, 2012

For educators seeking to differentiate instruction, cognitive ability tests sampling multiple content domains, including verbal, quantitative, and nonverbal reasoning, provide superior information about student strengths and weaknesses compared with unidimensional reasoning measures. However, these ability tests have not been fully evaluated with…

Descriptors: Aptitude Tests, Nonverbal Ability, Cognitive Ability, Verbal Ability

Previous Page | Next Page »

Pages: 1 | 2 | 3

Educational and Psychological…	2
Advances in Physiology…	1
Annenberg Institute for…	1
Center for Research Use in…	1
Chemistry Education Research…	1
College Board	1
Grantee Submission	1
Intelligence	1
International Journal of…	1
Journal of College Science…	1
Journal of Early Adolescence	1
Journal of Psychoeducational…	1
Journal of Special Education	1
Journal of Speech, Language,…	1
Library and Information…	1
National Center on Improving…	1
Online Submission	1
Pearson	1
ProQuest LLC	1
Psychology in the Schools	1
Research & Practice in…	1
Review of Educational Research	1
Routledge, Taylor & Francis…	1
School Effectiveness and…	1
Topics in Early Childhood…	1
More ▼

Agboh, Darren	1
Akaeze, Hope O.	1
Alkhuwaiter, Munirah	1
Allegra, Laurie	1
Badjadi, Nour El Imane	1
Barbera, Jack	1
Blackman, Horatio	1
Brendan A. Schuetze	1
Brockx, Bert	1
Bunce, Diane M.	1
Capie, William	1
Charalambous, Charalambos Y.	1
Chi, Youngshin	1
Clain, Alex E.	1
Crawford, Angela R.	1
Cronin, Linda	1
Davidson, Kate	1
Denison, D. Brian, Ed.	1
Farley-Ripple, Elizabeth N.	1
Flens, Elizabeth A.	1
Floyd, Randy G.	1
Follesdal, Hallvard	1
Fosnacht, Kevin	1
Garcia, Raymond E.	1
More ▼