ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	10

Descriptor

Generalizability Theory	14
Psychometrics	14
Test Reliability	14
Test Validity	10
Factor Analysis	5
Item Response Theory	5
Measures (Individuals)	3
Observation	3
Special Education	3
Child Development	2
Construct Validity	2
Criterion Referenced Tests	2
Early Childhood Education	2
Error of Measurement	2
Evidence	2
Factor Structure	2
Foreign Countries	2
Goodness of Fit	2
Intelligence Tests	2
Interrater Reliability	2
Longitudinal Studies	2
Measurement Techniques	2
Performance Based Assessment	2
Predictive Validity	2
Preschool Children	2
More ▼

Source

Chemistry Education Research…	1
College Board	1
Educational and Psychological…	1
Grantee Submission	1
Intelligence	1
International Journal of…	1
Journal of Early Adolescence	1
Journal of Educational…	1
Journal of Psychoeducational…	1
Online Submission	1
Society for Research on…	1
Topics in Early Childhood…	1
More ▼

Publication Type

Reports - Research	12
Journal Articles	8
Information Analyses	1
Non-Print Media	1
Reference Materials - General	1
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Higher Education	3
Elementary Education	2
Postsecondary Education	2
Early Childhood Education	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

Colorado	1
Michigan	1
Norway	1
Pennsylvania	1
South Korea	1

Laws, Policies, & Programs

Assessments and Surveys

Battelle Developmental…	1
Dynamic Indicators of Basic…	1
SAT (College Admission Test)	1
Stanford Binet Intelligence…	1
Strengths and Difficulties…	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Quantile Reliability: Beyond Global Estimates of Internal Consistency

Peer reviewed

Direct link

Jeffrey Shero; Jessica Logan – Society for Research on Educational Effectiveness, 2024

Background/Context: Previous research in educational assessment has consistently emphasized the importance of reliability as a cornerstone of test quality. Traditional measures of reliability, such as test-retest and split-half reliability, offer a broad view of how internally consistent a measure is but overlook the variability in this internal…

Descriptors: Educational Assessment, Special Education, Students with Disabilities, Learning Disabilities

Validation of the Child Observation Record Advantage 1.5 Assessment Tool for Preschool Children: A Multilevel Bifactor Modeling Approach

Peer reviewed

Direct link

Akaeze, Hope O.; Wu, Jamie Heng-Chieh; Lawrence, Frank R.; Weber, Everett P. – Journal of Psychoeducational Assessment, 2023

This paper reports an investigation into the psychometric properties of the COR-Advantage1.5 (COR-Adv1.5) assessment tool, a criterion-referenced observation-based instrument designed to assess the developmental abilities of children from birth through kindergarten. Using data from 8534 children participating in a state-funded preschool program…

Descriptors: Criterion Referenced Tests, Evaluation Methods, Measures (Individuals), Measurement Techniques

Psychometric Properties of MATE: A Study Focused on Testing the Generalizability of the Measure of Acceptance of the Theory of Evolution

Peer reviewed

Direct link

Sya'bandari, Yustika; Rachmatullah, Arif; Ha, Minsu – International Journal of Science Education, 2021

The Measure of Acceptance of the Theory of Evolution (MATE) has been extensively used in science education research for more than two decades. This study examines the fairness of MATE items based on religious convictions and academic majors. The multidimensional item response theory and differential item functioning analyses were run on data…

Descriptors: Attitude Measures, Scientific Attitudes, Evolution, Adoption (Ideas)

Measuring Afterschool Program Quality Using Setting-Level Observational Approaches

Peer reviewed

Direct link

Oh, Yoonkyung; Osgood, D. Wayne; Smith, Emilie P. – Journal of Early Adolescence, 2015

The importance of afterschool hours for youth development is widely acknowledged, and afterschool settings have recently received increasing attention as an important venue for youth interventions, bringing a growing need for reliable and valid measures of afterschool quality. This study examined the extent to which the two observational tools,…

Descriptors: After School Programs, Program Effectiveness, Observation, Rating Scales

Sources of Variance in Special Educator Observation Rubrics

Peer reviewed
PDF on ERIC

Download full text

Crawford, Angela R.; Johnson, Evelyn S.; Moylan, Laura A.; Zheng, Yuzhu – Grantee Submission, 2018

This study describes the development and initial psychometric evaluation of a Recognizing Effective Special Education Teachers (RESET) teacher observation instrument. Specifically, the study uses generalizability theory to compare two versions of a rubric, one with general descriptors of performance levels and one with item-specific descriptors of…

Descriptors: Special Education Teachers, Direct Instruction, Observation, Teaching Methods

Conceptualizing Essay Tests' Reliability and Validity: From Research to Theory

Download full text

Badjadi, Nour El Imane – Online Submission, 2013

The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…

Descriptors: Essay Tests, Writing Evaluation, Test Validity, Test Reliability

The Student Risk Screening Scale for Early Childhood: An Initial Validation Study

Peer reviewed

Direct link

Lane, Kathleen Lynne; Oakes, Wendy Peia; Menzies, Holly Mariah; Major, Rebecca; Allegra, Laurie; Powers, Lisa; Schatschneider, Chris – Topics in Early Childhood Special Education, 2015

We report findings of two exploratory validation studies of a revised instrument: the "Student Risk Screening Scale for Early Childhood" version (SRSS-EC). The SRSS-EC was modified to reflect characteristics of externalizing and internalizing behaviors manifested by preschool-age children. In Study 1, we explored the reliability of…

Descriptors: Screening Tests, At Risk Students, Early Childhood Education, Rating Scales

Psychometric Analysis of the Thermochemistry Concept Inventory

Peer reviewed

Direct link

Wren, David; Barbera, Jack – Chemistry Education Research and Practice, 2014

Assessing conceptual understanding of foundational topics before instruction on higher-order concepts can provide chemical educators with information to aid instructional design. This study provides an instrument that can be used to identify students' alternative conceptions regarding thermochemistry concepts. The Thermochemistry Concept Inventory…

Descriptors: Psychometrics, Thermodynamics, Chemistry, Item Response Theory

Emotional Intelligence: The MSCEIT from the Perspective of Generalizability Theory

Peer reviewed

Direct link

Follesdal, Hallvard; Hagtvet, Knut A. – Intelligence, 2009

The Mayer, Salovey, & Caruso Emotional Intelligence Test (MSCEIT) has been reported to provide reliable scores for the four-branch ability model of emotional intelligence [Mayer, J. D., Salovey, P., & Caruso, D. R. (2002). "Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT). User's manual." Toronto, Canada: Multi-Health…

Descriptors: Emotional Intelligence, Intelligence Tests, Adults, Error of Measurement

Select Psychometric Properties and Predictive Validity of Scores on the SAT Writing Section

Download full text

Proctor, Thomas P.; Kim, YoungKoung Rachel – College Board, 2009

Presented at the national conference for the American Educational Research Association (AERA) in April 2009. This study examined the utility of scores on the SAT writing test, specifically examining the reliability of scores using generalizability and item response theories. The study also provides an overview of current predictive validity…

Descriptors: College Entrance Examinations, Writing Tests, Psychometrics, Predictive Validity

A Study of Four Psychometric Properties of the Jenkins Activity Survey Type. A Scale with Suggested Modifications and Validation.

Peer reviewed

Shipper, Frank; And Others – Educational and Psychological Measurement, 1986

Despite much evidence that the Jenkins Activity Survey Type A (JAS Type A) scale is lacking in essential psychometric properties, it continues to be widely used for measuring coronary-prone behavior. Four psychometric properties of the scale were assessed. The scale failed to satisfy accepted reliability and validity criteria. (Author/JAZ)

Descriptors: Behavior Rating Scales, Factor Analysis, Generalizability Theory, Measurement Techniques

The Role of Reliability in Criterion-Referenced Tests.

Peer reviewed

Kane, Michael T. – Journal of Educational Measurement, 1986

These analyses suggest that if a criterion-referenced test had a reliability (defined in terms of internal consistency) below 0.5, a simple a priori procedure would provide better estimates of students' universe scores than would individual observed scores. (Author/LMO)

Descriptors: Criterion Referenced Tests, Educational Research, Error of Measurement, Generalizability Theory

Technical Issues in Large-Scale Performance Assessment.

Download full text

Phillips, Gary W., Ed. – 1996

Recently, there has been a significant expansion in the use of performance assessment in large scale testing programs. Although there has been significant support from curriculum and policy stakeholders, the technical feasibility of large scale performance assessments has remained a question. This report is intended to contribute to the debate by…

Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics

Establishing the Psychometric Integrity of the Battelle Developmental Inventory for Young Children with Disabilities.

Lawson, Stephen; And Others – 1991

Early childhood special educators recognize the necessity of establishing indices of reliability and validity for instruments that provide an index of developmental status. Many such instruments present little empirical evidence regarding psychometric integrity, particularly for a non-normative sample. The 341-item Battelle Developmental Inventory…

Descriptors: Child Development, Construct Validity, Diagnostic Tests, Disabilities

Akaeze, Hope O.	1
Allegra, Laurie	1
Badjadi, Nour El Imane	1
Barbera, Jack	1
Crawford, Angela R.	1
Follesdal, Hallvard	1
Ha, Minsu	1
Hagtvet, Knut A.	1
Jeffrey Shero	1
Jessica Logan	1
Johnson, Evelyn S.	1
Kane, Michael T.	1
Kim, YoungKoung Rachel	1
Lane, Kathleen Lynne	1
Lawrence, Frank R.	1
Lawson, Stephen	1
Major, Rebecca	1
Menzies, Holly Mariah	1
Moylan, Laura A.	1
Oakes, Wendy Peia	1
Oh, Yoonkyung	1
Osgood, D. Wayne	1
Phillips, Gary W., Ed.	1
Powers, Lisa	1
More ▼