Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 10 |
Descriptor
Generalizability Theory | 14 |
Psychometrics | 14 |
Test Reliability | 14 |
Test Validity | 10 |
Factor Analysis | 5 |
Item Response Theory | 5 |
Measures (Individuals) | 3 |
Observation | 3 |
Special Education | 3 |
Child Development | 2 |
Construct Validity | 2 |
Author
Akaeze, Hope O. | 1 |
Allegra, Laurie | 1 |
Badjadi, Nour El Imane | 1 |
Barbera, Jack | 1 |
Crawford, Angela R. | 1 |
Follesdal, Hallvard | 1 |
Ha, Minsu | 1 |
Hagtvet, Knut A. | 1 |
Jeffrey Shero | 1 |
Jessica Logan | 1 |
Johnson, Evelyn S. | 1 |
Publication Type
Reports - Research | 12 |
Journal Articles | 8 |
Information Analyses | 1 |
Non-Print Media | 1 |
Reference Materials - General | 1 |
Reports - Descriptive | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 3 |
Elementary Education | 2 |
Postsecondary Education | 2 |
Early Childhood Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Location
Colorado | 1 |
Michigan | 1 |
Norway | 1 |
Pennsylvania | 1 |
South Korea | 1 |
Assessments and Surveys
Battelle Developmental… | 1 |
Dynamic Indicators of Basic… | 1 |
SAT (College Admission Test) | 1 |
Stanford Binet Intelligence… | 1 |
Strengths and Difficulties… | 1 |
Shero, Jeffrey; Logan, Jessica – Society for Research on Educational Effectiveness, 2024
Background/Context: Previous research in educational assessment has consistently emphasized the importance of reliability as a cornerstone of test quality. Traditional measures of reliability, such as test-retest and split-half reliability, offer a broad view of how internally consistent a measure is but overlook the variability in this internal…
Descriptors: Educational Assessment, Special Education, Students with Disabilities, Learning Disabilities
Akaeze, Hope O.; Wu, Jamie Heng-Chieh; Lawrence, Frank R.; Weber, Everett P. – Journal of Psychoeducational Assessment, 2023
This paper reports an investigation into the psychometric properties of the COR-Advantage1.5 (COR-Adv1.5) assessment tool, a criterion-referenced observation-based instrument designed to assess the developmental abilities of children from birth through kindergarten. Using data from 8534 children participating in a state-funded preschool program…
Descriptors: Criterion Referenced Tests, Evaluation Methods, Measures (Individuals), Measurement Techniques
Sya'bandari, Yustika; Rachmatullah, Arif; Ha, Minsu – International Journal of Science Education, 2021
The Measure of Acceptance of the Theory of Evolution (MATE) has been extensively used in science education research for more than two decades. This study examines the fairness of MATE items based on religious convictions and academic majors. The multidimensional item response theory and differential item functioning analyses were run on data…
Descriptors: Attitude Measures, Scientific Attitudes, Evolution, Adoption (Ideas)
Oh, Yoonkyung; Osgood, D. Wayne; Smith, Emilie P. – Journal of Early Adolescence, 2015
The importance of afterschool hours for youth development is widely acknowledged, and afterschool settings have recently received increasing attention as an important venue for youth interventions, bringing a growing need for reliable and valid measures of afterschool quality. This study examined the extent to which the two observational tools,…
Descriptors: After School Programs, Program Effectiveness, Observation, Rating Scales
Crawford, Angela R.; Johnson, Evelyn S.; Moylan, Laura A.; Zheng, Yuzhu – Grantee Submission, 2018
This study describes the development and initial psychometric evaluation of a Recognizing Effective Special Education Teachers (RESET) teacher observation instrument. Specifically, the study uses generalizability theory to compare two versions of a rubric, one with general descriptors of performance levels and one with item-specific descriptors of…
Descriptors: Special Education Teachers, Direct Instruction, Observation, Teaching Methods
Badjadi, Nour El Imane – Online Submission, 2013
The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…
Descriptors: Essay Tests, Writing Evaluation, Test Validity, Test Reliability
Lane, Kathleen Lynne; Oakes, Wendy Peia; Menzies, Holly Mariah; Major, Rebecca; Allegra, Laurie; Powers, Lisa; Schatschneider, Chris – Topics in Early Childhood Special Education, 2015
We report findings of two exploratory validation studies of a revised instrument: the "Student Risk Screening Scale for Early Childhood" version (SRSS-EC). The SRSS-EC was modified to reflect characteristics of externalizing and internalizing behaviors manifested by preschool-age children. In Study 1, we explored the reliability of…
Descriptors: Screening Tests, At Risk Students, Early Childhood Education, Rating Scales
Wren, David; Barbera, Jack – Chemistry Education Research and Practice, 2014
Assessing conceptual understanding of foundational topics before instruction on higher-order concepts can provide chemical educators with information to aid instructional design. This study provides an instrument that can be used to identify students' alternative conceptions regarding thermochemistry concepts. The Thermochemistry Concept Inventory…
Descriptors: Psychometrics, Thermodynamics, Chemistry, Item Response Theory
Follesdal, Hallvard; Hagtvet, Knut A. – Intelligence, 2009
The Mayer, Salovey, & Caruso Emotional Intelligence Test (MSCEIT) has been reported to provide reliable scores for the four-branch ability model of emotional intelligence [Mayer, J. D., Salovey, P., & Caruso, D. R. (2002). "Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT). User's manual." Toronto, Canada: Multi-Health…
Descriptors: Emotional Intelligence, Intelligence Tests, Adults, Error of Measurement
Proctor, Thomas P.; Kim, YoungKoung Rachel – College Board, 2009
Presented at the national conference for the American Educational Research Association (AERA) in April 2009. This study examined the utility of scores on the SAT writing test, specifically examining the reliability of scores using generalizability and item response theories. The study also provides an overview of current predictive validity…
Descriptors: College Entrance Examinations, Writing Tests, Psychometrics, Predictive Validity

Shipper, Frank; And Others – Educational and Psychological Measurement, 1986
Despite much evidence that the Jenkins Activity Survey Type A (JAS Type A) scale is lacking in essential psychometric properties, it continues to be widely used for measuring coronary-prone behavior. Four psychometric properties of the scale were assessed. The scale failed to satisfy accepted reliability and validity criteria. (Author/JAZ)
Descriptors: Behavior Rating Scales, Factor Analysis, Generalizability Theory, Measurement Techniques

Kane, Michael T. – Journal of Educational Measurement, 1986
These analyses suggest that if a criterion-referenced test had a reliability (defined in terms of internal consistency) below 0.5, a simple a priori procedure would provide better estimates of students' universe scores than would individual observed scores. (Author/LMO)
Descriptors: Criterion Referenced Tests, Educational Research, Error of Measurement, Generalizability Theory
Phillips, Gary W., Ed. – 1996
Recently, there has been a significant expansion in the use of performance assessment in large scale testing programs. Although there has been significant support from curriculum and policy stakeholders, the technical feasibility of large scale performance assessments has remained a question. This report is intended to contribute to the debate by…
Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics
Lawson, Stephen; And Others – 1991
Early childhood special educators recognize the necessity of establishing indices of reliability and validity for instruments that provide an index of developmental status. Many such instruments present little empirical evidence regarding psychometric integrity, particularly for a non-normative sample. The 341-item Battelle Developmental Inventory…
Descriptors: Child Development, Construct Validity, Diagnostic Tests, Disabilities