NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 14 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Jeffrey Shero; Jessica Logan – Society for Research on Educational Effectiveness, 2024
Background/Context: Previous research in educational assessment has consistently emphasized the importance of reliability as a cornerstone of test quality. Traditional measures of reliability, such as test-retest and split-half reliability, offer a broad view of how internally consistent a measure is but overlook the variability in this internal…
Descriptors: Educational Assessment, Special Education, Students with Disabilities, Learning Disabilities
Peer reviewed Peer reviewed
Direct linkDirect link
Akaeze, Hope O.; Wu, Jamie Heng-Chieh; Lawrence, Frank R.; Weber, Everett P. – Journal of Psychoeducational Assessment, 2023
This paper reports an investigation into the psychometric properties of the COR-Advantage1.5 (COR-Adv1.5) assessment tool, a criterion-referenced observation-based instrument designed to assess the developmental abilities of children from birth through kindergarten. Using data from 8534 children participating in a state-funded preschool program…
Descriptors: Criterion Referenced Tests, Evaluation Methods, Measures (Individuals), Measurement Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Sya'bandari, Yustika; Rachmatullah, Arif; Ha, Minsu – International Journal of Science Education, 2021
The Measure of Acceptance of the Theory of Evolution (MATE) has been extensively used in science education research for more than two decades. This study examines the fairness of MATE items based on religious convictions and academic majors. The multidimensional item response theory and differential item functioning analyses were run on data…
Descriptors: Attitude Measures, Scientific Attitudes, Evolution, Adoption (Ideas)
Peer reviewed Peer reviewed
Direct linkDirect link
Oh, Yoonkyung; Osgood, D. Wayne; Smith, Emilie P. – Journal of Early Adolescence, 2015
The importance of afterschool hours for youth development is widely acknowledged, and afterschool settings have recently received increasing attention as an important venue for youth interventions, bringing a growing need for reliable and valid measures of afterschool quality. This study examined the extent to which the two observational tools,…
Descriptors: After School Programs, Program Effectiveness, Observation, Rating Scales
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Crawford, Angela R.; Johnson, Evelyn S.; Moylan, Laura A.; Zheng, Yuzhu – Grantee Submission, 2018
This study describes the development and initial psychometric evaluation of a Recognizing Effective Special Education Teachers (RESET) teacher observation instrument. Specifically, the study uses generalizability theory to compare two versions of a rubric, one with general descriptors of performance levels and one with item-specific descriptors of…
Descriptors: Special Education Teachers, Direct Instruction, Observation, Teaching Methods
Badjadi, Nour El Imane – Online Submission, 2013
The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…
Descriptors: Essay Tests, Writing Evaluation, Test Validity, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Lane, Kathleen Lynne; Oakes, Wendy Peia; Menzies, Holly Mariah; Major, Rebecca; Allegra, Laurie; Powers, Lisa; Schatschneider, Chris – Topics in Early Childhood Special Education, 2015
We report findings of two exploratory validation studies of a revised instrument: the "Student Risk Screening Scale for Early Childhood" version (SRSS-EC). The SRSS-EC was modified to reflect characteristics of externalizing and internalizing behaviors manifested by preschool-age children. In Study 1, we explored the reliability of…
Descriptors: Screening Tests, At Risk Students, Early Childhood Education, Rating Scales
Peer reviewed Peer reviewed
Direct linkDirect link
Wren, David; Barbera, Jack – Chemistry Education Research and Practice, 2014
Assessing conceptual understanding of foundational topics before instruction on higher-order concepts can provide chemical educators with information to aid instructional design. This study provides an instrument that can be used to identify students' alternative conceptions regarding thermochemistry concepts. The Thermochemistry Concept Inventory…
Descriptors: Psychometrics, Thermodynamics, Chemistry, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Follesdal, Hallvard; Hagtvet, Knut A. – Intelligence, 2009
The Mayer, Salovey, & Caruso Emotional Intelligence Test (MSCEIT) has been reported to provide reliable scores for the four-branch ability model of emotional intelligence [Mayer, J. D., Salovey, P., & Caruso, D. R. (2002). "Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT). User's manual." Toronto, Canada: Multi-Health…
Descriptors: Emotional Intelligence, Intelligence Tests, Adults, Error of Measurement
Proctor, Thomas P.; Kim, YoungKoung Rachel – College Board, 2009
Presented at the national conference for the American Educational Research Association (AERA) in April 2009. This study examined the utility of scores on the SAT writing test, specifically examining the reliability of scores using generalizability and item response theories. The study also provides an overview of current predictive validity…
Descriptors: College Entrance Examinations, Writing Tests, Psychometrics, Predictive Validity
Peer reviewed Peer reviewed
Shipper, Frank; And Others – Educational and Psychological Measurement, 1986
Despite much evidence that the Jenkins Activity Survey Type A (JAS Type A) scale is lacking in essential psychometric properties, it continues to be widely used for measuring coronary-prone behavior. Four psychometric properties of the scale were assessed. The scale failed to satisfy accepted reliability and validity criteria. (Author/JAZ)
Descriptors: Behavior Rating Scales, Factor Analysis, Generalizability Theory, Measurement Techniques
Peer reviewed Peer reviewed
Kane, Michael T. – Journal of Educational Measurement, 1986
These analyses suggest that if a criterion-referenced test had a reliability (defined in terms of internal consistency) below 0.5, a simple a priori procedure would provide better estimates of students' universe scores than would individual observed scores. (Author/LMO)
Descriptors: Criterion Referenced Tests, Educational Research, Error of Measurement, Generalizability Theory
Phillips, Gary W., Ed. – 1996
Recently, there has been a significant expansion in the use of performance assessment in large scale testing programs. Although there has been significant support from curriculum and policy stakeholders, the technical feasibility of large scale performance assessments has remained a question. This report is intended to contribute to the debate by…
Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics
Lawson, Stephen; And Others – 1991
Early childhood special educators recognize the necessity of establishing indices of reliability and validity for instruments that provide an index of developmental status. Many such instruments present little empirical evidence regarding psychometric integrity, particularly for a non-normative sample. The 341-item Battelle Developmental Inventory…
Descriptors: Child Development, Construct Validity, Diagnostic Tests, Disabilities