NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 8 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixong Gu – ETS Research Report Series, 2024
The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…
Descriptors: Test Items, Test Construction, Sample Size, Scaling
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Fujimoto, Ken A.; Gordon, Rachel A.; Peng, Fang; Hofer, Kerry G. – AERA Open, 2018
Classroom quality measures, such as the Early Childhood Environment Rating Scale, Revised (ECERS-R), are widely used in research, practice, and policy. Increasingly, these uses have been for purposes not originally intended, such as contributing to consequential policy decisions. The current study adds to the recent evidence of problems with the…
Descriptors: Rating Scales, Early Childhood Education, Educational Quality, Preschool Curriculum
Peer reviewed Peer reviewed
Direct linkDirect link
Hallinger, Philip; Dongyu, Li; Wang, Wen-Chung – Educational Administration Quarterly, 2016
Purpose: Instructional leadership has assumed steadily increasing importance within the general role set of principals over the past 60 years. One persisting finding within this corpus of studies concerns the consistently higher ratings obtained by female principals on instructional leadership when compared with their male counterparts. This…
Descriptors: Gender Differences, Instructional Leadership, Meta Analysis, Principals
Peer reviewed Peer reviewed
Direct linkDirect link
Petersen, Nancy S. – Applied Psychological Measurement, 2008
This article discusses the five studies included in this issue. Each article addressed the same topic, population invariance of equating. They all used data from major standardized testing programs, and they all used essentially the same statistics to evaluate their results, namely, the root mean square difference and root expected mean square…
Descriptors: Testing Programs, Standardized Tests, Equated Scores, Evaluation Methods
Woldbeck, Tanya – 1998
This paper summarizes some of the basic concepts in test equating. Various types of equating methods, as well as data collection designs, are outlined, with attempts to provide insight into preferred methods and techniques. Test equating describes a group of methods that enable test constructors and users to compare scores from two different forms…
Descriptors: Comparative Analysis, Data Collection, Difficulty Level, Equated Scores
Peer reviewed Peer reviewed
Harvey, Robert J.; Hammer, Allen L. – Counseling Psychologist, 1999
Examines item-response theory (IRT), which seeks to model the way in which latent psychological constructs manifest themselves in terms of observable item responses. Provides an overview of the most popular IRT models and contrasts them with the techniques used in classical test theory. Results highlight several IRT advantages. (Author/GCP)
Descriptors: Comparative Analysis, Counseling Psychology, Data Collection, Item Response Theory
Peer reviewed Peer reviewed
Harris, Deborah J.; Crouse, Jill D. – Applied Measurement in Education, 1993
Criteria used in the equating process proposed in the literature are reviewed. The discussion begins by examining how equating is defined. The controversy over the best criterion, the utility of some, and whether a criterion is needed at all means that much work needs to be done in this area. (SLD)
Descriptors: Data Collection, Definitions, Equated Scores, Evaluation Criteria
Peer reviewed Peer reviewed
Koretz, Daniel – Educational Evaluation and Policy Analysis, 1995
Studies of the mathematics assessments of the National Assessment of Educational Progress (NAEP) are summarized. One study found that omit rates for NAEP test items were higher for African Americans and Hispanics than for whites. The other found that descriptions and examples for the 1992 mathematics achievement levels were misleading. (SLD)
Descriptors: Black Students, Data Collection, Elementary Secondary Education, Hispanic Americans