ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	4

Descriptor

Data Collection	8
Item Response Theory	6
Scores	4
Equated Scores	3
Test Construction	3
Comparative Analysis	2
Evaluation Methods	2
Gender Differences	2
Literature Reviews	2
Models	2
Racial Differences	2
Rating Scales	2
Research Methodology	2
Test Items	2
Achievement Tests	1
Black Students	1
Child Development	1
College Entrance Examinations	1
Computation	1
Correlation	1
Counseling Psychology	1
Data Analysis	1
Definitions	1
Difficulty Level	1
Early Childhood Education	1
More ▼

Source

AERA Open	1
Applied Measurement in…	1
Applied Psychological…	1
Counseling Psychologist	1
ETS Research Report Series	1
Educational Administration…	1
Educational Evaluation and…	1

Author

Crouse, Jill D.	1
Daniel F. McCaffrey	1
Dongyu, Li	1
Fujimoto, Ken A.	1
Gordon, Rachel A.	1
Hallinger, Philip	1
Hammer, Allen L.	1
Harris, Deborah J.	1
Harvey, Robert J.	1
Hofer, Kerry G.	1
Hongwen Guo	1
Koretz, Daniel	1
Lixong Gu	1
Matthew S. Johnson	1
Peng, Fang	1
Petersen, Nancy S.	1
Wang, Wen-Chung	1
Woldbeck, Tanya	1
More ▼

Publication Type

Information Analyses	8
Journal Articles	7
Reports - Research	3
Reports - Evaluative	2
Speeches/Meeting Papers	2

Education Level

Early Childhood Education	1
Higher Education	1

Audience

Location

Taiwan	1
Thailand	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Early Childhood Environment…	1
Myers Briggs Type Indicator	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Practical Considerations in Item Calibration with Small Samples under Multistage Test Design: A Case Study. Research Report. ETS RR-24-03

Peer reviewed
PDF on ERIC

Download full text

Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixong Gu – ETS Research Report Series, 2024

The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…

Descriptors: Test Items, Test Construction, Sample Size, Scaling

Examining the Category Functioning of the ECERS-R across Eight Data Sets

Peer reviewed
PDF on ERIC

Download full text

Fujimoto, Ken A.; Gordon, Rachel A.; Peng, Fang; Hofer, Kerry G. – AERA Open, 2018

Classroom quality measures, such as the Early Childhood Environment Rating Scale, Revised (ECERS-R), are widely used in research, practice, and policy. Increasingly, these uses have been for purposes not originally intended, such as contributing to consequential policy decisions. The current study adds to the recent evidence of problems with the…

Descriptors: Rating Scales, Early Childhood Education, Educational Quality, Preschool Curriculum

Gender Differences in Instructional Leadership: A Meta-Analytic Review of Studies Using the Principal Instructional Management Rating Scale

Peer reviewed

Direct link

Hallinger, Philip; Dongyu, Li; Wang, Wen-Chung – Educational Administration Quarterly, 2016

Purpose: Instructional leadership has assumed steadily increasing importance within the general role set of principals over the past 60 years. One persisting finding within this corpus of studies concerns the consistently higher ratings obtained by female principals on instructional leadership when compared with their male counterparts. This…

Descriptors: Gender Differences, Instructional Leadership, Meta Analysis, Principals

A Discussion of Population Invariance of Equating

Peer reviewed

Direct link

Petersen, Nancy S. – Applied Psychological Measurement, 2008

This article discusses the five studies included in this issue. Each article addressed the same topic, population invariance of equating. They all used data from major standardized testing programs, and they all used essentially the same statistics to evaluate their results, namely, the root mean square difference and root expected mean square…

Descriptors: Testing Programs, Standardized Tests, Equated Scores, Evaluation Methods

Basic Concepts in Modern Methods of Test Equating.

Download full text

Woldbeck, Tanya – 1998

This paper summarizes some of the basic concepts in test equating. Various types of equating methods, as well as data collection designs, are outlined, with attempts to provide insight into preferred methods and techniques. Test equating describes a group of methods that enable test constructors and users to compare scores from two different forms…

Descriptors: Comparative Analysis, Data Collection, Difficulty Level, Equated Scores

Item Response Theory.

Peer reviewed

Harvey, Robert J.; Hammer, Allen L. – Counseling Psychologist, 1999

Examines item-response theory (IRT), which seeks to model the way in which latent psychological constructs manifest themselves in terms of observable item responses. Provides an overview of the most popular IRT models and contrasts them with the techniques used in classical test theory. Results highlight several IRT advantages. (Author/GCP)

Descriptors: Comparative Analysis, Counseling Psychology, Data Collection, Item Response Theory

A Study of Criteria Used in Equating.

Peer reviewed

Harris, Deborah J.; Crouse, Jill D. – Applied Measurement in Education, 1993

Criteria used in the equating process proposed in the literature are reviewed. The discussion begins by examining how equating is defined. The controversy over the best criterion, the utility of some, and whether a criterion is needed at all means that much work needs to be done in this area. (SLD)

Descriptors: Data Collection, Definitions, Equated Scores, Evaluation Criteria

The Quality of Information from NAEP: Two Examples of Work Done in Collaboration with Leigh Burstein.

Peer reviewed

Koretz, Daniel – Educational Evaluation and Policy Analysis, 1995

Studies of the mathematics assessments of the National Assessment of Educational Progress (NAEP) are summarized. One study found that omit rates for NAEP test items were higher for African Americans and Hispanics than for whites. The other found that descriptions and examples for the 1992 mathematics achievement levels were misleading. (SLD)

Descriptors: Black Students, Data Collection, Elementary Secondary Education, Hispanic Americans