Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 7 |
Descriptor
Source
Journal of Educational… | 3 |
Measurement:… | 2 |
American Educational Research… | 1 |
Educational Measurement:… | 1 |
Online Submission | 1 |
Author
Baldwin, Su G. | 1 |
Billings, Bradley B. | 1 |
Clauser, Brian E. | 1 |
Cui, Ying | 1 |
Dillon, Gerard F. | 1 |
Hedl, John J., Jr. | 1 |
Hill, Heather C. | 1 |
Hogan, Thomas P. | 1 |
Jiao, Hong | 1 |
Jin, Ying | 1 |
Leighton, Jacqueline P. | 1 |
More ▼ |
Publication Type
Journal Articles | 7 |
Reports - Research | 4 |
Opinion Papers | 3 |
Reports - Evaluative | 2 |
Collected Works - Proceedings | 1 |
Guides - General | 1 |
Numerical/Quantitative Data | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Elementary Secondary Education | 5 |
Elementary Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Secondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
Advanced Placement… | 1 |
Stanford Achievement Tests | 1 |
What Works Clearinghouse Rating
Wang, Shudong; Jiao, Hong; Jin, Ying; Thum, Yeow Meng – Online Submission, 2010
The vertical scales of large-scale achievement tests created by using item response theory (IRT) models are mostly based on cluster (or correlated) educational data in which students usually are clustered in certain groups or settings (classrooms or schools). While such application directly violated assumption of independent sample of person in…
Descriptors: Scaling, Achievement Tests, Data Analysis, Item Response Theory
Papay, John P. – American Educational Research Journal, 2011
Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…
Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests
Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009
In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…
Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)
Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009
Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…
Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology

Hogan, Thomas P. – Educational Measurement: Issues and Practice, 1983
The implications of "A Nation at Risk" for the field of measurement are examined. These are a need for more frequent and varied tests; responsibility for eliminating measurement problems; and a resurgence of standardized testing at the high school level. The need for measures of teaching quality will increase. (DWH)
Descriptors: Educational Improvement, Educational Testing, Futures (of Society), Measurement
Tobias, Sigmund; Hedl, John J., Jr. – 1972
This paper reports two experiments whose purpose was to relate two bodies of research on anxiety: test and trait-state anxiety. It was reasoned that state anxiety measures obtained in an evaluation testing condition should be more similar to test anxiety than state anxiety measures obtained in non-evaluative situations, such as a game in Study I…
Descriptors: Anxiety, Behavioral Science Research, College Students, Educational Testing
Hill, Heather C. – Measurement: Interdisciplinary Research and Perspectives, 2007
The author offers some thoughts on commentator's reactions to the substance of the measures, particularly those about measuring teacher learning and change, based on the major uses of the measures, and because this is a significant challenge facing test development as an enterprise. If teacher learning results in more integrated knowledge or…
Descriptors: Educational Testing, Tests, Measurement, Faculty Development
Schilling, Stephen – Measurement: Interdisciplinary Research and Perspectives, 2007
In this article, the author echoes his co-author's and colleague's pleasure (Hill, this issue) at the thoughtfulness and far-ranging nature of the comments to their initial attempts at test validation for the mathematical knowledge for teaching (MKT) measures using the validity argument approach. Because of the large number of commentaries they…
Descriptors: Generalizability Theory, Persuasive Discourse, Educational Testing, Measurement
Connecticut State Dept. of Education, Hartford. Bureau of Evaluation and Educational Services. – 1975
The practice of analyzing all available project children in as large a group as possible is considered not to be justifiable when distinct subgroups of pupils are represented. Instead, the approach suggested here determines the test score gain a pupil achieves from the beginning to the end of the year, with all of the pupil gain scores of a single…
Descriptors: Academic Achievement, Classification, Compensatory Education, Educational Assessment
Stake, Robert E. – 1971
A discussion of performance contracting, defined as an agreement between a group offering instruction and a school needing the services, is presented. Four major hazards to direct measurement of specific learning are considered: poor statement of objectives; selection of the wrong tests; misinterpretation of test scores; and depersonalization of…
Descriptors: Accountability, Criterion Referenced Tests, Educational Objectives, Educational Testing
Educational Testing Service, Princeton, NJ. – 1953
Seven major topics were included in the conference proceedings: (1) Improving Evaluation of Educational Outcomes at the College Level; (2) Individual versus Group Decision Making; (3) Problems and Procedures in Profile Analysis; (4) Making Test Results Meaningful; (5) The Teaching of Educational Measurement; (6) The Interview as an Evaluation…
Descriptors: Course Content, Decision Making, Educational Benefits, Educational Improvement
Mushkin, Selma J.; Billings, Bradley B. – 1975
This guide is essentially designed as a teaching aid for those who would inform planners, officials of educational ministries, school administrators, principals, and teachers about educational outcome measurements. In outline and graphic form, the guide presents topics for discussion in a seminar dealing with how to obtain information on…
Descriptors: Academic Achievement, Affective Measures, Comparative Education, Educational Assessment
PAQUETTE, F. ANDRE; TOLLINGER, SUZANNE – 1966
THE DIRECTOR OF TESTING OF THE MODERN LANGUAGE ASSOCIATION (MLA), WITH THE ASSISTANCE OF 28 SELECTED IMPARTIAL PROFESSIONALS, PRODUCED INDIVIDUALLY AND IN TEAMS THIS CRITICAL APPRAISAL OF THE MLA FOREIGN LANGUAGE PROFICIENCY TESTS IN ORDER TO POINT OUT EXISTING DEFICIENCIES AND TO SUGGEST IMPROVEMENTS IN FUTURE TEST DEVELOPMENT. THIS HANDBOOK…
Descriptors: Achievement Tests, Advanced Students, Applied Linguistics, Cultural Background