Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 7 |
Descriptor
Educational Testing | 8 |
Evaluation Problems | 8 |
Evaluation Research | 8 |
Educational Assessment | 6 |
Evaluation Methods | 6 |
Measurement | 5 |
Testing Problems | 5 |
Psychometrics | 4 |
Mathematics Education | 3 |
Mathematics Instruction | 3 |
Program Effectiveness | 3 |
More ▼ |
Author
Baldwin, Su G. | 1 |
Chavez, Oscar | 1 |
Clauser, Brian E. | 1 |
Cui, Ying | 1 |
Dillon, Gerard F. | 1 |
Grouws, Douglas A. | 1 |
Hill, Heather C. | 1 |
Leighton, Jacqueline P. | 1 |
Margolis, Melissa J. | 1 |
McNaught, Melissa D. | 1 |
Mee, Janet | 1 |
More ▼ |
Publication Type
Journal Articles | 5 |
Reports - Research | 3 |
Opinion Papers | 2 |
Reports - Evaluative | 2 |
ERIC Digests in Full Text | 1 |
ERIC Publications | 1 |
Numerical/Quantitative Data | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Elementary Secondary Education | 5 |
Secondary Education | 3 |
Elementary Education | 1 |
High Schools | 1 |
Audience
Location
Florida | 1 |
North Carolina | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Advanced Placement… | 1 |
Florida Comprehensive… | 1 |
Iowa Tests of Educational… | 1 |
National Assessment of… | 1 |
North Carolina End of Course… | 1 |
What Works Clearinghouse Rating
Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009
In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…
Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)
Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009
Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…
Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
Xu, Zeyu; Nichols, Austin – National Center for Analysis of Longitudinal Data in Education Research, 2010
The gold standard in making causal inference on program effects is a randomized trial. Most randomization designs in education randomize classrooms or schools rather than individual students. Such "clustered randomization" designs have one principal drawback: They tend to have limited statistical power or precision. This study aims to…
Descriptors: Test Format, Reading Tests, Norm Referenced Tests, Research Design
Tarr, James E.; Ross, Daniel J.; McNaught, Melissa D.; Chavez, Oscar; Grouws, Douglas A.; Reys, Robert E.; Sears, Ruthmae; Taylan, R. Didem – Online Submission, 2010
The Comparing Options in Secondary Mathematics: Investigating Curriculum (COSMIC) project is a longitudinal study of student learning from two types of mathematics curricula: integrated and subject-specific. Previous large-scale research studies such as the National Assessment of Educational Progress (NAEP) indicate that numerous variables are…
Descriptors: Mathematics Education, Teacher Characteristics, Mathematics Achievement, Program Effectiveness
Hill, Heather C. – Measurement: Interdisciplinary Research and Perspectives, 2007
The author offers some thoughts on commentator's reactions to the substance of the measures, particularly those about measuring teacher learning and change, based on the major uses of the measures, and because this is a significant challenge facing test development as an enterprise. If teacher learning results in more integrated knowledge or…
Descriptors: Educational Testing, Tests, Measurement, Faculty Development
Schilling, Stephen – Measurement: Interdisciplinary Research and Perspectives, 2007
In this article, the author echoes his co-author's and colleague's pleasure (Hill, this issue) at the thoughtfulness and far-ranging nature of the comments to their initial attempts at test validation for the mathematical knowledge for teaching (MKT) measures using the validity argument approach. Because of the large number of commentaries they…
Descriptors: Generalizability Theory, Persuasive Discourse, Educational Testing, Measurement
Schafer, William D. – 1995
The purpose of this digest is to describe school counselors' roles in the area of assessment through an historical review of testing in counseling, and to report on study findings regarding roles employers require school counselors to perform. Knowledge needed by counselors to obtain evidence, evaluate its usefulness, and interpret its meaning…
Descriptors: Counselor Evaluation, Educational Testing, Elementary Secondary Education, Evaluation