Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 12 |
Descriptor
Source
Educational Measurement:… | 14 |
Author
Burroughs, Susie | 1 |
Cawthon, Stephanie W. | 1 |
Chajewski, Michael | 1 |
Compton, Elizabeth | 1 |
Davidson, Anne H. | 1 |
Elliott, Stephen N. | 1 |
Ferrara, Steve | 1 |
Frey, Andreas | 1 |
Gong, Brian | 1 |
Groce, Eric | 1 |
Hartig, Johannes | 1 |
More ▼ |
Publication Type
Journal Articles | 14 |
Reports - Descriptive | 6 |
Reports - Research | 4 |
Reports - Evaluative | 3 |
Opinion Papers | 1 |
Education Level
Elementary Secondary Education | 14 |
Elementary Education | 2 |
Grade 3 | 2 |
Grade 5 | 2 |
Higher Education | 2 |
Adult Education | 1 |
Grade 4 | 1 |
High Schools | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Audience
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Klugman, Emma M.; Ho, Andrew D. – Educational Measurement: Issues and Practice, 2020
State testing programs regularly release previously administered test items to the public. We provide an open-source recipe for state, district, and school assessment coordinators to combine these items flexibly to produce scores linked to established state score scales. These would enable estimation of student score distributions and achievement…
Descriptors: Testing Programs, State Programs, Test Items, Scores
Ferrara, Steve; Svetina, Dubravka; Skucha, Sylvia; Davidson, Anne H. – Educational Measurement: Issues and Practice, 2011
Items on test score scales located at and below the Proficient cut score define the content area knowledge and skills required to achieve proficiency. Alternately, examinees who perform at the Proficient level on a test can be expected to be able to demonstrate that they have mastered most of the knowledge and skills represented by the items at…
Descriptors: Knowledge Level, Mathematics Tests, Program Effectiveness, Inferences
Chajewski, Michael; Mattern, Krista D.; Shaw, Emily J. – Educational Measurement: Issues and Practice, 2011
The purpose of the current study was to examine the relationship between Advanced Placement (AP) exam participation and enrollment in a 4-year postsecondary institution. A positive relationship was expected given that the primary purpose of offering AP courses is to allow students to engage in college-level academic work while in high school, and…
Descriptors: Advanced Placement Programs, College Preparation, College Credits, Enrollment
Wu, Margaret – Educational Measurement: Issues and Practice, 2010
In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…
Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness
Lissitz, Robert W.; Wei, Hua – Educational Measurement: Issues and Practice, 2008
In this article we address the issue of consistency in standard setting in the context of an augmented state testing program. Information gained from the external NRT scores is used to help make an informed decision on the determination of cut scores on the state test. The consistency of cut scores on the CRT across grades is maintained by forcing…
Descriptors: Testing Programs, State Programs, Standard Setting, Reliability
Frey, Andreas; Hartig, Johannes; Rupp, Andre A. – Educational Measurement: Issues and Practice, 2009
In most large-scale assessments of student achievement, several broad content domains are tested. Because more items are needed to cover the content domains than can be presented in the limited testing time to each individual student, multiple test forms or booklets are utilized to distribute the items to the students. The construction of an…
Descriptors: Measures (Individuals), Test Construction, Theory Practice Relationship, Design
Kingston, Neal; Nash, Brooke – Educational Measurement: Issues and Practice, 2011
An effect size of about 0.70 (or 0.40-0.70) is often claimed for the efficacy of formative assessment, but is not supported by the existing research base. More than 300 studies that appeared to address the efficacy of formative assessment in grades K-12 were reviewed. Many of the studies had severely flawed research designs yielding…
Descriptors: Elementary Secondary Education, Formative Evaluation, Program Effectiveness, Effect Size
Cawthon, Stephanie W. – Educational Measurement: Issues and Practice, 2009
Students who are deaf or hard of hearing (SDHH) often use test accommodations when they participate in large-scale, standardized assessments. The purpose of this article is to present findings from the "Third Annual Survey of Assessment and Accommodations for Students who are Deaf or Hard of Hearing". The "big five" accommodations were reported by…
Descriptors: Standardized Tests, Testing Accommodations, Measures (Individuals), Partial Hearing
Perie, Marianne; Marion, Scott; Gong, Brian – Educational Measurement: Issues and Practice, 2009
Local assessment systems are being marketed as formative, benchmark, predictive, and a host of other terms. Many so-called formative assessments are not at all similar to the types of assessments and strategies studied by Black and Wiliam (1998) but instead are interim assessments. In this article, we clarify the definition and uses of interim…
Descriptors: Student Evaluation, Evaluation Methods, Educational Assessment, Formative Evaluation
Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009
Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…
Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior
Burroughs, Susie; Groce, Eric; Webeck, Mary Lee – Educational Measurement: Issues and Practice, 2005
With 3 years and counting since its inception, the scope and impact of "No Child Left Behind" is now being felt in classrooms across the nation. Although some successes have been identified, concerns about the implementation and expectations of the legislation are emerging. As a result of the legislation's emphasis on the development of…
Descriptors: Program Effectiveness, Federal Legislation, Testing, Accountability
Shepard, Lorrie A. – Educational Measurement: Issues and Practice, 2009
In many school districts, the pressure to raise test scores has created overnight celebrity status for formative assessment. Its powers to raise student achievement have been touted, however, without attending to the research on which these claims were based. Sociocultural learning theory provides theoretical grounding for understanding how…
Descriptors: Learning Theories, Validity, Student Evaluation, Evaluation Methods
Porter, Andrew C.; Linn, Robert L.; Trimble, C. Scott – Educational Measurement: Issues and Practice, 2005
The No Child Left Behind Act allows states to vary (a) the trajectories they select to move from the baseline percent proficient or above in 2002 to the 100% proficient goal in 2014, (b) the minimum number of students required for reporting of disaggregated subgroup results, and (c) whether or not they will use confidence intervals when…
Descriptors: Federal Legislation, Educational Improvement, Educational Policy, State Legislation
Elliott, Stephen N.; Compton, Elizabeth; Roach, Andrew T. – Educational Measurement: Issues and Practice, 2007
The relationships between ratings on the Idaho Alternate Assessment (IAA) for 116 students with significant disabilities and corresponding ratings for the same students on two norm-referenced teacher rating scales were examined to gain evidence about the validity of resulting IAA scores. To contextualize these findings, another group of 54…
Descriptors: Inferences, Disabilities, Rating Scales, Eligibility