Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 10 |
Descriptor
Source
Educational Measurement:… | 11 |
Author
Publication Type
Journal Articles | 11 |
Reports - Evaluative | 7 |
Reports - Research | 2 |
Opinion Papers | 1 |
Reports - Descriptive | 1 |
Education Level
Elementary Secondary Education | 8 |
Elementary Education | 3 |
Grade 3 | 2 |
Grade 5 | 2 |
Grade 4 | 1 |
Audience
Location
California | 1 |
Idaho | 1 |
Nebraska | 1 |
United States | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Ferrara, Steve; Svetina, Dubravka; Skucha, Sylvia; Davidson, Anne H. – Educational Measurement: Issues and Practice, 2011
Items on test score scales located at and below the Proficient cut score define the content area knowledge and skills required to achieve proficiency. Alternately, examinees who perform at the Proficient level on a test can be expected to be able to demonstrate that they have mastered most of the knowledge and skills represented by the items at…
Descriptors: Knowledge Level, Mathematics Tests, Program Effectiveness, Inferences
Cawthon, Stephanie W. – Educational Measurement: Issues and Practice, 2009
Students who are deaf or hard of hearing (SDHH) often use test accommodations when they participate in large-scale, standardized assessments. The purpose of this article is to present findings from the "Third Annual Survey of Assessment and Accommodations for Students who are Deaf or Hard of Hearing". The "big five" accommodations were reported by…
Descriptors: Standardized Tests, Testing Accommodations, Measures (Individuals), Partial Hearing
Perie, Marianne; Marion, Scott; Gong, Brian – Educational Measurement: Issues and Practice, 2009
Local assessment systems are being marketed as formative, benchmark, predictive, and a host of other terms. Many so-called formative assessments are not at all similar to the types of assessments and strategies studied by Black and Wiliam (1998) but instead are interim assessments. In this article, we clarify the definition and uses of interim…
Descriptors: Student Evaluation, Evaluation Methods, Educational Assessment, Formative Evaluation
Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009
Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…
Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior
Heritage, Margaret; Kim, Jinok; Vendlinski, Terry; Herman, Joan – Educational Measurement: Issues and Practice, 2009
Based on the results of a generalizability study of measures of teacher knowledge for teaching mathematics developed at the National Center for Research on Evaluation, Standards, and Student Testing at the University of California, Los Angeles, this article provides evidence that teachers are better at drawing reasonable inferences about student…
Descriptors: Formative Evaluation, Educational Testing, Inferences, Mathematics Instruction
Nichols, Paul D.; Meyers, Jason L.; Burling, Kelly S. – Educational Measurement: Issues and Practice, 2009
Assessments labeled as formative have been offered as a means to improve student achievement. But labels can be a powerful way to miscommunicate. For an assessment use to be appropriately labeled "formative," both empirical evidence and reasoned arguments must be offered to support the claim that improvements in student achievement can be linked…
Descriptors: Academic Achievement, Tutoring, Student Evaluation, Evaluation Methods
Norman, Rebecca L.; Buckendahl, Chad W. – Educational Measurement: Issues and Practice, 2008
Many educational testing programs report examinee performance at more than two levels of proficiency. Whether these assessments have the capacity to support these multiple inferences, though, is a topic that has not been widely discussed. This study proposes a method for evaluating the minimum number of measurement opportunities for reporting…
Descriptors: Testing Programs, Student Evaluation, Educational Testing, Mathematics Achievement
Frey, Andreas; Hartig, Johannes; Rupp, Andre A. – Educational Measurement: Issues and Practice, 2009
In most large-scale assessments of student achievement, several broad content domains are tested. Because more items are needed to cover the content domains than can be presented in the limited testing time to each individual student, multiple test forms or booklets are utilized to distribute the items to the students. The construction of an…
Descriptors: Measures (Individuals), Test Construction, Theory Practice Relationship, Design
Shepard, Lorrie A. – Educational Measurement: Issues and Practice, 2009
In many school districts, the pressure to raise test scores has created overnight celebrity status for formative assessment. Its powers to raise student achievement have been touted, however, without attending to the research on which these claims were based. Sociocultural learning theory provides theoretical grounding for understanding how…
Descriptors: Learning Theories, Validity, Student Evaluation, Evaluation Methods
Elliott, Stephen N.; Compton, Elizabeth; Roach, Andrew T. – Educational Measurement: Issues and Practice, 2007
The relationships between ratings on the Idaho Alternate Assessment (IAA) for 116 students with significant disabilities and corresponding ratings for the same students on two norm-referenced teacher rating scales were examined to gain evidence about the validity of resulting IAA scores. To contextualize these findings, another group of 54…
Descriptors: Inferences, Disabilities, Rating Scales, Eligibility

Phelps, Richard P. – Educational Measurement: Issues and Practice, 1997
Data from large-scale international studies for 13 countries indicate that U.S. students are clearly not the most heavily tested students on earth if one compares systemwide tests by their duration. In the United States, tests are much more likely to be of low consequence for the student. (SLD)
Descriptors: Comparative Analysis, Educational Assessment, Educational Testing, Elementary Secondary Education