ERIC - Search Results

Showing all 11 results

Test Development with Performance Standards and Achievement Growth in Mind

Peer reviewed

Ferrara, Steve; Svetina, Dubravka; Skucha, Sylvia; Davidson, Anne H. – Educational Measurement: Issues and Practice, 2011

Items on test score scales located at and below the Proficient cut score define the content area knowledge and skills required to achieve proficiency. Alternately, examinees who perform at the Proficient level on a test can be expected to be able to demonstrate that they have mastered most of the knowledge and skills represented by the items at…

Descriptors: Knowledge Level, Mathematics Tests, Program Effectiveness, Inferences

Accommodations for Students Who Are Deaf or Hard of Hearing in Large-Scale, Standardized Assessments: Surveying the Landscape and Charting a New Direction

Peer reviewed

Cawthon, Stephanie W. – Educational Measurement: Issues and Practice, 2009

Students who are deaf or hard of hearing (SDHH) often use test accommodations when they participate in large-scale, standardized assessments. The purpose of this article is to present findings from the "Third Annual Survey of Assessment and Accommodations for Students who are Deaf or Hard of Hearing". The "big five" accommodations were reported by…

Descriptors: Standardized Tests, Testing Accommodations, Measures (Individuals), Partial Hearing

Moving toward a Comprehensive Assessment System: A Framework for Considering Interim Assessments

Peer reviewed

Perie, Marianne; Marion, Scott; Gong, Brian – Educational Measurement: Issues and Practice, 2009

Local assessment systems are being marketed as formative, benchmark, predictive, and a host of other terms. Many so-called formative assessments are not at all similar to the types of assessments and strategies studied by Black and Wiliam (1998) but instead are interim assessments. In this article, we clarify the definition and uses of interim…

Descriptors: Student Evaluation, Evaluation Methods, Educational Assessment, Formative Evaluation

Differentials of a State Reading Assessment: Item Functioning, Distractor Functioning, and Omission Frequency for Disability Categories

Peer reviewed

Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009

Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…

Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior

From Evidence to Action: A Seamless Process in Formative Assessment?

Peer reviewed

Heritage, Margaret; Kim, Jinok; Vendlinski, Terry; Herman, Joan – Educational Measurement: Issues and Practice, 2009

Based on the results of a generalizability study of measures of teacher knowledge for teaching mathematics developed at the National Center for Research on Evaluation, Standards, and Student Testing at the University of California, Los Angeles, this article provides evidence that teachers are better at drawing reasonable inferences about student…

Descriptors: Formative Evaluation, Educational Testing, Inferences, Mathematics Instruction

A Framework for Evaluating and Planning Assessments Intended to Improve Student Achievement

Peer reviewed

Nichols, Paul D.; Meyers, Jason L.; Burling, Kelly S. – Educational Measurement: Issues and Practice, 2009

Assessments labeled as formative have been offered as a means to improve student achievement. But labels can be a powerful way to miscommunicate. For an assessment use to be appropriately labeled "formative," both empirical evidence and reasoned arguments must be offered to support the claim that improvements in student achievement can be linked…

Descriptors: Academic Achievement, Tutoring, Student Evaluation, Evaluation Methods

Determining Sufficient Measurement Opportunities when Using Multiple Cut Scores

Peer reviewed

Norman, Rebecca L.; Buckendahl, Chad W. – Educational Measurement: Issues and Practice, 2008

Many educational testing programs report examinee performance at more than two levels of proficiency. Whether these assessments have the capacity to support these multiple inferences, though, is a topic that has not been widely discussed. This study proposes a method for evaluating the minimum number of measurement opportunities for reporting…

Descriptors: Testing Programs, Student Evaluation, Educational Testing, Mathematics Achievement

An NCME Instructional Module on Booklet Designs in Large-Scale Assessments of Student Achievement: Theory and Practice

Peer reviewed

Frey, Andreas; Hartig, Johannes; Rupp, Andre A. – Educational Measurement: Issues and Practice, 2009

In most large-scale assessments of student achievement, several broad content domains are tested. Because more items are needed to cover the content domains than can be presented in the limited testing time to each individual student, multiple test forms or booklets are utilized to distribute the items to the students. The construction of an…

Descriptors: Measures (Individuals), Test Construction, Theory Practice Relationship, Design

Commentary: Evaluating the Validity of Formative and Interim Assessment

Peer reviewed

Shepard, Lorrie A. – Educational Measurement: Issues and Practice, 2009

In many school districts, the pressure to raise test scores has created overnight celebrity status for formative assessment. Its powers to raise student achievement have been touted, however, without attending to the research on which these claims were based. Sociocultural learning theory provides theoretical grounding for understanding how…

Descriptors: Learning Theories, Validity, Student Evaluation, Evaluation Methods

Building Validity Evidence for Scores on a State-Wide Alternate Assessment: A Contrasting Groups, Multimethod Approach

Peer reviewed

Elliott, Stephen N.; Compton, Elizabeth; Roach, Andrew T. – Educational Measurement: Issues and Practice, 2007

The relationships between ratings on the Idaho Alternate Assessment (IAA) for 116 students with significant disabilities and corresponding ratings for the same students on two norm-referenced teacher rating scales were examined to gain evidence about the validity of resulting IAA scores. To contextualize these findings, another group of 54…

Descriptors: Inferences, Disabilities, Rating Scales, Eligibility

Are U.S. Students the Most Heavily Tested on Earth?

Peer reviewed

Phelps, Richard P. – Educational Measurement: Issues and Practice, 1997

Data from large-scale international studies for 13 countries indicate that U.S. students are clearly not the most heavily tested students on earth if one compares systemwide tests by their duration. In the United States, tests are much more likely to be of low consequence for the student. (SLD)

Descriptors: Comparative Analysis, Educational Assessment, Educational Testing, Elementary Secondary Education
