ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	11
Since 2006 (last 20 years)	24

Descriptor

Student Evaluation	65
Educational Assessment	30
Elementary Secondary Education	25
Test Use	24
Evaluation Methods	17
Test Construction	15
Academic Achievement	13
Measurement Techniques	12
Educational Testing	11
Achievement Tests	9
Performance Based Assessment	8
Scores	8
Test Interpretation	8
Test Items	8
Testing Programs	8
Evaluation Criteria	7
Measurement	7
Criterion Referenced Tests	6
Grading	6
High Stakes Tests	6
Scoring	6
Test Validity	6
Academic Standards	5
Cutting Scores	5
Diagnostic Tests	5
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	65
Reports - Evaluative	28
Reports - Research	16
Reports - Descriptive	13
Opinion Papers	6
Speeches/Meeting Papers	6
Information Analyses	3
Guides - Non-Classroom	2
Tests/Questionnaires	2
Guides - Classroom - Teacher	1

Education Level

Elementary Secondary Education	8
Elementary Education	5
Grade 3	2
Grade 4	2
Grade 5	2
Postsecondary Education	2
High Schools	1
Higher Education	1
Intermediate Grades	1

Audience

Teachers

Location

Nebraska	2
United States	2
California	1
China	1
Idaho	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing 1 to 15 of 65 results Save | Export

Modeling Slipping Effects in a Large-Scale Assessment with Innovative Item Formats

Peer reviewed

Direct link

Cuhadar, Ismail; Binici, Salih – Educational Measurement: Issues and Practice, 2022

This study employs the 4-parameter logistic item response theory model to account for the unexpected incorrect responses or slipping effects observed in a large-scale Algebra 1 End-of-Course assessment, including several innovative item formats. It investigates whether modeling the misfit at the upper asymptote has any practical impact on the…

Descriptors: Item Response Theory, Measurement, Student Evaluation, Algebra

How Did Students Engage with a Remote Educational Assessment? A Case Study

Peer reviewed

Direct link

Guo, Hongwen – Educational Measurement: Issues and Practice, 2022

Many educational summative and formative assessments have been transferred to a remote online setting because of the pandemic. Educational professionals and stakeholders have shown interest in learning how this change in the test mode influenced test takers; that is, whether test-taking experiences in a remote test setting were different from…

Descriptors: Distance Education, Educational Assessment, Student Evaluation, Summative Evaluation

Demystifying Adequate Growth Percentiles

Peer reviewed

Direct link

Katherine E. Castellano; Daniel F. McCaffrey; Joseph A. Martineau – Educational Measurement: Issues and Practice, 2025

Growth-to-standard models evaluate student growth against the growth needed to reach a future standard or target of interest, such as proficiency. A common growth-to-standard model involves comparing the popular Student Growth Percentile (SGP) to Adequate Growth Percentiles (AGPs). AGPs follow from an involved process based on fitting a series of…

Descriptors: Student Evaluation, Growth Models, Student Educational Objectives, Educational Indicators

A Problem with the Bookmark Procedure's Correction for Guessing

Peer reviewed

Direct link

Baldwin, Peter – Educational Measurement: Issues and Practice, 2021

In the Bookmark standard-setting procedure, panelists are instructed to consider what examinees know rather than what they might attain by guessing; however, because examinees sometimes do guess, the procedure includes a correction for guessing. Like other corrections for guessing, the Bookmark's correction assumes that examinees either know the…

Descriptors: Guessing (Tests), Student Evaluation, Evaluation Methods, Standard Setting (Scoring)

Achievement and Growth on English Language Proficiency and Content Assessments for English Learners in Elementary Grades

Peer reviewed

Direct link

Heather M. Buzick; Mikyung Kim Wolf; Laura Ballard – Educational Measurement: Issues and Practice, 2024

English language proficiency (ELP) assessment scores are used by states to make high-stakes decisions related to linguistic support in instruction and assessment for English learner (EL) students and for EL student reclassification. Changes to both academic content standards and ELP academic standards within the last decade have resulted in…

Descriptors: English Language Learners, Elementary School Students, English (Second Language), Language Proficiency

Deficiency, Contamination, and the Signal Processing Metaphor

Peer reviewed

Direct link

Newton, Paul E. – Educational Measurement: Issues and Practice, 2020

Educational assessment involves eliciting, transmitting, and receiving information concerning the level of proficiency of a learner in a specified domain. With that in mind, it is perhaps surprising that the literature seems to make very little use of the signal processing metaphor. The present article begins by making a general case for greater…

Descriptors: Educational Assessment, Student Evaluation, Evaluative Thinking, Test Validity

Standardization and "UNDERSTAND"ardization in Educational Assessment

Peer reviewed

Direct link

Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2020

Educational tests are standardized so that all examinees are tested on the same material, under the same testing conditions, and with the same scoring protocols. This uniformity is designed to provide a level "playing field" for all examinees so that the test is "the same" for everyone. Thus, standardization is designed to…

Descriptors: Standards, Educational Assessment, Culture Fair Tests, Scoring

Assessment in the Service of Student Learning: Three Cases in Point

Peer reviewed

Direct link

Bond, Lloyd – Educational Measurement: Issues and Practice, 2020

Three examples of extant testing practices (i.e., a classroom instructor's use of a simple pre-post design, the practice of teaching to the test, and the think aloud verbal protocol) are discussed to illustrate the contention that assessment in the service of testing and learning does not necessarily involve radically different assessment…

Descriptors: Testing, Test Preparation, Teaching Methods, Protocol Analysis

The Value of Choice: An Experiment Using Multiple-Choice Tests

Peer reviewed

Direct link

Aray, Henry; Pedauga, Luis – Educational Measurement: Issues and Practice, 2019

This article presents a novel experimental methodology in which groups of students were offered the option to choose between two equivalent scoring rules to assess a multiple-choice test. The effect of choosing the scoring rule on marks is tested. Two major contributions arise from this research. First, it contributes to the literature on the…

Descriptors: Multiple Choice Tests, Scoring, Student Attitudes, Decision Making

Effect of Content Knowledge on Angoff-Style Standard Setting Judgments

Peer reviewed

Direct link

Margolis, Melissa J.; Mee, Janet; Clauser, Brian E.; Winward, Marcia; Clauser, Jerome C. – Educational Measurement: Issues and Practice, 2016

Evidence to support the credibility of standard setting procedures is a critical part of the validity argument for decisions made based on tests that are used for classification. One area in which there has been limited empirical study is the impact of standard setting judge selection on the resulting cut score. One important issue related to…

Descriptors: Academic Standards, Standard Setting (Scoring), Cutting Scores, Credibility

Automated Scoring of Students' Small-Group Discussions to Assess Reading Ability

Peer reviewed

Direct link

Kosh, Audra E.; Greene, Jeffrey A.; Murphy, P. Karen; Burdick, Hal; Firetto, Carla M.; Elmore, Jeff – Educational Measurement: Issues and Practice, 2018

We explored the feasibility of using automated scoring to assess upper-elementary students' reading ability through analysis of transcripts of students' small-group discussions about texts. Participants included 35 fourth-grade students across two classrooms that engaged in a literacy intervention called Quality Talk. During the course of one…

Descriptors: Computer Assisted Testing, Small Group Instruction, Group Discussion, Student Evaluation

Test Development with Performance Standards and Achievement Growth in Mind

Peer reviewed

Direct link

Ferrara, Steve; Svetina, Dubravka; Skucha, Sylvia; Davidson, Anne H. – Educational Measurement: Issues and Practice, 2011

Items on test score scales located at and below the Proficient cut score define the content area knowledge and skills required to achieve proficiency. Alternately, examinees who perform at the Proficient level on a test can be expected to be able to demonstrate that they have mastered most of the knowledge and skills represented by the items at…

Descriptors: Knowledge Level, Mathematics Tests, Program Effectiveness, Inferences

An NCME Instructional Module on Booklet Designs in Large-Scale Assessments of Student Achievement: Theory and Practice

Peer reviewed

Direct link

Frey, Andreas; Hartig, Johannes; Rupp, Andre A. – Educational Measurement: Issues and Practice, 2009

In most large-scale assessments of student achievement, several broad content domains are tested. Because more items are needed to cover the content domains than can be presented in the limited testing time to each individual student, multiple test forms or booklets are utilized to distribute the items to the students. The construction of an…

Descriptors: Measures (Individuals), Test Construction, Theory Practice Relationship, Design

Differentials of a State Reading Assessment: Item Functioning, Distractor Functioning, and Omission Frequency for Disability Categories

Peer reviewed

Direct link

Kato, Kentaro; Moen, Ross E.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2009

Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional…

Descriptors: Learning Disabilities, Language Impairments, Behavior Disorders, Affective Behavior

A Framework for Evaluating and Planning Assessments Intended to Improve Student Achievement

Peer reviewed

Direct link

Nichols, Paul D.; Meyers, Jason L.; Burling, Kelly S. – Educational Measurement: Issues and Practice, 2009

Assessments labeled as formative have been offered as a means to improve student achievement. But labels can be a powerful way to miscommunicate. For an assessment use to be appropriately labeled "formative," both empirical evidence and reasoned arguments must be offered to support the claim that improvements in student achievement can be linked…

Descriptors: Academic Achievement, Tutoring, Student Evaluation, Evaluation Methods

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Brookhart, Susan M.	2
Cizek, Gregory J.	2
Hills, John R.	2
Plake, Barbara S.	2
Schafer, William D.	2
Stiggins, Richard J.	2
Airasian, Peter W.	1
Aray, Henry	1
Arter, Judith A.	1
Baldwin, Peter	1
Bauer, Ernest A.	1
Bennett, Randy Elliot	1
Binici, Salih	1
Bond, Lloyd	1
Buckendahl, Chad W.	1
Bunch, Michael B.	1
Burdick, Hal	1
Burling, Kelly S.	1
Cai, Jinfa	1
Cawthon, Stephanie W.	1
Clauser, Brian E.	1
Clauser, Jerome C.	1
Clemans, William V.	1
Compton, Elizabeth	1
More ▼