ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	9

Source

Educational Measurement:…

Publication Type

Journal Articles	11
Reports - Research	9
Reports - Descriptive	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	3
Elementary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Secondary Education	1

Location

China

Assessments and Surveys

Program for International…

Showing all 11 results

A Cost-Benefit Analysis of Automatic Item Generation

Peer reviewed

Kosh, Audra E.; Simpson, Mary Ann; Bickel, Lisa; Kellogg, Mark; Sanford-Moore, Ellie – Educational Measurement: Issues and Practice, 2019

Automatic item generation (AIG)--a means of leveraging technology to create large quantities of items--requires a minimum number of items to offset the sizable upfront investment (i.e., model development and technology deployment) in order to achieve cost savings. In this cost-benefit analysis, we estimated the cost of each step of AIG and manual…

Descriptors: Cost Effectiveness, Automation, Test Items, Mathematics Tests

The Effect of Drag-and-Drop Item Features on Test-Taker Performance and Response Strategies

Peer reviewed

Arslan, Burcu; Jiang, Yang; Keehner, Madeleine; Gong, Tao; Katz, Irvin R.; Yan, Fred – Educational Measurement: Issues and Practice, 2020

Computer-based educational assessments often include items that involve drag-and-drop responses. There are different ways that drag-and-drop items can be laid out and different choices that test developers can make when designing these items. Currently, these decisions are based on experts' professional judgments and design constraints, rather…

Descriptors: Test Items, Computer Assisted Testing, Test Format, Decision Making

Affordances of Item Formats and Their Effects on Test-Taker Cognition under Uncertainty

Peer reviewed

Moon, Jung Aa; Keehner, Madeleine; Katz, Irvin R. – Educational Measurement: Issues and Practice, 2019

The current study investigated how item formats and their inherent affordances influence test-takers' cognition under uncertainty. Adult participants solved content-equivalent math items in multiple-selection multiple-choice and four alternative grid formats. The results indicated that participants' affirmative response tendency (i.e., judge the…

Descriptors: Affordances, Test Items, Test Format, Test Wiseness

Measuring Widening Proficiency Differences in International Assessments: Are Current Approaches Enough?

Peer reviewed

Rutkowski, David; Rutkowski, Leslie; Liaw, Yuan-Ling – Educational Measurement: Issues and Practice, 2018

Participation in international large-scale assessments has grown over time with the largest, the Programme for International Student Assessment (PISA), including more than 70 education systems that are economically and educationally diverse. To help accommodate for large achievement differences among participants, in 2009 PISA offered…

Descriptors: Educational Assessment, Foreign Countries, Achievement Tests, Secondary School Students

Can Item Response Times Provide Insight into Students' Motivation and Self-Efficacy in Math? An Initial Application of Test Metadata to Understand Students' Social-Emotional Needs

Peer reviewed

Soland, James – Educational Measurement: Issues and Practice, 2019

As computer-based tests become more common, there is a growing wealth of metadata related to examinees' response processes, which include solution strategies, concentration, and operating speed. One common type of metadata is item response time. While response times have been used extensively to improve estimates of achievement, little work…

Descriptors: Test Items, Item Response Theory, Metadata, Self Efficacy

Gauging Item Alignment through Online Systems While Controlling for Rater Effects

Peer reviewed

Anderson, Daniel; Irvin, Shawn; Alonzo, Julie; Tindal, Gerald A. – Educational Measurement: Issues and Practice, 2015

The alignment of test items to content standards is critical to the validity of decisions made from standards-based tests. Generally, alignment is determined based on judgments made by a panel of content experts with either ratings averaged or via a consensus reached through discussion. When the pool of items to be reviewed is large, or the…

Descriptors: Test Items, Alignment (Education), Standards, Online Systems

Test Development with Performance Standards and Achievement Growth in Mind

Peer reviewed

Ferrara, Steve; Svetina, Dubravka; Skucha, Sylvia; Davidson, Anne H. – Educational Measurement: Issues and Practice, 2011

Items on test score scales located at and below the Proficient cut score define the content area knowledge and skills required to achieve proficiency. Alternately, examinees who perform at the Proficient level on a test can be expected to be able to demonstrate that they have mastered most of the knowledge and skills represented by the items at…

Descriptors: Knowledge Level, Mathematics Tests, Program Effectiveness, Inferences

Alignment of Mathematics State-Level Standards and Assessments: The Role of Reviewer Agreement

Peer reviewed

Webb, Noreen M.; Herman, Joan L.; Webb, Norman L. – Educational Measurement: Issues and Practice, 2007

This article examines the role of reviewer agreement in judgments about alignment between tests and standards. We used case data from three state alignment studies to explore how different approaches to incorporating reviewer agreement change alignment conclusions. The three case studies showed varying degrees of reviewer agreement about…

Descriptors: Test Items, Case Studies, Mathematics, Interrater Reliability

Effects of Assigning Raters to Items

Peer reviewed

Sykes, Robert C.; Ito, Kyoko; Wang, Zhen – Educational Measurement: Issues and Practice, 2008

Student responses to a large number of constructed response items in three Math and three Reading tests were scored on two occasions using three ways of assigning raters: single reader scoring, a different reader for each response (item-specific), and three readers each scoring a rater item block (RIB) containing approximately one-third of a…

Descriptors: Test Items, Mathematics Tests, Reading Tests, Scoring

Peer reviewed

Lane, Suzanne; And Others – Educational Measurement: Issues and Practice, 1996

Gender-related differential item functioning (DIF) was examined in a context in which 3,946 middle school students received mathematics instruction focusing on problem solving. Reasons why four tasks on the performance assessment favored female students and two favored male students are discussed. (SLD)

Descriptors: Item Bias, Mathematics Achievement, Mathematics Tests, Middle School Students

Beyond Computation and Correctness: Contributions of Open-Ended Tasks in Examining U.S. and Chinese Students' Mathematical Performance

Peer reviewed

Cai, Jinfa – Educational Measurement: Issues and Practice, 1997

The contributions of open-ended tasks in examining students' mathematical performance were studied with 250 U.S. and 425 Chinese sixth graders. Open-ended tasks allow for analysis of student performance that cannot be assessed solely by percent correct or incorrect, but they pose many problems, such as those of translation. (SLD)

Descriptors: Cognitive Processes, Computation, Cross Cultural Studies, Elementary School Students

Descriptor

Mathematics Tests	11
Test Items	11
Interrater Reliability	3
Test Construction	3
Cognitive Processes	2
Computer Assisted Testing	2
Difficulty Level	2
Educational Assessment	2
Foreign Countries	2
Mathematics Achievement	2
Performance Based Assessment	2
Performance Factors	2
Standards	2
Student Evaluation	2
Test Format	2
Academic Achievement	1
Achievement Gains	1
Achievement Tests	1
Adults	1
Affordances	1
Alignment (Education)	1
Ambiguity (Context)	1
Automation	1
Bias	1
Case Studies	1

Author

Katz, Irvin R.	2
Keehner, Madeleine	2
Alonzo, Julie	1
Anderson, Daniel	1
Arslan, Burcu	1
Bickel, Lisa	1
Cai, Jinfa	1
Davidson, Anne H.	1
Ferrara, Steve	1
Gong, Tao	1
Herman, Joan L.	1
Irvin, Shawn	1
Ito, Kyoko	1
Jiang, Yang	1
Kellogg, Mark	1
Kosh, Audra E.	1
Lane, Suzanne	1
Liaw, Yuan-Ling	1
Moon, Jung Aa	1
Rutkowski, David	1
Rutkowski, Leslie	1
Sanford-Moore, Ellie	1
Simpson, Mary Ann	1
Skucha, Sylvia	1