ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	6

Source

Educational Measurement:…

Author

Wind, Stefanie A.	2
Bennett, Randy E.	1
Cox, Troy L.	1
Deane, Paul	1
Eckstein, Grant T.	1
Hart, Judson M.	1
Hartshorn, K. James	1
Li, Feiming	1
McVay, Aaron	1
Sims, Maureen E.	1
Walker, A. Adrienne	1
Wilcox, Matthew P.	1
Wolfe, Edward W.	1
Xiong, Jiawei	1
Zhang, Mo	1
van Rijn, Peter W.	1

Publication Type

Journal Articles	6
Reports - Research	5
Reports - Descriptive	1

Education Level

Junior High Schools	1
Middle Schools	1
Secondary Education	1

Showing all 6 results

Bilevel Topic Model-Based Multitask Learning for Constructed-Responses Multidimensional Automated Scoring and Interpretation

Peer reviewed

Xiong, Jiawei; Li, Feiming – Educational Measurement: Issues and Practice, 2023

Multidimensional scoring evaluates each constructed-response answer from more than one rating dimension and/or trait such as lexicon, organization, and supporting ideas instead of only one holistic score, to help students distinguish between various dimensions of writing quality. In this work, we present a bilevel learning model for combining two…

Descriptors: Scoring, Models, Task Analysis, Learning Processes

A Model-Data-Fit-Informed Approach to Score Resolution in Performance Assessments

Peer reviewed

Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021

Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…

Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making

Rubric Rating with MFRM versus Randomly Distributed Comparative Judgment: A Comparison of Two Approaches to Second-Language Writing Assessment

Peer reviewed

Sims, Maureen E.; Cox, Troy L.; Eckstein, Grant T.; Hartshorn, K. James; Wilcox, Matthew P.; Hart, Judson M. – Educational Measurement: Issues and Practice, 2020

The purpose of this study is to explore the reliability of a potentially more practical approach to direct writing assessment in the context of ESL writing. Traditional rubric rating (RR) is a common yet resource-intensive evaluation practice when performed reliably. This study compared the traditional rubric model of ESL writing assessment and…

Descriptors: Scoring Rubrics, Item Response Theory, Second Language Learning, English (Second Language)

Are There Gender Differences in "How" Students Write Their Essays? An Analysis of Writing Processes

Peer reviewed

Zhang, Mo; Bennett, Randy E.; Deane, Paul; van Rijn, Peter W. – Educational Measurement: Issues and Practice, 2019

This study compared gender groups on the processes used in writing essays in an online assessment. Middle-school students from four grades responded to essays in two persuasive subgenres, argumentation and policy recommendation. Writing processes were inferred from four indicators extracted from students' keystroke logs. In comparison to males, on…

Descriptors: Gender Differences, Essays, Computer Assisted Testing, Persuasive Discourse

An Instructional Module on Mokken Scale Analysis

Peer reviewed

Wind, Stefanie A. – Educational Measurement: Issues and Practice, 2017

Mokken scale analysis (MSA) is a probabilistic-nonparametric approach to item response theory (IRT) that can be used to evaluate fundamental measurement properties with less strict assumptions than parametric IRT models. This instructional module provides an introduction to MSA as a probabilistic-nonparametric framework in which to explore…

Descriptors: Probability, Nonparametric Statistics, Item Response Theory, Scaling

Application of Latent Trait Models to Identifying Substantively Interesting Raters

Peer reviewed

Wolfe, Edward W.; McVay, Aaron – Educational Measurement: Issues and Practice, 2012

Historically, research focusing on rater characteristics and rating contexts that enable the assignment of accurate ratings and research focusing on statistical indicators of accurate ratings has been conducted by separate communities of researchers. This study demonstrates how existing latent trait modeling procedures can identify groups of…

Descriptors: Researchers, Research, Correlation, Test Bias

Descriptor

Writing Evaluation	6
Item Response Theory	3
Models	3
Comparative Analysis	2
Computer Assisted Testing	2
Evaluation Methods	2
Multiple Choice Tests	2
Raw Scores	2
Scores	2
Writing Skills	2
Attention	1
Connected Discourse	1
Correlation	1
Decision Making	1
Editing	1
Educational Assessment	1
English (Second Language)	1
Essays	1
Evaluators	1
Gender Differences	1
Goodness of Fit	1
Holistic Approach	1
Interrater Reliability	1
Keyboarding (Data Entry)	1
Language Skills	1