ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	3

Descriptor

Data Collection	5
Evaluators	5
Scoring	5
Computer Assisted Testing	2
Adoption (Ideas)	1
Bias	1
Biology	1
Cutting Scores	1
Design	1
Elementary Secondary Education	1
Foreign Countries	1
Geography	1
History	1
Holistic Evaluation	1
Information Technology	1
Intellectual Disciplines	1
Interviews	1
Item Response Theory	1
Licensing Examinations…	1
Longitudinal Studies	1
Mathematics	1
Measures (Individuals)	1
Multiple Choice Tests	1
Performance Based Assessment	1
Psychometrics	1
More ▼

Source

Educational and Psychological…	2
British Journal of…	1

Author

Wind, Stefanie A.	2
Coniam, David	1
Cramer, Stephen E.	1
Ge, Yuan	1
Gray, James	1
Guo, Wenjing	1
Yan, Zi	1

Publication Type

Reports - Research	4
Journal Articles	3
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Audience

Location

Hong Kong

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Detecting Rater Biases in Sparse Rater-Mediated Assessment Networks

Peer reviewed

Direct link

Wind, Stefanie A.; Ge, Yuan – Educational and Psychological Measurement, 2021

Practical constraints in rater-mediated assessments limit the availability of complete data. Instead, most scoring procedures include one or two ratings for each performance, with overlapping performances across raters or linking sets of multiple-choice items to facilitate model estimation. These incomplete scoring designs present challenges for…

Descriptors: Evaluators, Scoring, Data Collection, Design

Exploring the Combined Effects of Rater Misfit and Differential Rater Functioning in Performance Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Guo, Wenjing – Educational and Psychological Measurement, 2019

Rater effects, or raters' tendencies to assign ratings to performances that are different from the ratings that the performances warranted, are well documented in rater-mediated assessments across a variety of disciplines. In many real-data studies of rater effects, researchers have reported that raters exhibit more than one effect, such as a…

Descriptors: Evaluators, Bias, Scoring, Data Collection

A Comparative Picture of the Ease of Use and Acceptance of Onscreen Marking by Markers across Subject Areas

Peer reviewed

Direct link

Coniam, David; Yan, Zi – British Journal of Educational Technology, 2016

Onscreen marking (OSM) has been used for the majority of Hong Kong public examinations since 2012. The current study compares marker reactions to OSM, ie, perceived ease of use and acceptance of OSM, against the backdrop of virtually all subject areas being marked on screen. The data were collected from three major sources: (1) survey data…

Descriptors: Foreign Countries, Computer Assisted Testing, Usability, Adoption (Ideas)

Properties of Writing Tasks: A Study of Alternative Procedures for Holistic Writing Assessment. Final Report.

Download full text

Gray, James; And Others – 1982

Five studies of holistic writing assessment procedures examined interactive relationships of the participants, processes, and products of writing assessment episodes. The first study examined practices in designing writing test prompts. The second study investigated the effects of variation in the specification of audience in a writing test prompt…

Descriptors: Data Collection, Evaluators, Holistic Evaluation, Longitudinal Studies

Some Practical Solutions to Standard-Setting Problems: The Georgia Teacher Certification Test Experience.

Download full text

Cramer, Stephen E. – 1990

A standard-setting procedure was developed for the Georgia Teacher Certification Testing Program as tests in 30 teaching fields were revised. A list of important characteristics of a standard-setting procedure was derived, drawing on the work of R. A. Berk (1986). The best method was found to be a highly formalized judgmental, empirical Angoff…

Descriptors: Computer Assisted Testing, Cutting Scores, Data Collection, Elementary Secondary Education