ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	3

Descriptor

Evaluators	3
Computer Assisted Testing	2
Scoring	2
Accuracy	1
Bayesian Statistics	1
Computation	1
Correlation	1
Cutting Scores	1
Essay Tests	1
Foreign Countries	1
Foreign Students	1
German	1
Goodness of Fit	1
Individual Characteristics	1
Interrater Reliability	1
Item Analysis	1
Item Response Theory	1
Language Tests	1
Markov Processes	1
Measurement	1
Monte Carlo Methods	1
Psychometrics	1
Reliability	1
Scores	1
Second Languages	1
More ▼

Source

International Journal of…

Author

Childs, Ruth A.	1
Eckes, Thomas	1
Engelhard, George, Jr.	1
Foltz, Peter	1
Jaciw, Andrew P.	1
Jin, Kuan-Yu	1
Rosenstein, Mark	1
Saunders, Kelsey	1
Wind, Stefanie A.	1
Wolfe, Edward W.	1

Publication Type

Journal Articles	3
Reports - Research	2
Reports - Descriptive	1

Education Level

Elementary Secondary Education

Audience

Location

Germany

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 3 results Save | Export

Examining Severity and Centrality Effects in TestDaF Writing and Speaking Assessments: An Extended Bayesian Many-Facet Rasch Analysis

Peer reviewed

Direct link

Eckes, Thomas; Jin, Kuan-Yu – International Journal of Testing, 2021

Severity and centrality are two main kinds of rater effects posing threats to the validity and fairness of performance assessments. Adopting Jin and Wang's (2018) extended facets modeling approach, we separately estimated the magnitude of rater severity and centrality effects in the web-based TestDaF (Test of German as a Foreign Language) writing…

Descriptors: Language Tests, German, Second Languages, Writing Tests

The Influence of Rater Effects in Training Sets on the Psychometric Quality of Automated Scoring for Writing Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Wolfe, Edward W.; Engelhard, George, Jr.; Foltz, Peter; Rosenstein, Mark – International Journal of Testing, 2018

Automated essay scoring engines (AESEs) are becoming increasingly popular as an efficient method for performance assessments in writing, including many language assessments that are used worldwide. Before they can be used operationally, AESEs must be "trained" using machine-learning techniques that incorporate human ratings. However, the…

Descriptors: Computer Assisted Testing, Essay Tests, Writing Evaluation, Scoring

Scoring Guide Alignment: Combining Scorer Judgments with Item Parameter Estimates to Set Cut Scores

Peer reviewed

Direct link

Childs, Ruth A.; Jaciw, Andrew P.; Saunders, Kelsey – International Journal of Testing, 2007

Many approaches to standard-setting use item calibration and student score estimation results to structure panelists' tasks. However, this requires collecting standard-setting judgments after the item analysis results are available. The Scoring Guide Alignment approach collects standard-setting judgments during the scoring sessions from teachers…

Descriptors: Testing Programs, Scoring, Item Analysis, Test Items