ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	1

Descriptor

Comparative Testing	3
Test Reliability	3
Test Validity	2
Achievement Tests	1
Black Students	1
College Students	1
Computer Assisted Testing	1
Context Effect	1
Correlation	1
Essay Tests	1
Essays	1
Evaluation Criteria	1
Evaluation Methods	1
German	1
Grade 8	1
Higher Education	1
Interrater Reliability	1
Italian	1
Item Bias	1
Junior High School Students	1
Junior High Schools	1
Mathematics Tests	1
Multilingual Materials	1
Multiple Choice Tests	1
Racial Bias	1
More ▼

Source

Journal of Educational…

Author

Breland, Hunter M.	1
Gaynor, Judith L.	1
Hamid Mohammadi	1
Mark J. Gierl	1
Ryan, Katherine E.	1
Tahereh Firoozi	1

Publication Type

Journal Articles	3
Reports - Research	3
Speeches/Meeting Papers	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Test of Standard Written…

What Works Clearinghouse Rating

Showing all 3 results Save | Export

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Direct link

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

A Comparison of Direct and Indirect Assessments of Writing Skill.

Peer reviewed

Breland, Hunter M.; Gaynor, Judith L. – Journal of Educational Measurement, 1979

Over 2,000 writing samples were collected from four undergraduate institutions and compared, where possible, with scores on a multiple-choice test. High correlations between ratings of the writing samples and multiple-choice test scores were obtained. Samples contributed substantially to the prediction of both college grades and writing…

Descriptors: Achievement Tests, Comparative Testing, Correlation, Essay Tests

The Performance of the Mantel-Haenszel Procedure across Samples and Matching Criteria.

Peer reviewed

Ryan, Katherine E. – Journal of Educational Measurement, 1991

The reliability of Mantel-Haenszel (MH) indexes across samples of examinees and sample sizes and their robustness to item context effects were investigated with data for 670 African-American and 5,015 white students from the Second International Mathematics Study. MH procedures can be used to detect differential item functioning. (SLD)

Descriptors: Black Students, Comparative Testing, Context Effect, Evaluation Criteria