Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian

Breland, Hunter M.; Gaynor, Judith L. – Journal of Educational Measurement, 1979
Over 2,000 writing samples were collected from four undergraduate institutions and compared, where possible, with scores on a multiple-choice test. High correlations between ratings of the writing samples and multiple-choice test scores were obtained. Samples contributed substantially to the prediction of both college grades and writing…
Descriptors: Achievement Tests, Comparative Testing, Correlation, Essay Tests

Ryan, Katherine E. – Journal of Educational Measurement, 1991
The reliability of Mantel-Haenszel (MH) indexes across samples of examinees and sample sizes and their robustness to item context effects were investigated with data for 670 African-American and 5,015 white students from the Second International Mathematics Study. MH procedures can be used to detect differential item functioning. (SLD)
Descriptors: Black Students, Comparative Testing, Context Effect, Evaluation Criteria