ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	4

Descriptor

Data Analysis	5
Language Tests	5
English (Second Language)	3
Foreign Countries	3
Error of Measurement	2
Interrater Reliability	2
Item Response Theory	2
Language Proficiency	2
Scores	2
Scoring	2
Second Language Learning	2
Academic Aptitude	1
Accuracy	1
Aptitude Tests	1
Coding	1
College Entrance Examinations	1
College Students	1
Comparative Analysis	1
Componential Analysis	1
Correlation	1
Cues	1
Danish	1
Evaluation Methods	1
Evaluators	1
Feedback	1
More ▼

Source

Language Testing

Author

Deygers, Bart	1
Dollerup, Cay	1
Lin, Chih-Kai	1
Pae, Tae-Il	1
Staples, Shelley	1
Van Gorp, Koen	1
Yan, Xun	1

Publication Type

Journal Articles	5
Reports - Research	4
Reports - Evaluative	1

Education Level

Higher Education

Audience

Location

Denmark	1
Netherlands	1
South Korea	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Fitting MD Analysis in an Argument-Based Validity Framework for Writing Assessment: Explanation and Generalization Inferences for the ECPE

Peer reviewed

Direct link

Yan, Xun; Staples, Shelley – Language Testing, 2020

The argument-based approach to validity (Kane, 2013) focuses on two steps: (1) making claims about the proposed interpretation and use of test scores as a coherent, interpretive argument; and (2) evaluating those claims based on theoretical and empirical evidence related to test performances and scores. This paper discusses the role of…

Descriptors: Writing Tests, Language Tests, Language Proficiency, Test Validity

Working with Sparse Data in Rated Language Tests: Generalizability Theory Applications

Peer reviewed

Direct link

Lin, Chih-Kai – Language Testing, 2017

Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…

Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy

Determining the Scoring Validity of a Co-Constructed CEFR-Based Rating Scale

Peer reviewed

Direct link

Deygers, Bart; Van Gorp, Koen – Language Testing, 2015

Considering scoring validity as encompassing both reliable rating scale use and valid descriptor interpretation, this study reports on the validation of a CEFR-based scale that was co-constructed and used by novice raters. The research questions this paper wishes to answer are (a) whether it is possible to construct a CEFR-based rating scale with…

Descriptors: Rating Scales, Scoring, Validity, Interrater Reliability

Causes of Gender DIF on an EFL Language Test: A Multiple-Data Analysis over Nine Years

Peer reviewed

Direct link

Pae, Tae-Il – Language Testing, 2012

This study tracked gender differential item functioning (DIF) on the English subtest of the Korean College Scholastic Aptitude Test (KCSAT) over a nine-year period across three data points, using both the Mantel-Haenszel (MH) and item response theory likelihood ratio (IRT-LR) procedures. Further, the study identified two factors (i.e. reading…

Descriptors: Aptitude Tests, Academic Aptitude, Language Tests, Test Items

"Sprogtest": A Smart Test (or How to Develop a Reliable and Anonymous EFL Reading Test).

Peer reviewed

Dollerup, Cay; And Others – Language Testing, 1994

Examines a Danish English-language reading proficiency test offered to freshman students to diagnose weaknesses which may impede their academic careers. To facilitate the assessment of what parts can be transferred and used in other language areas, the article discusses the test construction, development and improvement. (11 references) (Author/CK)

Descriptors: College Students, Comparative Analysis, Danish, Data Analysis