ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	7
Since 2017 (last 10 years)	21
Since 2007 (last 20 years)	29

Descriptor

Comparative Analysis	63
Language Proficiency	63
Language Tests	47
English (Second Language)	41
Test Reliability	40
Second Language Learning	33
Foreign Countries	29
Test Validity	24
Second Language Instruction	21
Interrater Reliability	17
Oral Language	15
College Students	13
Scores	13
Interviews	11
Reliability	10
Test Construction	10
Testing	10
Evaluators	9
Rating Scales	9
Speech Communication	9
Computer Assisted Testing	8
Higher Education	8
Statistical Analysis	7
Teaching Methods	7
Test Format	7

Publication Type

Journal Articles	42
Reports - Research	42
Reports - Evaluative	8
Speeches/Meeting Papers	8
Reports - Descriptive	6
Tests/Questionnaires	5
Information Analyses	2
Book/Product Reviews	1
Collected Works - Proceedings	1
Dissertations/Theses -…	1
Dissertations/Theses -…	1
Guides - Non-Classroom	1
Opinion Papers	1

Education Level

Higher Education	13
Postsecondary Education	11
Elementary Education	3
Early Childhood Education	2
Elementary Secondary Education	2
High Schools	1
Kindergarten	1
Preschool Education	1
Primary Education	1
Secondary Education	1
Two Year Colleges	1

Location

Iran	7
China	4
Europe	2
Japan	2
Taiwan	2
Australia	1
Cyprus	1
Denmark	1
Indonesia	1
Israel	1
Jamaica	1
Philippines	1
Russia	1
Sweden	1
Texas	1
Thailand	1
United Kingdom (Great Britain)	1
United Kingdom (Reading)	1
United States	1

Assessments and Surveys

Test of English as a Foreign…	4
ACTFL Oral Proficiency…	2
Child Behavior Checklist	1
English Proficiency Test	1
International English…	1
Michigan Test of English…	1
National Assessment of Adult…	1

Showing 1 to 15 of 63 results

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

The Intersection of AI and Language Assessment: A Study on the Reliability of ChatGPT in Grading IELTS Writing Task 2

Peer reviewed

Osama Koraishi – Language Teaching Research Quarterly, 2024

This study conducts a comprehensive quantitative evaluation of OpenAI's language model, ChatGPT 4, for grading Task 2 writing of the IELTS exam. The objective is to assess the alignment between ChatGPT's grading and that of official human raters. The analysis encompassed a multifaceted approach, including a comparison of means and reliability…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Artificial Intelligence

Crowdsourced Adaptive Comparative Judgment: A Community-Based Solution for Proficiency Rating

Peer reviewed

Paquot, Magali; Rubin, Rachel; Vandeweerd, Nathan – Language Learning, 2022

The main objective of this Methods Showcase Article is to show how the technique of adaptive comparative judgment, coupled with a crowdsourcing approach, can offer practical solutions to reliability issues as well as to address the time and cost difficulties associated with a text-based approach to proficiency assessment in L2 research. We…

Descriptors: Comparative Analysis, Decision Making, Language Proficiency, Reliability

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Can Recall Data Be Trusted? Evaluating Reliability of Interview Data on Traditional Multilingualism in Highland Daghestan

Peer reviewed

Daniel, Michael; Koshevoy, Alexey; Schurov, Ilya; Dobrushina, Nina – Field Methods, 2022

In this article, we address the issue of reliability of quantitative data on multilingualism of the past obtained as recall data. More specifically, we investigate whether the interviewees' assessments of the language repertoires of their late relatives (indirect data) provide results that are quantitatively similar to those obtained from the…

Descriptors: Recall (Psychology), Multilingualism, Artificial Intelligence, Second Languages

Elicited Imitation as a Measure of L2 Proficiency: New Insights from a Comparison of Two L2 English Parallel Forms

Peer reviewed

Wu, Shu-Ling; Tio, Yee Pin; Ortega, Lourdes – Studies in Second Language Acquisition, 2022

Elicited imitation (EI), a short-cut measure of global proficiency in second language (L2) research, requires participants to listen to sentences and repeat them as closely as possible. To support instrument sharing and assessment of L2 proficiency for longitudinal and crosslinguistic research, we created a parallel form of an EI task (EIT) for L2…

Descriptors: Imitation, Second Language Learning, Second Language Instruction, Language Proficiency

Assessing L2 English Speaking Using Automated Scoring Technology: Examining Automarker Reliability

Peer reviewed

Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021

Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…

Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software

Rater Effects on L2 Oral Assessment: Focusing on Accent Familiarity of L2 Teachers

Peer reviewed

Park, Mi Sun – Language Assessment Quarterly, 2020

In the present study, I examined the effects of rater characteristics, in particular, raters' familiarity with a foreign accent, on the assessment of second language (L2) pronunciation. Forty-three native English-speaking teachers were divided into three groups according to their reported types of familiarity with Korean accents: heritage,…

Descriptors: Evaluators, Familiarity, Second Language Learning, English (Second Language)

Validation of a Bilingual Version of the Vocabulary Size Test: Comparison with the Monolingual Version

Peer reviewed

Karami, Hossein; Kouhpaee Nejad, Mohammadhossein; Nourzadeh, Saeed; Ahmadi Shirazi, Masoumeh – International Journal of Bilingual Education and Bilingualism, 2020

This study was set to cross-validate a bilingual Persian-English version of the Vocabulary Size Test (VST) against the monolingual English version and compare Iranian EFL learners' performance on the two versions. Various bilingual versions of the VST have been developed based on the assumption that bilingual versions are not affected by the…

Descriptors: Bilingualism, Indo European Languages, English (Second Language), Second Language Learning

The Effects of Proficiency and Study-Abroad on Chinese EFL Learners' Refusals

Peer reviewed

Wang, Yuqi; Ren, Wei – Language Learning Journal, 2022

L2 pragmatics have explored the effects of different factors on different aspects of learners' pragmatic performance, but often not simultaneously. In addition, syntactic complexity is rarely examined in L2 pragmatics. This cross-sectional study aimed to conduct a multidimensional analysis to explore the effects of proficiency and study-abroad…

Descriptors: Pragmatics, Second Language Learning, Second Language Instruction, English (Second Language)

ACTFL Oral Proficiency Interview -- Computer (OPIc)

Peer reviewed

Isbell, Dan; Winke, Paula – Language Testing, 2019

The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…

Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning

Distractor Analysis for Multiple-Choice Tests: An Empirical Study with International Language Assessment Data. Research Report. ETS RR-19-39

Peer reviewed

Haberman, Shelby J.; Liu, Yang; Lee, Yi-Hsuan – ETS Research Report Series, 2019

Distractor analyses are routinely conducted in educational assessments with multiple-choice items. In this research report, we focus on three item response models for distractors: (a) the traditional nominal response (NR) model, (b) a combination of a two-parameter logistic model for item scores and a NR model for selections of incorrect…

Descriptors: Multiple Choice Tests, Scores, Test Reliability, High Stakes Tests

Mapping the Fluctuating Effect of Strategy Use Ability on English Reading Performance for Nursing Students: A Multi-Layered Moderation Analysis Approach

Peer reviewed

Cai, Yuyang; Kunnan, Antony John – Language Testing, 2020

An essential hypothesis of modern language assessment theory pertains to the interaction between strategy use ability (strategic competence) and second language knowledge. However, how they interact with each other is rarely explored. Drawing on relevant research in the literature, in this paper we proposed three interaction patterns (i.e.,…

Descriptors: English (Second Language), Second Language Learning, Nursing Education, Reading Tests

Differences in Less Proficient and More Proficient ESL College Writing in the Philippine Setting

Gustilo, Leah E. – Online Submission, 2016

The present study aimed at characterizing what skilled or more proficient ESL college writing is in the Philippine setting through a contrastive analysis of three groups of variables identified from previous studies: resources, processes, and performance of ESL writers. Based on Chenoweth and Hayes' (2001; 2003) framework, the resource level…

Descriptors: Language Proficiency, English (Second Language), Second Language Learning, Foreign Countries

A Comparison of Reliability and Precision of Subscore Reporting Methods for a State English Language Proficiency Assessment

Peer reviewed

Longabach, Tanya; Peyton, Vicki – Language Testing, 2018

K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…

Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency

Source

Language Testing	8
Foreign Language Annals	2
Language Learning	2
Language Learning Journal	2
System	2
Assessment in Education:…	1
Association for Educational…	1
Canadian Modern Language…	1
Cross Currents	1
ELT Journal	1
ETS Research Report Series	1
Edinburgh Working Papers in…	1
Education and Information…	1
Elementary School Journal	1
English Language Teaching	1
Field Methods	1
International Journal of…	1
International Journal of…	1
International Review of…	1
JALT CALL Journal	1
Journal of Research on…	1
Language Assessment Quarterly	1
Language Teaching Research…	1
Language Testing in Asia	1
Modern Language Journal	1

Author

Stansfield, Charles W.	3
Brown, James Dean	2
Henning, Grant	2
Adams, R. J.	1
Ahmadi Shirazi, Masoumeh	1
Ahour, Touran	1
Alderson, J. Charles, Ed.	1
Arani, Davood Khedmatkar	1
Ardasheva, Yuliya	1
Arth, Thomas O.	1
August, Diane	1
Azizi, Aliye	1
Barbour, Ross Patrick	1
Blanc, Oscar	1
Burgess, Thomas C.	1
Cai, Yuyang	1
Clark, John L. D.	1
Cox, Troy L.	1
Daniel, Michael	1
Dobrushina, Nina	1
Dollerup, Cay	1
Entezari Maleki, Saeideh	1
Esmat Babaii	1
Farshad Effatpanah	1