Publication Date
In 2025: 1
Since 2024: 1
Since 2021 (last 5 years): 3
Since 2016 (last 10 years): 9
Since 2006 (last 20 years): 14
Descriptor
Comparative Analysis: 40
Language Proficiency: 40
Test Reliability: 40
Language Tests: 33
English (Second Language): 24
Test Validity: 23
Second Language Learning: 17
Foreign Countries: 16
Scores: 9
Second Language Instruction: 9
Test Construction: 9
Assessments and Surveys
ACTFL Oral Proficiency…: 2
English Proficiency Test: 1
Michigan Test of English…: 1
Effatpanah, Farshad; Baghaei, Purya; Tabatabaee-Yazdi, Mona; Babaii, Esmat – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is the sentence rather than the gap or the passage. That is, the gaps correctly reformulated in each sentence were aggregated into a sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
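A minimal sketch of the sentence-level scoring idea described in this entry, assuming a hypothetical data layout in which each response is a list of passages, each passage a list of sentences, and each sentence a list of per-gap correctness flags (none of these names come from the study):

from typing import List

def sentence_scores(passages: List[List[List[bool]]]) -> List[int]:
    """Aggregate the correctly reformulated gaps within each sentence into
    one polytomous sentence score, the unit of analysis proposed above."""
    scores = []
    for passage in passages:
        for sentence_gaps in passage:
            # A sentence's score is the count of its gaps restored correctly.
            scores.append(sum(sentence_gaps))
    return scores

# Hypothetical response: one passage, three sentences with 3, 2, and 4 gaps.
response = [[[True, True, False], [False, False], [True, True, True, True]]]
print(sentence_scores(response))  # -> [2, 0, 4]

Each sentence score would then be treated as a polytomous item in the subsequent item response analysis.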
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
Computerized adaptive tests (CAT) apply an adaptive process in which items are tailored to individuals' ability scores. Multidimensional CAT (MCAT) designs differ in the item selection, ability estimation, and termination methods being used. This study aims to investigate the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
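For readers unfamiliar with the adaptive process this abstract refers to, the sketch below shows a deliberately simplified unidimensional CAT loop (maximum-information item selection, EAP ability updating, fixed-length termination); the multidimensional designs compared in the study extend this same select-score-update cycle, and every parameter value here is invented for illustration:

import numpy as np

def p_correct(theta, a, b):
    """Two-parameter logistic (2PL) probability of a correct response."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def run_cat(a, b, true_theta, n_items=10, seed=0):
    """Minimal CAT loop: select the most informative unused item at the
    current ability estimate, score a simulated response, update the EAP
    estimate, and stop after a fixed number of items."""
    rng = np.random.default_rng(seed)
    grid = np.linspace(-4, 4, 81)           # quadrature grid for EAP updating
    posterior = np.exp(-0.5 * grid**2)      # standard-normal prior on ability
    used, theta_hat = [], 0.0
    for _ in range(n_items):
        p = p_correct(theta_hat, a, b)
        info = a**2 * p * (1.0 - p)         # Fisher information at theta_hat
        info[used] = -np.inf                # never administer an item twice
        item = int(np.argmax(info))         # maximum-information selection
        used.append(item)
        correct = rng.random() < p_correct(true_theta, a[item], b[item])
        like = p_correct(grid, a[item], b[item])
        posterior *= like if correct else (1.0 - like)
        theta_hat = float(np.sum(grid * posterior) / np.sum(posterior))
    return theta_hat, used

# Hypothetical 50-item pool with random 2PL parameters.
rng = np.random.default_rng(1)
a_pool, b_pool = rng.uniform(0.8, 2.0, 50), rng.normal(0.0, 1.0, 50)
print(run_cat(a_pool, b_pool, true_theta=0.7))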
Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021
Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software
Karami, Hossein; Kouhpaee Nejad, Mohammadhossein; Nourzadeh, Saeed; Ahmadi Shirazi, Masoumeh – International Journal of Bilingual Education and Bilingualism, 2020
This study set out to cross-validate a bilingual Persian-English version of the Vocabulary Size Test (VST) against the monolingual English version and to compare Iranian EFL learners' performance on the two versions. Various bilingual versions of the VST have been developed based on the assumption that bilingual versions are not affected by the…
Descriptors: Bilingualism, Indo European Languages, English (Second Language), Second Language Learning
Isbell, Dan; Winke, Paula – Language Testing, 2019
The American Council on the Teaching of Foreign Languages (ACTFL) Oral Proficiency Interview-computer (OPIc) testing system represents an ambitious effort in language assessment: assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…
Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning
Haberman, Shelby J.; Liu, Yang; Lee, Yi-Hsuan – ETS Research Report Series, 2019
Distractor analyses are routinely conducted in educational assessments with multiple-choice items. In this research report, we focus on three item response models for distractors: (a) the traditional nominal response (NR) model, (b) a combination of a two-parameter logistic model for item scores and an NR model for selections of incorrect…
Descriptors: Multiple Choice Tests, Scores, Test Reliability, High Stakes Tests
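As a compact reminder of the two model families named in this entry (the notation below is the standard textbook form, not taken from the report), the nominal response model gives the probability of selecting option k of item i as

P(X_i = k \mid \theta) = \frac{\exp(a_{ik}\theta + c_{ik})}{\sum_{h=1}^{m_i} \exp(a_{ih}\theta + c_{ih})},

while the two-parameter logistic model for the dichotomous item score is

P(X_i = 1 \mid \theta) = \frac{1}{1 + \exp[-a_i(\theta - b_i)]}.

In the combined approach the abstract describes, a 2PL-type model handles the correct/incorrect item score and an NR-type model handles the choice among the incorrect options.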
Longabach, Tanya; Peyton, Vicki – Language Testing, 2018
K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…
Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency
Thompson, Gregory L.; Cox, Troy L.; Knapp, Nieves – Foreign Language Annals, 2016
While studies have evaluated the validity and reliability of the Oral Proficiency Interview (OPI) and Oral Proficiency Interview-Computer (OPIc) independently, little research has analyzed the inter-exam reliability of these tests, and studies have yet to be conducted comparing the results of Spanish language learners who take…
Descriptors: Comparative Analysis, Oral Language, Language Proficiency, Spanish
Retnawati, Heri – Turkish Online Journal of Educational Technology - TOJET, 2015
This study aimed to compare the accuracy of Test of English Proficiency (TOEP) scores obtained from a paper and pencil test (PPT) versus a computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and data from the CBT TOEP documented in 2013-2014 on sets 1A, 2A, and 3A for the Listening and…
Descriptors: Scores, Accuracy, Computer Assisted Testing, English (Second Language)
Negari, Giti Mousapour; Azizi, Aliye; Arani, Davood Khedmatkar – International Journal of Instruction, 2018
The present study investigated the effects of audio input enhancement on EFL learners' retention of intensifiers. To this end, two research questions were formulated and two corresponding null hypotheses were tested. A pretest-posttest control group quasi-experimental design was employed to…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Quasiexperimental Design
Kucuk, Funda; Walters, JoDee – ELT Journal, 2009
This article reports on a study of the validity and reliability of tests administered in an EFL university setting. The study addresses the question of how well face validity reflects more objective measures of the quality of a test, such as predictive validity and reliability. According to some researchers, face validity, defined as the surface…
Descriptors: Language Tests, Test Validity, Achievement Tests, English (Second Language)

Stansfield, Charles W.; Kenyon, Dorry Mann – System, 1992
Reviews research that sheds light on the comparability of the Oral Proficiency Interview and the Simulated Oral Proficiency Interview. Suggestions are provided for further research. (16 references) (VWL)
Descriptors: Comparative Analysis, Interviews, Language Proficiency, Language Tests
Stansfield, Charles W. – 1990
The simulated oral proficiency interview (SOPI) is a semi-direct speaking test that models the format of the oral proficiency interview (OPI). The OPI is a method of assessing general speaking proficiency in a second language. The SOPI is a tape-recorded test consisting of six parts: simple personal background questions posed in a simulated…
Descriptors: Comparative Analysis, Interviews, Language Proficiency, Language Tests

Pang, Lee Yick – International Review of Applied Linguistics in Language Teaching, 1984
Examines and contests the claim that all language tests are in reality testing the same underlying ability, one very similar to the Spearman g-factor for intelligence. Conclusions indicate that the argument for the existence of a g-factor in language tests is not tenable on statistical grounds. (SL)
Descriptors: Comparative Analysis, Intelligence, Language Proficiency, Language Tests

Oller, John W., Jr.; Inal, Nevin – TESOL Quarterly, 1971
Descriptors: Cloze Procedure, Comparative Analysis, Educational Experiments, English (Second Language)