ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	13
Since 2006 (last 20 years)	23

Descriptor

Second Language Learning	34
Test Items	34
Scoring	33
Language Tests	26
English (Second Language)	21
Foreign Countries	15
Language Proficiency	14
Item Analysis	12
Second Language Instruction	12
Test Validity	11
Computer Assisted Testing	8
Test Construction	8
Scores	7
Test Reliability	7
College Students	6
Grammar	6
Comparative Analysis	5
Correlation	5
Difficulty Level	5
Reading Tests	5
Spanish	5
Testing	5
Bilingual Education	4
Higher Education	4
Item Response Theory	4

Publication Type

Journal Articles	22
Reports - Research	22
Guides - General	4
Reports - Descriptive	4
Tests/Questionnaires	4
Speeches/Meeting Papers	3
Dissertations/Theses -…	2
Guides - Non-Classroom	2
Reports - Evaluative	2

Education Level

Higher Education	7
Postsecondary Education	4
Elementary Education	1
High Schools	1
Secondary Education	1

Audience

Practitioners	1
Teachers	1

Location

China	5
Japan	2
Europe	1
Iran	1
Israel	1
Poland	1
Spain	1
Taiwan	1
Ukraine	1

Assessments and Surveys

Test of English as a Foreign…	3
Computer Attitude Scale	1

Showing 1 to 15 of 34 results

Psychometric Approaches to Analyzing C-Tests

Peer reviewed

Alpizar, David; Li, Tongyun; Norris, John M.; Gu, Lixiong – Language Testing, 2023

The C-test is a type of gap-filling test designed to efficiently measure second language proficiency. The typical C-test consists of several short paragraphs with the second half of every second word deleted. The words with deleted parts are considered as items nested within the corresponding paragraph. Given this testlet structure, it is commonly…

Descriptors: Psychometrics, Language Tests, Second Language Learning, Test Items

Performance of Automated Speech Scoring on Different Low- to Medium-Entropy Item Types for Low-Proficiency English Learners. Research Report. ETS RR-17-12

Peer reviewed

Loukina, Anastassia; Zechner, Klaus; Yoon, Su-Youn; Zhang, Mo; Tao, Jidong; Wang, Xinhao; Lee, Chong Min; Mulholland, Matthew – ETS Research Report Series, 2017

This report presents an overview of the "SpeechRater" automated scoring engine model building and evaluation process for several item types with a focus on a low-English-proficiency test-taker population. We discuss each stage of speech scoring, including automatic speech recognition, filtering models for nonscorable responses, and…

Descriptors: Automation, Scoring, Speech Tests, Test Items

Towards Optimal Measurement and Theoretical Grounding of L2 English Elicited Imitation: Examining Scales, (Mis)Fits, and Prompt Features from Item Response Theory and Random Forest Approaches

Ji-young Shin – ProQuest LLC, 2021

The present dissertation investigated the impact of scales/scoring methods and prompt linguistic features on the measurement quality of L2 English elicited imitation (EI). Scales/scoring methods are an important feature for the validity and reliability of L2 EI tests, but less is known (Yan et al., 2016). Prompt linguistic features are also known…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Semantics

Comparing Holistic and Analytic Marking Methods in Assessing Speech Act Production in L2 Chinese

Peer reviewed

Li, Shuai; Wen, Ting; Li, Xian; Feng, Yali; Lin, Chuan – Language Testing, 2023

This study compared holistic and analytic marking methods for their effects on parameter estimation (of examinees, raters, and items) and rater cognition in assessing speech act production in L2 Chinese. Seventy American learners of Chinese completed an oral Discourse Completion Test assessing requests and refusals. Four first-language (L1)…

Descriptors: Speech Acts, Second Language Learning, Second Language Instruction, Chinese

Computer Adaptive Language Testing According to NATO STANAG 6001 Requirements

Peer reviewed

Gawliczek, Piotr; Krykun, Viktoriia; Tarasenko, Nataliya; Tyshchenko, Maksym; Shapran, Oleksandr – Advanced Education, 2021

The article deals with an innovative, cutting-edge solution within the language testing realm, namely computer adaptive language testing (CALT) in accordance with the NATO Standardization Agreement 6001 (NATO STANAG 6001) requirements for further implementation in foreign language training of personnel of the Armed Forces of Ukraine (AF of…

Descriptors: Computer Assisted Testing, Adaptive Testing, Language Tests, Second Language Instruction

The Role of Expert Judgement in Language Test Validation

Peer reviewed

Coniam, David; Lee, Tony; Milanovic, Michael; Pike, Nigel; Zhao, Wen – Language Education & Assessment, 2022

The calibration of test materials generally involves the interaction between empirical analysis and expert judgement. This paper explores the extent to which scale familiarity might affect expert judgement as a component of test validation in the calibration process. It forms part of a larger study that investigates the alignment of the…

Descriptors: Specialists, Language Tests, Test Validity, College Faculty

Computerized Testing in Reading Comprehension Skill: Investigating Score Interchangeability, Item Review, Age and Gender Stereotypes, ICT Literacy and Computer Attitudes

Peer reviewed

Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022

Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade, as technology has meaningfully restructured methods of educational assessment. Given this controversy, various testing guidelines published on…

Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring

The New Computer Adaptive Test of Size and Strength (CATSS): Development and Validation

Peer reviewed

Aviad-Levitzky, Tami; Laufer, Batia; Goldstein, Zahava – Language Assessment Quarterly, 2019

This article describes the development and validation of the new CATSS (Computer Adaptive Test of Size and Strength), which measures vocabulary knowledge in four modalities -- productive recall, receptive recall, productive recognition, and receptive recognition. In the first part of the paper we present the assumptions that underlie the test --…

Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability

The Fairness of a Graduate School Admission Test in China: Voices from Administrators, Teachers, and Test-Takers

Peer reviewed

Song, Xiaomei – Asia-Pacific Education Researcher, 2018

Fairness and social justice have been the subject of much discussion in educational research, and concerns about fairness are paramount in the milieu of high-stakes admission testing. This study explored stakeholders' perceptions of the fairness of a high-stakes graduate school admission test, the Graduate School Entrance English Examination…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Teachers

A Dynamic Online System for Translation Learning and Testing

Peer reviewed

Tian, Yan – Research-publishing.net, 2017

Translation is one of the items tested in many national English proficiency tests for non-English majors in China because translation competence is regarded as one of the productive language skills which could be used to assess learners' language proficiency. However, the feedback on translation exercises and self-tests is usually provided by…

Descriptors: Translation, English (Second Language), Second Language Learning, Second Language Instruction

Replication and Expansion: Activity Type in Processing Instruction's Structured Input

Peer reviewed

Díaz, Erin McNulty – Hispania, 2018

In seeking to both confirm previous conclusions and expand the literature of the field with a different group of participants, McNulty (2012) was (partially) replicated. Three instructional interventions were designed to ascertain which activity type was responsible for learner gains. One treatment group (R) included referential-only practice…

Descriptors: Linguistic Input, Teaching Methods, Intervention, Control Groups

Lexical Difficulty--Using Elicited Imitation to Study Child L2

Peer reviewed

Campfield, Dorota E. – Language Testing, 2017

This paper reports a post-hoc analysis of the influence of lexical difficulty of cue sentences on performance in an elicited imitation (EI) task to assess oral production skills for 645 child L2 English learners in instructional settings. This formed part of a large-scale investigation into the effectiveness of foreign language teaching in Polish…

Descriptors: Difficulty Level, Second Language Learning, Second Language Instruction, Elementary School Students

Comparing PETS and GEPT in China and Taiwan

Peer reviewed

Wu, Mei – English Language Teaching, 2012

This paper compares the Public English Test System (PETS) administered in mainland China and the General English Proficiency Test (GEPT) administered in Taiwan, from the aspects of test levels, test contents and scoring weight. Compared with the PETS, the GEPT is found to value the English productive skills more, and have a greater ability to…

Descriptors: Foreign Countries, Second Language Instruction, Second Language Learning, Test Items

How Accurately Can the Google Web Speech API Recognize and Transcribe Japanese L2 English Learners' Oral Production?

Peer reviewed

Ashwell, Tim; Elam, Jesse R. – JALT CALL Journal, 2017

The ultimate aim of our research project was to use the Google Web Speech API to automate scoring of elicited imitation (EI) tests. However, in order to achieve this goal, we had to take a number of preparatory steps. We needed to assess how accurate this speech recognition tool is in recognizing native speakers' production of the test items; we…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests

Piloting a Polychotomous Partial-Credit Scoring Procedure in a Multiple-Choice Test

Peer reviewed

Tsopanoglou, Antonios; Ypsilandis, George S.; Mouti, Anna – Language Learning in Higher Education, 2014

Multiple-choice (MC) tests are frequently used to measure language competence because they are quick, economical and straightforward to score. While degrees of correctness have been investigated for partially correct responses in combined-response MC tests, degrees of incorrectness in distractors and the role they play in determining the…

Descriptors: Scoring, Pilot Projects, Multiple Choice Tests, Language Tests


Source

Language Testing	5
ETS Research Report Series	2
English Language Teaching	2
ProQuest LLC	2
Advanced Education	1
Asia-Pacific Education…	1
Classical Outlook	1
Education and Information…	1
Educational and Psychological…	1
English Teaching Forum	1
Hispania	1
JALT CALL Journal	1
Language Assessment Quarterly	1
Language Education &…	1
Language Learning	1
Language Learning in Higher…	1
Modern Language Journal	1
Online Submission	1
Research-publishing.net	1

Author

De Avila, Edward A.	2
Duncan, Sharon E.	2
Zhang, Mo	2
Abdellah, Antar Solhy	1
Alpizar, David	1
Ashwell, Tim	1
Aviad-Levitzky, Tami	1
Breyer, F. Jay	1
Campfield, Dorota E.	1
Chalhoub-Deville, Micheline	1
Chapelle, Carol A.	1
Chung, Yoo-Ree	1
Coniam, David	1
Deville, Craig W.	1
Díaz, Erin McNulty	1
Elam, Jesse R.	1
Erickson, Gerald	1
Feng, Yali	1
Fitzpatrick, Steven J.	1
Gawliczek, Piotr	1
Goldstein, Zahava	1
Gu, Lixiong	1
Hegelheimer, Volker	1
Henning, Grant	1