Alpizar, David; Li, Tongyun; Norris, John M.; Gu, Lixiong – Language Testing, 2023
The C-test is a type of gap-filling test designed to efficiently measure second language proficiency. The typical C-test consists of several short paragraphs with the second half of every second word deleted. The words with deleted parts are considered as items nested within the corresponding paragraph. Given this testlet structure, it is commonly…
Descriptors: Psychometrics, Language Tests, Second Language Learning, Test Items
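The deletion rule described in the abstract above (the second half of every second word removed, with each damaged word treated as an item nested in its paragraph) can be sketched in a few lines. This is only an illustrative sketch, not the authors' procedure: the function names `make_c_test` and `score_testlet`, the underscore blanks, the choice to keep the smaller half of odd-length words, the skipping of one-letter words, and exact-match 0/1 scoring are all assumptions made here for the example.

```python
import re


def make_c_test(paragraph, blank="_"):
    """Return (damaged_text, answer_key) for one C-test paragraph.

    Assumptions (not from the source): words are runs of ASCII letters,
    counting starts at the first word, one-letter words are left intact,
    and odd-length words keep the smaller half (len // 2 letters).
    """
    key = []
    count = 0

    def damage(match):
        nonlocal count
        word = match.group(0)
        count += 1
        # Leave every first, third, fifth ... word and all one-letter words.
        if count % 2 == 1 or len(word) < 2:
            return word
        keep = len(word) // 2
        key.append(word)
        return word[:keep] + blank * (len(word) - keep)

    text = re.sub(r"[A-Za-z]+", damage, paragraph)
    return text, key


def score_testlet(responses, key):
    """Score each gap 0/1 by case-insensitive exact match (an assumption)."""
    return [int(r.strip().lower() == k.lower()) for r, k in zip(responses, key)]


text, key = make_c_test("The quick brown fox jumps over the lazy dog.")
print(text)  # The qu___ brown f__ jumps ov__ the la__ dog.
print(key)   # ['quick', 'fox', 'over', 'lazy']
print(score_testlet(["quick", "fix", "over", "lazy"], key))  # [1, 0, 1, 1]
```

Because every item in the returned key comes from the same paragraph, the per-paragraph list of 0/1 scores is the testlet structure the abstract refers to.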
Loukina, Anastassia; Zechner, Klaus; Yoon, Su-Youn; Zhang, Mo; Tao, Jidong; Wang, Xinhao; Lee, Chong Min; Mulholland, Matthew – ETS Research Report Series, 2017
This report presents an overview of the "SpeechRater" automated scoring engine model building and evaluation process for several item types with a focus on a low-English-proficiency test-taker population. We discuss each stage of speech scoring, including automatic speech recognition, filtering models for nonscorable responses, and…
Descriptors: Automation, Scoring, Speech Tests, Test Items
Ji-young Shin – ProQuest LLC, 2021
The present dissertation investigated the impact of scales/scoring methods and prompt linguistic features on the measurement quality of L2 English elicited imitation (EI). Scales/scoring methods are an important feature for the validity and reliability of L2 EI tests, but less is known about them (Yan et al., 2016). Prompt linguistic features are also known…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Semantics
Li, Shuai; Wen, Ting; Li, Xian; Feng, Yali; Lin, Chuan – Language Testing, 2023
This study compared holistic and analytic marking methods for their effects on parameter estimation (of examinees, raters, and items) and rater cognition in assessing speech act production in L2 Chinese. Seventy American learners of Chinese completed an oral Discourse Completion Test assessing requests and refusals. Four first-language (L1)…
Descriptors: Speech Acts, Second Language Learning, Second Language Instruction, Chinese
Gawliczek, Piotr; Krykun, Viktoriia; Tarasenko, Nataliya; Tyshchenko, Maksym; Shapran, Oleksandr – Advanced Education, 2021
The article deals with an innovative, cutting-edge solution within the language testing realm, namely computer adaptive language testing (CALT) in accordance with the NATO Standardization Agreement 6001 (NATO STANAG 6001) requirements for further implementation in foreign language training of personnel of the Armed Forces of Ukraine (AF of…
Descriptors: Computer Assisted Testing, Adaptive Testing, Language Tests, Second Language Instruction
Coniam, David; Lee, Tony; Milanovic, Michael; Pike, Nigel; Zhao, Wen – Language Education & Assessment, 2022
The calibration of test materials generally involves the interaction between empirical analysis and expert judgement. This paper explores the extent to which scale familiarity might affect expert judgement as a component of test validation in the calibration process. It forms part of a larger study that investigates the alignment of the…
Descriptors: Specialists, Language Tests, Test Validity, College Faculty
Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022
Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade, as technology has meaningfully restructured the methods of educational assessment. Given this controversy, various testing guidelines published on…
Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring
Aviad-Levitzky, Tami; Laufer, Batia; Goldstein, Zahava – Language Assessment Quarterly, 2019
This article describes the development and validation of the new CATSS (Computer Adaptive Test of Size and Strength), which measures vocabulary knowledge in four modalities -- productive recall, receptive recall, productive recognition, and receptive recognition. In the first part of the paper we present the assumptions that underlie the test --…
Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability
Song, Xiaomei – Asia-Pacific Education Researcher, 2018
Fairness and social justice have been the subject of much discussion in educational research, and concerns about fairness are paramount in the milieu of high-stakes admission testing. This study explored stakeholders' perceptions of the fairness of a high-stakes graduate school admission test, the Graduate School Entrance English Examination…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Teachers
Tian, Yan – Research-publishing.net, 2017
Translation is one of the items tested in many national English proficiency tests for non-English majors in China because translation competence is regarded as one of the productive language skills that can be used to assess learners' language proficiency. However, the feedback on translation exercises and self-tests is usually provided by…
Descriptors: Translation, English (Second Language), Second Language Learning, Second Language Instruction
Díaz, Erin McNulty – Hispania, 2018
In seeking to both confirm previous conclusions and expand the literature of the field with a different group of participants, McNulty (2012) was (partially) replicated. Three instructional interventions were designed to ascertain which activity type was responsible for learner gains. One treatment group (R) included referential-only practice…
Descriptors: Linguistic Input, Teaching Methods, Intervention, Control Groups
Campfield, Dorota E. – Language Testing, 2017
This paper reports a post-hoc analysis of the influence of the lexical difficulty of cue sentences on performance in an elicited imitation (EI) task to assess oral production skills for 645 child L2 English learners in instructional settings. This formed part of a large-scale investigation into the effectiveness of foreign language teaching in Polish…
Descriptors: Difficulty Level, Second Language Learning, Second Language Instruction, Elementary School Students
Wu, Mei – English Language Teaching, 2012
This paper compares the Public English Test System (PETS) administered in mainland China and the General English Proficiency Test (GEPT) administered in Taiwan, from the aspects of test levels, test contents and scoring weight. Compared with the PETS, the GEPT is found to value the English productive skills more, and have a greater ability to…
Descriptors: Foreign Countries, Second Language Instruction, Second Language Learning, Test Items
Ashwell, Tim; Elam, Jesse R. – JALT CALL Journal, 2017
The ultimate aim of our research project was to use the Google Web Speech API to automate scoring of elicited imitation (EI) tests. However, in order to achieve this goal, we had to take a number of preparatory steps. We needed to assess how accurate this speech recognition tool is in recognizing native speakers' production of the test items; we…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests
Tsopanoglou, Antonios; Ypsilandis, George S.; Mouti, Anna – Language Learning in Higher Education, 2014
Multiple-choice (MC) tests are frequently used to measure language competence because they are quick, economical and straightforward to score. While degrees of correctness have been investigated for partially correct responses in combined-response MC tests, degrees of incorrectness in distractors and the role they play in determining the…
Descriptors: Scoring, Pilot Projects, Multiple Choice Tests, Language Tests