ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	9

Descriptor

Item Analysis	11
Second Language Instruction	11
Test Reliability	11
English (Second Language)	8
Second Language Learning	8
Test Items	8
Test Validity	7
Foreign Countries	6
Language Tests	6
Undergraduate Students	5
Grammar	4
Test Construction	4
Factor Analysis	3
Item Response Theory	3
Multiple Choice Tests	3
Accuracy	2
Comparative Analysis	2
Construct Validity	2
Correlation	2
Difficulty Level	2
Models	2
Native Language	2
Pragmatics	2
Spanish	2
Universities	2
More ▼

Source

International Journal of…	3
CALICO Journal	1
Computer Assisted Language…	1
ETS Research Report Series	1
LEARN Journal: Language…	1
Language Assessment Quarterly	1
Language Teaching Research…	1
PROFILE: Issues in Teachers'…	1
ProQuest LLC	1

Publication Type

Journal Articles	10
Reports - Research	9
Tests/Questionnaires	2
Dissertations/Theses -…	1
Reports - Descriptive	1

Education Level

Higher Education	6
Postsecondary Education	6
Adult Education	2
Early Childhood Education	1
Junior High Schools	1
Middle Schools	1
Primary Education	1
Secondary Education	1

Audience

Location

China	2
Iran	2
Asia	1
Colombia	1
Iraq	1
Italy	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed
PDF on ERIC

Download full text

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

The Development and Initial Validation of O-WSVLT, a Meaning-Recall Online L2 Spanish Vocabulary Levels Test

Peer reviewed

Direct link

Pablo Robles-García; Stuart McLean; Jeffrey Stewart; Ji-young Shin; Claudia Helena Sánchez-Gutiérrez – Language Assessment Quarterly, 2024

Recent literature in the field of L2 vocabulary assessment has advocated for the development of written receptive vocabulary tests such as Vocabulary Levels Tests (VLTs) that use: (a) meaning-recall item formats, (b) a minimum of 40 item counts per 1,000-frequency band to improve level estimates, and (c) lemmas (not word-families) as the lexical…

Descriptors: Spanish, Test Validity, Test Construction, Vocabulary Development

Towards Optimal Measurement and Theoretical Grounding of L2 English Elicited Imitation: Examining Scales, (Mis)Fits, and Prompt Features from Item Response Theory and Random Forest Approaches

Direct link

Ji-young Shin – ProQuest LLC, 2021

The present dissertation investigated the impact of scales/scoring methods and prompt linguistic features on the measurement quality of L2 English elicited imitation (EI). Scales/scoring methods are an important feature for the validity and reliability of L2 EI test, but less is known (Yan et al., 2016). Prompt linguistic features are also known…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Semantics

Distractor Analysis in Multiple-Choice Items Using the Rasch Model

Peer reviewed
PDF on ERIC

Download full text

Omarov, Nazarbek Bakytbekovich; Mohammed, Aisha; Alghurabi, Ammar Muhi Khleel; Alallo, Hajir Mahmood Ibrahim; Ali, Yusra Mohammed; Hassan, Aalaa Yaseen; Demeuova, Lyazat; Viktorovna, Shvedova Irina; Nazym, Bekenova; Al Khateeb, Nashaat Sultan Afif – International Journal of Language Testing, 2023

The Multiple-choice (MC) item format is commonly used in educational assessments due to its economy and effectiveness across a variety of content domains. However, numerous studies have examined the quality of MC items in high-stakes and higher-education assessments and found many flawed items, especially in terms of distractors. These faulty…

Descriptors: Test Items, Multiple Choice Tests, Item Response Theory, English (Second Language)

Concurrent Validity of LLAMA_F: Measure of Language Analytic Ability as a Predictor of Morphosyntax Knowledge

Peer reviewed
PDF on ERIC

Download full text

Kim, Peter – Language Teaching Research Quarterly, 2021

Foreign language aptitude is defined as one's potential to learn a second language. A language learner with higher aptitude is predicted to learn more, faster, and reach a higher level of proficiency. If this is the case, one way to validate the construct of aptitude and its measure is to conduct a validation study in which measures of aptitude is…

Descriptors: Morphology (Languages), Syntax, Second Language Learning, Second Language Instruction

Properties of Single-Response and Double-Response Multiple-Choice Grammar Items

Peer reviewed
PDF on ERIC

Download full text

Baghaei, Purya; Dourakhshan, Alireza – International Journal of Language Testing, 2016

The purpose of the present study is to compare the psychometric qualities of canonical single-response multiple-choice items with their double-response counterparts. Thirty, two-response fouroption grammar items for undergraduate students of English were constructed. A second version of the test was constructed by replacing one of the correct…

Descriptors: Language Tests, Multiple Choice Tests, Test Items, Factor Analysis

Aligning English Language Testing with Curriculum

Peer reviewed
PDF on ERIC

Download full text

Palacio, Marcela; Gaviria, Sandra; Brown, James Dean – PROFILE: Issues in Teachers' Professional Development, 2016

Frustrations with traditional testing led a group of teachers at the English for adults program at Universidad EAFIT (Colombia) to design tests aligned with the institutional teaching philosophy and classroom practices. This article reports on a study of an item-by-item evaluation of a series of English exams for validity and reliability in an…

Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Second Language Instruction

Developing an Interlanguage Pragmatic Competence Test on Routines in a Chinese EFL Context

Peer reviewed
PDF on ERIC

Download full text

Xu, Lan; Wannaruk, Anchalee – LEARN Journal: Language Education and Acquisition Research Network, 2016

Performing routines in interlanguage is vitally important for EFL learners since it can cause embarrassment between speakers from different cultures. The present study aims to 1) investigate the reliability and validity of an interlanguge pragmatic competence test on routines in a Chinese EFL context with multiple choice discourse completion task…

Descriptors: Language Tests, Test Construction, Pragmatics, Interlanguage

Examining the Internal Structure of the Test of English-for-Teaching ("TEFT"™). Research Report. ETS RR-15-16

Peer reviewed
PDF on ERIC

Download full text

Gu, Lin; Turkan, Sultan; Gomez, Pablo Garcia – ETS Research Report Series, 2015

ELTeach is an online professional development program developed by Educational Testing Service (ETS) in collaboration with National Geographic Learning. The ELTeach program consists of two courses: English-for-Teaching and Professional Knowledge for English Language Teaching (ELT). Each course includes a coordinated assessment leading to a score…

Descriptors: Item Analysis, Test Items, English (Second Language), Second Language Instruction

Testing Computer Assisted Language Testing: Towards a Checklist for CALT.

Peer reviewed

Noijons, Jose – CALICO Journal, 1994

Defines computer assisted language testing (CALT), discusses the various processes involved, outlines the advantages and disadvantages, and examines psychometric aspects of computer testing. A table of factors distinguishes between test content and the mechanics of test taking. These factors constitute a table for developing a CALT checklist. (24…

Descriptors: Check Lists, Computer Assisted Testing, Factor Analysis, Feedback

A Computer Attitude Scale for Language Teachers.

Peer reviewed

Daud, Nuraihan Mat – Computer Assisted Language Learning, 1995

Discusses the development of a scale to measure variables that may have an effect on teacher's affective attitudes towards computer-assisted language learning. Both qualitative and quantitative methodologies were used in the development of the scale to ensure its validity and reliability. (13 references) (Author/CK)

Descriptors: Affective Behavior, Case Studies, Computer Assisted Instruction, Construct Validity

Ji-young Shin	2
Al Khateeb, Nashaat Sultan…	1
Alallo, Hajir Mahmood Ibrahim	1
Alghurabi, Ammar Muhi Khleel	1
Ali, Yusra Mohammed	1
Baghaei, Purya	1
Brown, James Dean	1
Claudia Helena…	1
Daud, Nuraihan Mat	1
Demeuova, Lyazat	1
Dourakhshan, Alireza	1
Gaviria, Sandra	1
Golam Reza Rohani	1
Gomez, Pablo Garcia	1
Gu, Lin	1
Hamdollah Ravand	1
Hassan, Aalaa Yaseen	1
Jeffrey Stewart	1
Kim, Peter	1
Mohammed, Aisha	1
Nazym, Bekenova	1
Noijons, Jose	1
Omarov, Nazarbek Bakytbekovich	1
Pablo Robles-García	1
Palacio, Marcela	1
More ▼