ERIC - Search Results

Publication Date

In 2025	3
Since 2024	3
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	10
Since 2006 (last 20 years)	20

Descriptor

Item Analysis	37
Language Tests	37
Test Reliability	37
English (Second Language)	22
Test Validity	22
Foreign Countries	18
Second Language Learning	14
Test Items	14
Test Construction	13
Language Proficiency	12
Comparative Analysis	7
Multiple Choice Tests	7
Testing	7
Undergraduate Students	7
Second Language Instruction	6
Achievement Tests	5
Difficulty Level	5
Grammar	5
Higher Education	5
Item Response Theory	5
Scoring	5
Cloze Procedure	4
Criterion Referenced Tests	4
Scores	4
Statistical Analysis	4
More ▼

Publication Type

Reports - Research	26
Journal Articles	21
Speeches/Meeting Papers	4
Reports - Descriptive	3
Tests/Questionnaires	2
Dissertations/Theses -…	1
Guides - Non-Classroom	1
Reports - Evaluative	1

Education Level

Higher Education	9
Postsecondary Education	9
Adult Education	1
Early Childhood Education	1
Elementary Education	1
Grade 8	1
High Schools	1
Junior High Schools	1
Middle Schools	1
Primary Education	1
Secondary Education	1
More ▼

Audience

Practitioners

Location

Iran	5
China	2
Asia	1
China (Guangzhou)	1
Connecticut	1
Europe	1
Indonesia	1
Iraq	1
Italy	1
Japan	1
Pakistan	1
Russia	1
Saudi Arabia	1
Sudan	1
Thailand	1
Vietnam	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

General Educational…	1
Test of English as a Foreign…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 37 results Save | Export

A Comparison of Yen's Q3 Coefficient and Rasch Testlet Modeling for Identifying Local Item Dependence: Evidence from Two Vocabulary Matching Tests

Peer reviewed

Direct link

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025

This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…

Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed
PDF on ERIC

Download full text

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Direct link

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

Development of a High-Level Thinking Skills Test (HOTS) in English Writing

Peer reviewed
PDF on ERIC

Download full text

Mardiana – Eurasian Journal of Applied Linguistics, 2023

Written inquiries, which are more frequent and have less of a focus on complex thinking, are issues at school. Students are not taught how to respond to questions found in High-Level Thinking Skills (HOTS) tests, hence, their thinking abilities are generally weak. The issue for teachers is that neither they nor anyone else has been able to create…

Descriptors: Skill Development, Thinking Skills, Check Lists, Models

Numeral-Based English and Arabic Formulaic Expressions: Cultural, Linguistic and Translation Issues

Download full text

Al-Jarf, Reima – Online Submission, 2023

This study explores the similarities and differences between English and Arabic numeral-based formulaic expressions, and difficulties that student-translators have with them. A corpus of English and Arabic numeral-based formulaic expressions containing zero, two, three, twenty, sixty, hundred, thousand…etc., and another corpus of specialized…

Descriptors: Translation, Arabic, Contrastive Linguistics, Phrase Structure

Developing a Standardized English Proficiency Test in Alignment with the CEFR

Peer reviewed
PDF on ERIC

Download full text

Cheewasukthaworn, Kanchana – PASAA: Journal of Language Teaching and Learning in Thailand, 2022

In 2016, the Office of the Higher Education Commission issued a directive requiring all higher education institutions in Thailand to have their students take a standardized English proficiency test. According to the directive, the test's results had to align with the Common European Framework of Reference for Languages (CEFR). In response to this…

Descriptors: Test Construction, Standardized Tests, Language Tests, English (Second Language)

Distractor Analysis in Multiple-Choice Items Using the Rasch Model

Peer reviewed
PDF on ERIC

Download full text

Omarov, Nazarbek Bakytbekovich; Mohammed, Aisha; Alghurabi, Ammar Muhi Khleel; Alallo, Hajir Mahmood Ibrahim; Ali, Yusra Mohammed; Hassan, Aalaa Yaseen; Demeuova, Lyazat; Viktorovna, Shvedova Irina; Nazym, Bekenova; Al Khateeb, Nashaat Sultan Afif – International Journal of Language Testing, 2023

The Multiple-choice (MC) item format is commonly used in educational assessments due to its economy and effectiveness across a variety of content domains. However, numerous studies have examined the quality of MC items in high-stakes and higher-education assessments and found many flawed items, especially in terms of distractors. These faulty…

Descriptors: Test Items, Multiple Choice Tests, Item Response Theory, English (Second Language)

Concurrent Validity of LLAMA_F: Measure of Language Analytic Ability as a Predictor of Morphosyntax Knowledge

Peer reviewed
PDF on ERIC

Download full text

Kim, Peter – Language Teaching Research Quarterly, 2021

Foreign language aptitude is defined as one's potential to learn a second language. A language learner with higher aptitude is predicted to learn more, faster, and reach a higher level of proficiency. If this is the case, one way to validate the construct of aptitude and its measure is to conduct a validation study in which measures of aptitude is…

Descriptors: Morphology (Languages), Syntax, Second Language Learning, Second Language Instruction

Properties of Single-Response and Double-Response Multiple-Choice Grammar Items

Peer reviewed
PDF on ERIC

Download full text

Baghaei, Purya; Dourakhshan, Alireza – International Journal of Language Testing, 2016

The purpose of the present study is to compare the psychometric qualities of canonical single-response multiple-choice items with their double-response counterparts. Thirty, two-response fouroption grammar items for undergraduate students of English were constructed. A second version of the test was constructed by replacing one of the correct…

Descriptors: Language Tests, Multiple Choice Tests, Test Items, Factor Analysis

Developing an Interlanguage Pragmatic Competence Test on Routines in a Chinese EFL Context

Peer reviewed
PDF on ERIC

Download full text

Xu, Lan; Wannaruk, Anchalee – LEARN Journal: Language Education and Acquisition Research Network, 2016

Performing routines in interlanguage is vitally important for EFL learners since it can cause embarrassment between speakers from different cultures. The present study aims to 1) investigate the reliability and validity of an interlanguge pragmatic competence test on routines in a Chinese EFL context with multiple choice discourse completion task…

Descriptors: Language Tests, Test Construction, Pragmatics, Interlanguage

Assessing the Validity of Can-Do Statements in Retrospective (Then-Now) Self-Assessment

Peer reviewed

Direct link

Brown, N. Anthony; Dewey, Dan P.; Cox, Troy L. – Foreign Language Annals, 2014

In this study, the authors evaluated the strengths and limitations of a self-assessment based on ACTFL Can-Do statements ("ACTFL," 2013]) as a tool for measuring linguistic gains over an internship abroad in Russia. They assessed its reliability, determined how its items mapped with the ACTFL scale, and measured the degree to which…

Descriptors: Self Evaluation (Individuals), Pretests Posttests, Interviews, Language Proficiency

The Effect of Test Specifications Review on Improving the Quality of a Test

Peer reviewed
PDF on ERIC

Download full text

Zandi, Hamed; Kaivanpanah, Shiva; Alavi, Seyed Mohammad – Iranian Journal of Language Teaching Research, 2014

Reviewing the test specifications to improve the quality of language tests may be a routine process in professional testing systems. However, there is a paucity of research about the effect of specifications review on improving the quality of small-scale tests. The purpose of the present study was twofold: how specifications review could help…

Descriptors: Test Reliability, Test Validity, Language Tests, Test Items

Examining the Internal Structure of the Test of English-for-Teaching ("TEFT"™). Research Report. ETS RR-15-16

Peer reviewed
PDF on ERIC

Download full text

Gu, Lin; Turkan, Sultan; Gomez, Pablo Garcia – ETS Research Report Series, 2015

ELTeach is an online professional development program developed by Educational Testing Service (ETS) in collaboration with National Geographic Learning. The ELTeach program consists of two courses: English-for-Teaching and Professional Knowledge for English Language Teaching (ELT). Each course includes a coordinated assessment leading to a score…

Descriptors: Item Analysis, Test Items, English (Second Language), Second Language Instruction

Evaluation of English Achievement Test: A Comparison between High and Low Achievers amongst Selected Elementary School Students of Pakistan

Peer reviewed

Direct link

Haider, Zubair; Latif, Farah; Akhtar, Samina; Mushtaq, Maria – Educational Research and Reviews, 2012

Validity, reliability and item analysis are critical to the process of evaluating the quality of an educational measurement. The present study evaluates the quality of an assessment constructed to measure elementary school student's achievement in English. In this study, the survey model of descriptive research was used as a research method.…

Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Language Tests

Validation of an Academic Listening Test: Effects of "Breakdown" Tests and Test Takers' Cognitive Awareness of Listening Processes

Direct link

Chi, Youngshin – ProQuest LLC, 2011

This study investigated the breakdown effect of a listening comprehension test, whether test takers are affected in comprehending lectures by impediments, and collected test takers' cognitive awareness on test tasks which contain listening breakdown factors how they perceived these impediments. In this context of the study, a "Breakdown" is a test…

Descriptors: Generalizability Theory, Listening Comprehension, Intervals, Second Languages

Previous Page | Next Page »

Pages: 1 | 2 | 3

International Journal of…	3
Language Testing	2
Online Submission	2
TESOL Quarterly	2
Bilingual Review	1
ETS Research Report Series	1
Edinburgh Working Papers in…	1
Educational Research and…	1
English Language Teaching	1
Eurasian Journal of Applied…	1
Foreign Language Annals	1
GED Testing Service	1
Iranian Journal of Language…	1
Journal on Educational…	1
LEARN Journal: Language…	1
Language Assessment Quarterly	1
Language Teaching Research…	1
PASAA: Journal of Language…	1
ProQuest LLC	1
System: An International…	1
More ▼

Aaronson, May	2
Oller, John W., Jr.	2
Salmani-Nodoushan, Mohammad…	2
Akhtar, Samina	1
Al Khateeb, Nashaat Sultan…	1
Al-Jarf, Reima	1
Alallo, Hajir Mahmood Ibrahim	1
Alavi, Seyed Mohammad	1
Alghurabi, Ammar Muhi Khleel	1
Ali, Yusra Mohammed	1
Annesley, Frederick R.	1
Bachman, Lyle F.	1
Baghaei, Purya	1
Baldauf, Richard B., Jr.	1
Bashaw, W. L.	1
Bernknopf, Stanley	1
Brown, James Dean	1
Brown, N. Anthony	1
Cheewasukthaworn, Kanchana	1
Chi, Youngshin	1
Clark, John L. D.	1
Coniam, David	1
Cox, Troy L.	1
Demeuova, Lyazat	1
More ▼