ERIC - Search Results

Showing 1 to 15 of 21 results

Test Score Comparison Tables: How Well are They Serving Test Users?

Peer reviewed

Knoch, Ute; Fan, Jason – Language Testing, 2024

While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publicly available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…

Descriptors: Language Tests, English, Test Validity, Item Analysis

A Comparison of Holistic, Analytic, and Part Marking Models in Speaking Assessment

Peer reviewed

Khabbazbashi, Nahal; Galaczi, Evelina D. – Language Testing, 2020

This mixed methods study examined holistic, analytic, and part marking models (MMs) in terms of their measurement properties and impact on candidate CEFR classifications in a semi-direct online speaking test. Speaking performances of 240 candidates were first marked holistically and by part (phase 1). On the basis of phase 1 findings--which…

Descriptors: Holistic Approach, Classification, Grading, Language Tests

Measuring the Development of General Language Skills in English as a Foreign Language--Longitudinal Invariance of the C-Test

Peer reviewed

Schnoor, Birger; Hartig, Johannes; Klinger, Thorsten; Naumann, Alexander; Usanova, Irina – Language Testing, 2023

Research on assessing English as a foreign language (EFL) development has been growing recently. However, empirical evidence from longitudinal analyses based on substantial samples is still needed. In such settings, tests for measuring language development must meet high standards of test quality such as validity, reliability, and objectivity, as…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Longitudinal Studies

What Can Gaze Behaviors, Neuroimaging Data, and Test Scores Tell Us about Test Method Effects and Cognitive Load in Listening Assessments?

Peer reviewed

Aryadoust, Vahid; Foo, Stacy; Ng, Li Ying – Language Testing, 2022

The aim of this study was to investigate how test methods affect listening test takers' performance and cognitive load. Test methods were defined and operationalized as while-listening performance (WLP) and post-listening performance (PLP) formats. To achieve the goal of the study, we examined test takers' (N = 80) brain activity patterns…

Descriptors: Listening Comprehension Tests, Language Tests, Eye Movements, Brain Hemisphere Functions

A Comparison of Reliability and Precision of Subscore Reporting Methods for a State English Language Proficiency Assessment

Peer reviewed

Longabach, Tanya; Peyton, Vicki – Language Testing, 2018

K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…

Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency

Elicited Imitation as a Measure of Second Language Proficiency: A Narrative Review and Meta-Analysis

Peer reviewed

Yan, Xun; Maeda, Yukiko; Lv, Jing; Ginther, April – Language Testing, 2016

Elicited imitation (EI) has been widely used to examine second language (L2) proficiency and development and was an especially popular method in the 1970s and early 1980s. However, as the field embraced more communicative approaches to both instruction and assessment, the use of EI diminished, and the construct-related validity of EI scores as a…

Descriptors: Second Language Learning, Language Proficiency, Meta Analysis, Effect Size

A Comparison of Three Test Formats to Assess Word Difficulty

Peer reviewed

Culligan, Brent – Language Testing, 2015

This study compared three common vocabulary test formats, the Yes/No test, the Vocabulary Knowledge Scale (VKS), and the Vocabulary Levels Test (VLT), as measures of vocabulary difficulty. Vocabulary difficulty was defined as the item difficulty estimated through Item Response Theory (IRT) analysis. Three tests were given to 165 Japanese students,…

Descriptors: Language Tests, Test Format, Comparative Analysis, Vocabulary

Using Corpus Linguistics to Examine the Extrapolation Inference in the Validity Argument for a High-Stakes Speaking Assessment

Peer reviewed

LaFlair, Geoffrey T.; Staples, Shelley – Language Testing, 2017

Investigations of the validity of a number of high-stakes language assessments are conducted using an argument-based approach, which requires evidence for inferences that are critical to score interpretation (Chapelle, Enright, & Jamieson, 2008b; Kane, 2013). The current study investigates the extrapolation inference for a high-stakes test of…

Descriptors: Computational Linguistics, Language Tests, Test Validity, Inferences

Determining Cloze Item Difficulty from Item and Passage Characteristics across Different Learner Backgrounds

Peer reviewed

Trace, Jonathan; Brown, James Dean; Janssen, Gerriet; Kozhevnikova, Liudmila – Language Testing, 2017

Cloze tests have been the subject of numerous studies regarding their function and use in both first language and second language contexts (e.g., Jonz & Oller, 1994; Watanabe & Koyama, 2008). From a validity standpoint, one area of investigation has been the extent to which cloze tests measure reading ability beyond the sentence level.…

Descriptors: Cloze Procedure, Language Tests, Test Items, Item Analysis

Development and Validation of Extract the Base: An English Derivational Morphology Test for Third through Fifth Grade Monolingual Students and Spanish-Speaking English Language Learners

Peer reviewed

Goodwin, Amanda P.; Huggins, A. Corinne; Carlo, Maria; Malabonga, Valerie; Kenyon, Dorry; Louguit, Mohammed; August, Diane – Language Testing, 2012

This study describes the development and validation of the Extract the Base test (ETB), which assesses derivational morphological awareness. Scores on this test were validated for 580 monolingual students and 373 Spanish-speaking English language learners (ELLs) in third through fifth grade. As part of the validation of the internal structure,…

Descriptors: Reading Comprehension, Speech Communication, Second Language Learning, Scoring

Authentic Language Tests: Where from and Where to?

Peer reviewed

Shohamy, Elana; Reves, Thea – Language Testing, 1985

Surveys the development of language tests toward authenticity and discusses the advantages and disadvantages of indirect and direct (authentic) language tests. Discusses the difficulty of applying appropriate psychometric measures to tests using real-life language, and the large number of test variables that interfere with the authenticity of…

Descriptors: Comparative Analysis, Interviews, Language Tests, Language Usage

An Investigation into the Adequacy of Three IRT Models for Data from Two EFL Reading Tests.

Peer reviewed

Choi, Inn-Chull; Bachman, Lyle F. – Language Testing, 1992

This study is part of a larger one examining the comparability of the First Certificate in English and the Test of English as a Foreign Language. The general assumption of unidimensionality and goodness-of-fit were tested. Findings raise questions about the consequences of rejecting or retaining misfitting items. (60 references) (LB)

Descriptors: Comparative Analysis, English (Second Language), Goodness of Fit, Item Response Theory

Crossvalidation of Item Response Curve Models Using TOEFL Data.

Peer reviewed

Boldt, Robert F. – Language Testing, 1992

The proportional item response curve (PIRC) assumption was tested by using PIRC to predict item scores of selected examinees on selected items. Findings show approximate accuracies of prediction for PIRC, the three-parameter logistic model, and a modified Rasch model. (12 references) (Author/LB)

Descriptors: Comparative Analysis, English (Second Language), Factor Analysis, Item Response Theory

Validation of a Tape-Mediated ACTFL/ILR-Scale Based Test of Chinese Speaking Proficiency.

Peer reviewed

Clark, John L. D. – Language Testing, 1988

A validation study of the "semi-direct" Chinese Speaking Test (CST) directly compared college students' performance on the test with their performance on the "live" language proficiency interview. CST provided scoring results largely equivalent to those of the live interview, although examinees perceived CST to be more…

Descriptors: Chinese, College Students, Comparative Analysis, Higher Education

An Investigation of a Criterion-Referenced Test Using G-Theory, and Factor and Cluster Analyses.

Peer reviewed

Kunnan, Antony John – Language Testing, 1992

Three analysis procedures were used to study the dependability and validity of ESLPE, a criterion-referenced English-as-a-Second-Language placement test developed at the University of California at Los Angeles in 1989. Findings led to the suggestion that some students might have been differently placed if subtest scores were used for placement. (38…

Descriptors: Cluster Analysis, Comparative Analysis, Criterion Referenced Tests, English (Second Language)
