Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 12 |
Descriptor
Language Proficiency | 31 |
Test Theory | 31 |
Language Tests | 25 |
English (Second Language) | 14 |
Second Language Learning | 13 |
Test Validity | 10 |
Testing | 9 |
Comparative Analysis | 8 |
Foreign Countries | 8 |
Test Reliability | 8 |
Item Response Theory | 7 |
More ▼ |
Source
Author
Bachman, Lyle F. | 2 |
Douglas, Dan | 2 |
Salmani-Nodoushan, Mohammad… | 2 |
Brooks, Lindsay | 1 |
Brown, Cheri | 1 |
Cascallar, Alicia S. | 1 |
Cziko, Gary A. | 1 |
Davies, Alan | 1 |
Dieterich, Thomas G. | 1 |
Dorans, Neil J. | 1 |
Dugan, J. Sanford | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 7 |
Postsecondary Education | 6 |
Elementary Education | 1 |
Audience
Researchers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 3 |
ACTFL Oral Proficiency… | 2 |
English Proficiency Test | 1 |
SAT (College Admission Test) | 1 |
Test of English for… | 1 |
What Works Clearinghouse Rating
Sivaci, Seda – Journal on English Language Teaching, 2020
The purpose of the present study is to evaluate proficiency tests of the universities in Turkey in line with the ALTE Quality Assurance Checklists in terms of four dimensions: a. Test construction, b. Administration & Logistics c. Grading, Marking Results and d. Test Analysis & Post-examination Review. The study took place in four…
Descriptors: Foreign Countries, Language Tests, Language Proficiency, Educational Quality
Polat, Murat – International Online Journal of Education and Teaching, 2022
Foreign language testing is a multi-dimensional phenomenon and obtaining objective and error-free scores on learners' language skills is often problematic. While assessing foreign language performance on high-stakes tests, using different testing approaches including Classical Test Theory (CTT), Generalizability Theory (GT) and/or Item Response…
Descriptors: Second Language Learning, Second Language Instruction, Item Response Theory, Language Tests
Tschirner, Erwin – Unterrichtspraxis/Teaching German, 2018
Concepts of second language proficiency and how proficiency may be assessed have changed considerably over the last 20 years. New notions of validity with respect to the interpretation and uses of test scores have begun to shape discussions about test validity and quality assurance in college world language departments, in government, and in…
Descriptors: Language Tests, Testing, Test Theory, German
Powers, Donald; Schedl, Mary; Papageorgiou, Spiros – Language Testing, 2017
The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…
Descriptors: English (Second Language), Second Language Learning, Language Proficiency, Scores
Longabach, Tanya; Peyton, Vicki – Language Testing, 2018
K-12 English language proficiency tests that assess multiple content domains (e.g., listening, speaking, reading, writing) often have subsections based on these content domains; scores assigned to these subsections are commonly known as subscores. Testing programs face increasing customer demands for the reporting of subscores in addition to the…
Descriptors: Comparative Analysis, Test Reliability, Second Language Learning, Language Proficiency
Zhao, Ping; Ji, Xiaoli – RELC Journal: A Journal of Language Teaching and Research, 2018
This article provides preliminary validity evidence for the shorter Mandarin version of the Vocabulary Size Test (VST) under the content aspect, technical quality, substantive and generalizability aspect of Messick's (1995) construct validity framework. The shorter version with 177 Chinese university students in three proficiency levels indicates…
Descriptors: Language Tests, Test Validity, Mandarin Chinese, Second Language Learning
Retnawati, Heri – Turkish Online Journal of Educational Technology - TOJET, 2015
This study aimed to compare the accuracy of the test scores as results of Test of English Proficiency (TOEP) based on paper and pencil test (PPT) versus computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and data of CBT TOEP documented in 2013-2014 on the sets of 1A, 2A, and 3A for the Listening and…
Descriptors: Scores, Accuracy, Computer Assisted Testing, English (Second Language)
Pan, Yi-Ching – TESL-EJ, 2014
In much of the world, the issue of accountability and measurement of educational outcomes is highly controversial. Exit testing is part of the movement to ascertain what students have learned and hold institutions and teachers to account. However, compared to the large number of teacher washback studies, learner washback research is lacking…
Descriptors: Standardized Tests, Exit Examinations, Questionnaires, College Students
Ellis, David P. – ProQuest LLC, 2011
The current version of the International Language Testing Association (ILTA) Guidelines for Practice requires language testers to pretest items before including them on an exam, or when pretesting is not possible, to conduct post-hoc item analysis to ensure any malfunctioning items are excluded from scoring. However, the guidelines are devoid of…
Descriptors: Item Response Theory, High Stakes Tests, College Entrance Examinations, Item Analysis
Salmani-Nodoushan, Mohammad Ali – Journal on Educational Psychology, 2009
A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure, and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for any…
Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory
Salmani-Nodoushan, Mohammad Ali – Online Submission, 2009
A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for…
Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory
Brooks, Lindsay – Language Testing, 2009
This study, framed within sociocultural theory, examines the interaction of adult ESL test-takers in two tests of oral proficiency: one in which they interacted with an examiner (the individual format) and one in which they interacted with another student (the paired format). The data for the eight pairs in this study were drawn from a larger…
Descriptors: Testing, Rating Scales, Program Effectiveness, Interaction
Moy, Raymond – 1982
Score equating requires that the forms to be equated are functionally parallel. That is, the two test forms should rank order examinees in a similar fashion. In language proficiency testing situations, this assumption is often put into doubt because of the numerous tests that have been proposed as measures of language proficiency and the…
Descriptors: Equated Scores, Language Proficiency, Language Tests, Latent Trait Theory

Davies, Alan – Language Testing, 1984
Discusses validation studies of three British English language proficiency tests--the English Proficiency Test Battery, the English Language Battery, and the English Language Testing Service. Concludes that valid language tests depend on test constructors' knowledge of language and on their judgment as to the parameters of language proficiency.…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Second Language Learning

Bachman, Lyle F. – Annual Review of Applied Linguistics, 1988
Discusses three research/testing interfaces in second-language (L2) testing: the covariance structure analysis of ex post facto correlational data, the qualitative investigation of test-taking processes, and the development of L2 assessment instruments based on developmental sequences in L2 acquisition. (61 references) (GLR)
Descriptors: Language Proficiency, Language Research, Language Tests, Multivariate Analysis