Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 5 |
Descriptor
Source
Language Testing | 3 |
Online Submission | 2 |
Educational Testing Service | 1 |
Journal on Educational… | 1 |
TESOL Quarterly | 1 |
Unterrichtspraxis/Teaching… | 1 |
Author
Publication Type
Reports - Descriptive | 12 |
Journal Articles | 7 |
Books | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 2 |
Postsecondary Education | 1 |
Audience
Researchers | 1 |
Location
Africa | 1 |
Mexico | 1 |
Mexico (Mexico City) | 1 |
USSR | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 2 |
What Works Clearinghouse Rating
Tschirner, Erwin – Unterrichtspraxis/Teaching German, 2018
Concepts of second language proficiency and how proficiency may be assessed have changed considerably over the last 20 years. New notions of validity with respect to the interpretation and uses of test scores have begun to shape discussions about test validity and quality assurance in college world language departments, in government, and in…
Descriptors: Language Tests, Testing, Test Theory, German
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests
Salmani-Nodoushan, Mohammad Ali – Journal on Educational Psychology, 2009
A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure, and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for any…
Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory
Salmani-Nodoushan, Mohammad Ali – Online Submission, 2009
A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for…
Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory
Mota, Marisol – Online Submission, 2008
This study examines Linguistic Competence in English Language (LCE) as a general indicator of Communicative Competence. A test and a questionnaire were administered to 1838 undergraduate freshmen from five major institutes of higher education in Aguascalientes, Mexico. The results of the test are analysed in their association with main features of…
Descriptors: Linguistic Competence, English (Second Language), College Freshmen, Higher Education

Brown, James Dean – Language Testing, 1999
Explored the relative contributions to Test of English as a Foreign Language (TOEFL) score dependability of various numbers of persons, items, subtests, languages, and their various interactions. Sampled 15,000 test takers, 1000 each from 15 different language backgrounds. (Author/VWL)
Descriptors: English (Second Language), Language Tests, Second Language Learning, Student Characteristics

Pollitt, Alastair; Hutchinson, Carolyn – Language Testing, 1987
Describes the use of the partial credit form of the Rasch model in the analysis and calibration of a set of writing tasks in which assessment scales and criteria were adapted to suit each task's specific demands. Potential applications of the partial credit model in language testing are discussed. (Author/CB)
Descriptors: Evaluation Criteria, Language Tests, Performance Tests, Second Language Learning

Wall, Dianne – Language Testing, 1996
Suggests that any model of washback must include insights from the theory of educational innovation to help explain why tests do not always have the desired or feared effect. Key concepts in educational innovation are reviewed, showing how these concepts are manifested in a case study in washback and outlining how they are being applied in recent…
Descriptors: Case Studies, Change Strategies, Cognitive Development, Educational Innovation
Kokkota, V. A. – 1989
This book contrasts non-Soviet approaches to language testing and provides definitions from four Soviet language test experts. The role of foreign language teaching, the function of tests, and theoretical problems are discussed, with considerable focus on communicative competence. The book discusses test standardization and classification and…
Descriptors: Communicative Competence (Languages), Foreign Countries, Language Skills, Language Tests

Brown, James Dean – TESOL Quarterly, 1989
Criterion-referenced testing was used to complement norm-referenced procedures in a revision of a university's English-as-a-Second-Language placement test for reading. Test validation results indicated that the revised test better matched the university's program and included more items related to the content and skills that students were…
Descriptors: Criterion Referenced Tests, English (Second Language), Higher Education, Language Tests
Siskind, Teri G.; Rose, Janet S. – 1986
The Charleston County School District (CCSD) has recently begun development of criterion-referenced tests (CRT) in different subject areas and for different grade levels. This paper outlines the process that CCSD followed in the development of math and language arts tests for grades one through eight and area exams for required high school…
Descriptors: Behavioral Objectives, Criterion Referenced Tests, Educational Objectives, Educational Testing
Hathaway, Walter; And Others – 1985
This report describes the development, operation, maintenance, and future prospects of the item banks pioneered by the Portland (Oregon) School District. At the time of this report, there were 3,500 mathematics, 2,200 reading, and 2,300 language usage items calibrated under the fixed parameter model of item response theory (IRT) for Grades 3-8.…
Descriptors: Adaptive Testing, Competency Based Education, Computer Assisted Testing, Criterion Referenced Tests