Saeed, Karwan Mustafa; Ismail, Shaik Abdul Malik Mohamad; Eng, Lin Siew – International Journal of Instruction, 2019
This study was primarily aimed at developing an English-speaking proficiency test and analytic rubrics designed to measure speaking proficiency of Malaysian undergraduates. On the basis of Littlewood's Methodological Framework and Long's Interaction Hypothesis, the researchers derived three speaking tasks from four sources: (a) syllabus of the…
Descriptors: Foreign Countries, Undergraduate Students, Second Language Learning, English (Second Language)
Iberri-Shea, Gina – Cogent Education, 2017
Prominent spoken language assessments such as the Oral Proficiency Interview and the Test of Spoken English have been primarily concerned with speaking ability as it relates to conversation. This paper looks at an additional aspect of spoken language ability, namely public speaking. This study used an adapted form of a public speaking rating scale…
Descriptors: Public Speaking, Rating Scales, Adoption (Ideas), English Instruction
Zhao, Zhongbao – RELC Journal: A Journal of Language Teaching and Research, 2013
This study investigates the validity of the Diagnostic College English Speaking Test (DCEST) in the context of EFL teaching and learning in China. The experiment was conducted in three stages over the course of eight weeks at a national key university in China. By means of test administration and questionnaire survey, the researcher gathered…
Descriptors: Oral Language, Construct Validity, Language Tests, Diagnostic Tests
Lim, Gad S. – ProQuest LLC, 2009
Performance assessments have become the norm for evaluating language learners' writing abilities in international examinations of English proficiency. Two aspects of these assessments are usually systematically varied: test takers respond to different prompts, and their responses are read by different raters. This raises the possibility of undue…
Descriptors: Performance Based Assessment, Language Tests, Performance Tests, Test Validity
Powers, Stephen; And Others – Educational and Psychological Measurement, 1985 (peer reviewed)
Results of an administration of the Language Proficiency Measure indicated that the interrater reliability was adequate, internal-consistency reliability estimates were high, concurrent validity coefficients were adequate, and the classification validity was acceptable. (Author/LMO)
Descriptors: Elementary Education, Interrater Reliability, Language Proficiency, Language Tests
Boldt, R. F. – 1992
The Test of Spoken English (TSE) is an internationally administered instrument for assessing nonnative speakers' proficiency in speaking English. The research foundation of the TSE examination described in its manual refers to two sources of variation other than the achievement being measured: interrater reliability and internal consistency.…
Descriptors: Adults, Analysis of Variance, Interrater Reliability, Language Proficiency
Surface, Eric A.; Dierdorff, Erich C. – Foreign Language Annals, 2003
The reliability of the ACTFL Oral Proficiency Interview (OPI) has not been reported since ACTFL revised its speaking proficiency guidelines in 1999. Reliability data for assessments should be reported periodically to provide users with enough information to evaluate the psychometric characteristics of the assessment. This study provided the most…
Descriptors: Language Tests, Interrater Reliability, Program Effectiveness, Psychometrics
Bachman, Lyle F.; And Others – 1993
This paper outlines the development of a performance assessment measure of language speaking ability, the Language Ability Assessment System (LAAS), which is highly reliable and can be examined for reliability through modern measurement theories, such as generalizability theory (G-theory) and the many-facet Rasch theory. LAAS was developed to…
Descriptors: College Students, Higher Education, Interrater Reliability, Language Proficiency
Clark, John L. D. – 1986
A study of the reliability of the proficiency ratings scale and techniques used by three federal government agencies--the Central Intelligence Agency, the Defense Language Institute, and the Foreign Service Institute (FSI)--to test employees' oral language proficiency in French and German had two randomly selected two-person teams of testers from…
Descriptors: Comparative Analysis, Federal Government, French, German
Magnan, Sally Sieloff – Canadian Modern Language Review, 1987 (peer reviewed)
Differences in procedures used by academic institutions and government agencies in administering the American Council on the Teaching of Foreign Languages' Oral Proficiency Interview test are examined, and results and implications of two studies of interrater reliability are discussed. (MSE)
Descriptors: Comparative Analysis, Correlation, Evaluation Methods, Evaluators
McNamara, T. F.; Adams, R. J. – 1991
A preliminary study is reported of the use of new multifaceted Rasch measurement mechanisms for investigating rater characteristics in language testing. Ratings from four judges of scripts from 50 candidates taking the International English Language Testing System test, a test of English for Academic Purposes, are analyzed. The analysis…
Descriptors: Comparative Analysis, English (Second Language), Foreign Countries, Interrater Reliability
Cantor, Nancy K.; Hoover, H. D. – 1986
This paper isolates and examines separately three distinct sources of error in essay scores: lack of agreement between raters, inconsistencies in performance within mode of discourse, and inconsistencies in performance between modes of discourse. Essay prompts in the Iowa Tests of Basic Skills (ITBS) Writing Supplement were designed to assess…
Descriptors: Academic Achievement, Cues, Elementary Secondary Education, Error of Measurement
Carlson, Sybil B.; Camp, Roberta – 1985
This paper reports on Educational Testing Service research studies investigating the parameters critical to reliability and validity in both the direct and indirect writing ability assessment of higher education applicants. The studies involved: (1) formulating an operational definition of writing competence; (2) designing and pretesting writing…
Descriptors: College Entrance Examinations, Computer Assisted Testing, English (Second Language), Essay Tests
Turner, Jean – Annual Review of Applied Linguistics, 1998 (peer reviewed)
This review of research on second-language oral testing outlines the nature of early research in interview-format proficiency testing, then reports on new directions in investigation of construct validity of interview-format and other oral skills tests through examination of examinee, interviewer, and rater performance. Research on empirically…
Descriptors: Construct Validity, Educational Trends, Interrater Reliability, Interviews

