Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 10 |
| Since 2017 (last 10 years) | 22 |
| Since 2007 (last 20 years) | 28 |
Descriptor
| Evaluation Methods | 45 |
| Second Language Instruction | 45 |
| English (Second Language) | 30 |
| Second Language Learning | 24 |
| Foreign Countries | 23 |
| Test Reliability | 19 |
| Reliability | 18 |
| Student Evaluation | 13 |
| Test Validity | 13 |
| Language Tests | 12 |
| Validity | 11 |
| More ▼ | |
Source
Author
| Brown, James Dean | 2 |
| Adunyarittigun, Dumrong | 1 |
| Ahmadi Safa, Mohammad | 1 |
| Akihito Kamata | 1 |
| Ali Panahi | 1 |
| Alsree, Zubaida | 1 |
| Amini, Mojtaba | 1 |
| Arslan, Abdullah | 1 |
| Aziz, Anealka | 1 |
| Barnwell, David Patrick | 1 |
| Bhamani, Shelina | 1 |
| More ▼ | |
Publication Type
Education Level
| Higher Education | 13 |
| Postsecondary Education | 12 |
| Elementary Education | 3 |
| Secondary Education | 3 |
| Elementary Secondary Education | 2 |
| Early Childhood Education | 1 |
| Grade 2 | 1 |
| Primary Education | 1 |
Audience
| Practitioners | 1 |
| Teachers | 1 |
Location
| Iran | 4 |
| China | 2 |
| Egypt | 2 |
| Japan | 2 |
| Netherlands | 2 |
| Pakistan | 2 |
| Turkey | 2 |
| United Kingdom (Great Britain) | 2 |
| Vietnam | 2 |
| Asia | 1 |
| Australia | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Dale Chall Readability Formula | 1 |
| Flesch Kincaid Grade Level… | 1 |
| Flesch Reading Ease Formula | 1 |
| Fry Readability Formula | 1 |
What Works Clearinghouse Rating
Yang Yang – Shanlax International Journal of Education, 2024
This paper explores the reliability of using ChatGPT in evaluating EFL writing by assessing its intra- and inter-rater reliability. Eighty-two compositions were randomly sampled from the Written English Corpus of Chinese Learners. These compositions were rated by three experienced raters with regard to 'language', 'content', and 'organization'.…
Descriptors: English (Second Language), Second Language Instruction, Writing (Composition), Evaluation Methods
Ghiasvand, Farhad; Jahanbakhsh, Akbar A.; Sharifpour, Pardis – Language Testing in Asia, 2023
Teacher agency is a pivotal element of professionalism and second/foreign language (L2) education. However, its role in L2 assessment has remained under-researched. Part of this negligence is due to the absence of a validated questionnaire to measure the construct and its underlying components. To address this gap, drawing on the ecological…
Descriptors: Test Construction, Test Validity, Professional Autonomy, Evaluation Methods
Rebecca Sickinger; Tineke Brunfaut; John Pill – Language Testing, 2025
Comparative Judgement (CJ) is an evaluation method, typically conducted online, whereby a rank order is constructed, and scores calculated, from judges' pairwise comparisons of performances. CJ has been researched in various educational contexts, though only rarely in English as a Foreign Language (EFL) writing settings, and is generally agreed to…
Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction
Tu, Thuy Thi Minh – ProQuest LLC, 2023
The study aimed to elicit information from Vietnamese EFL university instructors about their knowledge and skills regarding the principles, theory, and practices of language assessment by means of revision and validation of the Language Assessment Literacy--Revised Vietnam (LAL-RV), which was previously developed by Kremmel and Harding (2020). A…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, College Faculty
Saito, Kazuya; Macmillan, Konstantinos; Kachlicka, Magdalena; Kunihara, Takuya; Minematsu, Nobuaki – Studies in Second Language Acquisition, 2023
Whereas many scholars have emphasized the relative importance of "comprehensibility" as an ecologically valid goal for L2 speech training, testing, and development, eliciting listeners' judgments is time-consuming. Following calls for research on more efficient L2 speech rating methods in applied linguistics, and growing attention toward…
Descriptors: Second Language Learning, Second Language Instruction, Interrater Reliability, Speech Communication
Doosti, Mehdi; Ahmadi Safa, Mohammad – International Journal of Language Testing, 2021
This study examined the effect of rater training on promoting inter-rater reliability in oral language assessment. It also investigated whether rater training and the consideration of the examinees' expectations by the examiners have any effect on test-takers' perceptions of being fairly evaluated. To this end, four raters scored 31 Iranian…
Descriptors: Oral Language, Language Tests, Interrater Reliability, Training
Shang, Xiaoqi; Xie, Guixia – Interpreter and Translator Trainer, 2023
Sight translation has been widely used in aptitude testing to screen prospective trainee interpreters at leading interpreter training schools, including ESIT, ISIT, and EMCI. However, it has also been criticised for its lack of validity and reliability. No empirical study has thus far been conducted to explore its power to predict interpreting…
Descriptors: Translation, Second Language Learning, Second Language Instruction, Chinese
James Dean Brown; Ali Panahi; Hassan Mohebbi – Language Teaching Research Quarterly, 2023
Panahi and Mohebbi review James Dean Brown's 50-years of research in language testing, curriculum development and research statistics with reference to an impressionistic framework for analysis containing two components with their subcomponents: Annotations (i.e., briefing and implications) and main concepts and themes (i.e., testing and teaching…
Descriptors: Second Language Learning, Second Language Instruction, Language Tests, Curriculum Development
Thai, Thuy; Sheehan, Susan – Language Education & Assessment, 2022
In language performance tests, raters are important as their scoring decisions determine which aspects of performance the scores represent; however, raters are considered as one of the potential sources contributing to unwanted variability in scores (Davis, 2012). Although a great number of studies have been conducted to unpack how rater…
Descriptors: Rating Scales, Speech Communication, Second Language Learning, Second Language Instruction
Marshall, Paul Anthony – International Journal of Curriculum and Instruction, 2020
This study measures the perceptions of English language teachers at Japanese universities. It combines the results of online questionnaires on assessment practices and another one on teacher autonomy. Results suggest that according to teachers' self-reports of assessment practices, certain measures of assessment quality are being affected by the…
Descriptors: Educational Quality, Universities, Foreign Countries, Language Teachers
Zhongdi Wu; Eric Larson; Makoto Sano; Doris Baker; Nathan Gage; Akihito Kamata – Grantee Submission, 2023
In this investigation we propose new machine learning methods for automated scoring models that predict the vocabulary acquisition in science and social studies of second grade English language learners, based upon free-form spoken responses. We evaluate performance on an existing dataset and use transfer learning from a large pre-trained language…
Descriptors: Prediction, Vocabulary Development, English (Second Language), Second Language Learning
Nikmard, Fateme; Mohamadi Zenouzagh, Zohre – Language Testing in Asia, 2020
English teachers' assessment literacy has always been considered as an important factor in their performance. However, no instrument has ever been developed to assess this construct among Iranian EFL teachers. To fill this gap, in the first phase of the present study, a theoretical framework for the main four components of teacher assessment…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Validity
Musthofa, Tulus – Eurasian Journal of Applied Linguistics, 2022
Common European Framework of Reference (CEFR) is an international standard to measure learners' language abilities on a six-point scale, A1 for beginners up to C2 for those who have mastered a language. this study attempted to examine the implementation of the CEFR policy in learning Arabic in Indonesia at al levels, beginning form curriculum…
Descriptors: Arabic, Semitic Languages, Second Language Learning, Second Language Instruction
Kutuk, Gulsah; Putwain, David W.; Kaye, Linda; Garrett, Bethan – Journal of Psychoeducational Assessment, 2020
This study reports on the development and assessment of a new 30-item Multidimensional Language Class Anxiety Scale which is designed to assess foreign language learners' anxiety regarding four language skills (listening, reading, writing, and speaking) and testing. In Study 1, the initial items were piloted with 323 students studying English as a…
Descriptors: Validity, Anxiety, Second Language Learning, Second Language Instruction
Amini, Mojtaba – Language Testing in Asia, 2018
Background: Translation quality assessment (TQA) suffers from subjectivity in both neighboring disciplines: 'TEFL' and 'Translation Studies, and more empirical studies are required to get closer to objectivity in this domain. The present study evaluated the quality of the written translation of TEFL students through three different approaches to…
Descriptors: Second Language Instruction, English (Second Language), Student Evaluation, Translation

Peer reviewed
Direct link
