Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 8 |
Since 2016 (last 10 years) | 14 |
Since 2006 (last 20 years) | 24 |
Descriptor
English (Second Language) | 45 |
Test Reliability | 45 |
Scoring | 37 |
Language Tests | 32 |
Test Validity | 26 |
Second Language Learning | 22 |
Language Proficiency | 17 |
Test Construction | 15 |
Foreign Countries | 12 |
Computer Assisted Testing | 10 |
Test Items | 10 |
More ▼ |
Source
Author
Carlson, Sybil B. | 2 |
Eng, Lin Siew | 2 |
Xu, Jing | 2 |
Alderson, J. Charles | 1 |
Ann Tai Choe | 1 |
Anthony, Jason L. | 1 |
Assel, Michael M. | 1 |
Attali, Yigal | 1 |
August, Diane | 1 |
Aviad-Levitzky, Tami | 1 |
Baldauf, Richard B., Jr. | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 6 |
Postsecondary Education | 5 |
Elementary Education | 3 |
Early Childhood Education | 2 |
Grade 8 | 2 |
Elementary Secondary Education | 1 |
Grade 7 | 1 |
Grade 9 | 1 |
Kindergarten | 1 |
Middle Schools | 1 |
Preschool Education | 1 |
More ▼ |
Audience
Practitioners | 2 |
Researchers | 2 |
Teachers | 2 |
Location
Europe | 2 |
Iran | 2 |
Japan | 2 |
Malaysia | 2 |
Texas | 2 |
California | 1 |
Greece | 1 |
Hawaii | 1 |
Israel | 1 |
Netherlands | 1 |
Northern Mariana Islands | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Yu-Tzu Chang; Ann Tai Choe; Daniel Holden; Daniel R. Isbell – Language Testing, 2024
In this Brief Report, we describe an evaluation of and revisions to a rubric adapted from the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE, with four rubric categories and 20-point rating scales, in the context of an intensive English program writing placement test. Analysis of 4 years of rating data (2016-2021, including 434 essays) using…
Descriptors: Language Tests, Rating Scales, Second Language Learning, English (Second Language)
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
Davis, Larry; Papageorgiou, Spiros – Assessment in Education: Principles, Policy & Practice, 2021
Human raters and machine scoring systems potentially have complementary strengths in evaluating language ability; specifically, it has been suggested that automated systems might be used to make consistent measurements of specific linguistic phenomena, whilst humans evaluate more global aspects of performance. We report on an empirical study that…
Descriptors: Scoring, English for Academic Purposes, Oral English, Speech Tests
Heng Lu – PASAA: Journal of Language Teaching and Learning in Thailand, 2023
The test view is on the Duolingo English Test (DET), an alternative online English proficiency test with a machine-driven characteristic. The review covers essential information of the DET such as test purpose, usage, score-mapping with CEFR scale, price, and publisher. Meanwhile, the test usefulness is discussed with focuses on reliability,…
Descriptors: Computer Software, Computer Assisted Instruction, Second Language Learning, Second Language Instruction
Ji-young Shin – ProQuest LLC, 2021
The present dissertation investigated the impact of scales/scoring methods and prompt linguistic features on the measurement quality of L2 English elicited imitation (EI). Scales/scoring methods are an important feature for the validity and reliability of L2 EI test, but less is known (Yan et al., 2016). Prompt linguistic features are also known…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Semantics
Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021
Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software
Collier, Jo-Kate; Huang, Becky – Language Assessment Quarterly, 2020
This article presents a critical review of the Texas English Language Proficiency Assessment System (TELPAS), a large scale standardized English language proficiency (ELP) assessment developed by the Texas Education Agency (TEA) and administered since 2004. TELPAS is used as an annual summative assessment for all English Learners (ELs) in grades…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Standardized Tests
Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022
Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade when technology has meaningfully restructured methods of the educational assessment. Given this controversy, various testing guidelines published on…
Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring
Trina D. Spencer; Marilyn S. Thompson; Douglas B. Petersen; Yixing Liu; M. Adelaida Restrepo – Grantee Submission, 2023
For young Spanish-speaking children entering U. S. schools, it is imperative that educators foster growth in the home language and in the language of instruction to the fullest extent possible. Monitoring language development over time is crucial for promoting language development because it allows educators to individualize student instruction.…
Descriptors: Spanish Speaking, English (Second Language), Second Language Learning, Native Language
Saeed, Karwan Mustafa; Ismail, Shaik Abdul Malik Mohamad; Eng, Lin Siew – International Journal of Instruction, 2019
This study was primarily aimed at developing an English-speaking proficiency test and analytic rubrics designed to measure speaking proficiency of Malaysian undergraduates. On the basis of Littlewood's Methodological Framework and Long's Interaction Hypothesis, the researchers derived three speaking tasks from four sources: (a) syllabus of the…
Descriptors: Foreign Countries, Undergraduate Students, Second Language Learning, English (Second Language)
Montroy, Janelle J.; Zucker, Tricia A.; Assel, Michael M.; Landry, Susan H.; Anthony, Jason L.; Williams, Jeffrey M.; Hsu, Hsien-Yuan; Crawford, April; Johnson, Ursula Y.; Carlo, Maria S.; Taylor, Heather B. – Early Education and Development, 2020
There is a significant need for kindergarten entry assessments (KEA) that meet state education agency (SEA) requirements and are psychometrically sound measures of a broad range of school readiness domains such as language, literacy, math, science, executive function, and social-emotional skills. Research Findings: In this paper, we describe five…
Descriptors: Kindergarten, School Readiness, Student Evaluation, Test Construction
Lim, Chang Kuan; Eng, Lin Siew; Mohamed, Abdul Rashid; Ismail, Shaik Abdul Malik Mohamed – English Language Teaching, 2018
The purpose of the study is to have a relook at the ESL reading comprehension assessment system for Malaysian Year Five students. Traditionally, the ESL teachers have been assessing and reporting on their primary year's students by merely giving a composite grade with some vague remarks. This process has been used and is still being employed in…
Descriptors: Foreign Countries, Elementary Schools, English (Second Language), Second Language Instruction
Aviad-Levitzky, Tami; Laufer, Batia; Goldstein, Zahava – Language Assessment Quarterly, 2019
This article describes the development and validation of the new CATSS (Computer Adaptive Test of Size and Strength), which measures vocabulary knowledge in four modalities -- productive recall, receptive recall, productive recognition, and receptive recognition. In the first part of the paper we present the assumptions that underlie the test --…
Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability
Hirai, Akiyo; Koizumi, Rie – Language Assessment Quarterly, 2013
In recognition of the rating scale as a crucial tool of performance assessment, this study aims to establish a rating scale suitable for a Story Retelling Speaking Test (SRST), which is a semidirect test of speaking ability in English as a foreign language for classroom use. To identify an appropriate scale, three rating scales, all of which have…
Descriptors: Test Validity, Rating Scales, Story Telling, Speech Tests
Nushi, Musa – Journal of Language and Linguistic Studies, 2016
Han's (2009, 2013) selective fossilization hypothesis (SFH) claims that L1 markedness and L2 input robustness determine the fossilizability (and learnability) of an L2 feature. To test the validity of the model, a pseudo-longitudinal study was designed in which the errors in the argumentative essays of 52 Iranian EFL learners were identified and…
Descriptors: Foreign Countries, Longitudinal Studies, English (Second Language), Second Language Instruction