Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 6 |
Descriptor
Comparative Analysis | 6 |
Task Analysis | 6 |
Test Construction | 6 |
Language Tests | 4 |
Second Language Learning | 4 |
English (Second Language) | 3 |
Evaluation Methods | 3 |
Foreign Countries | 3 |
Language Proficiency | 3 |
Evaluators | 2 |
Second Language Instruction | 2 |
More ▼ |
Source
ETS Research Report Series | 1 |
International Educational… | 1 |
International Society for… | 1 |
Language Assessment Quarterly | 1 |
Language and Education | 1 |
ProQuest LLC | 1 |
Author
Davis, James R. | 1 |
Garcia Gomez, Pablo | 1 |
Han, Lu | 1 |
Kolarec, Biserka | 1 |
Leung, Constant | 1 |
Li, Jiuliang | 1 |
López-Gopar, Mario | 1 |
Nincevic, Marina | 1 |
Norris, John M. | 1 |
Piech, Chris | 1 |
Sasayama, Shoko | 1 |
More ▼ |
Publication Type
Reports - Research | 5 |
Journal Articles | 3 |
Speeches/Meeting Papers | 2 |
Dissertations/Theses -… | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 3 |
Postsecondary Education | 2 |
Audience
Location
China | 1 |
Colombia | 1 |
Europe | 1 |
Japan | 1 |
Mexico (Oaxaca) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Kolarec, Biserka; Nincevic, Marina – International Society for Technology, Education, and Science, 2022
The object of research is a statistics exam that contains problem tasks. One examiner performed two exam evaluation methods to repeatedly evaluate the exam. The goal was to compare the methods for objectivity. One of the two exam evaluation methods we call a serial evaluation method. The serial evaluation method assumes evaluation of all exam…
Descriptors: Statistics Education, Mathematics Tests, Evaluation Methods, Test Construction
The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making
Sasayama, Shoko; Garcia Gomez, Pablo; Norris, John M. – ETS Research Report Series, 2021
This report describes the development of efficient second language (L2) writing assessment tasks designed specifically for low-proficiency learners of English to be included in the "TOEFL® Essentials"™ test. Based on the can-do descriptors of the Common European Framework of Reference for Languages for the A1 through B1 levels of…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Writing Tests
Han, Lu – ProQuest LLC, 2022
This dissertation study explored the feasibility of using authenticated spoken texts to test L2 Chinese listening comprehension. The spoken texts used in the study were created using an "authenticating" technique, in which scripted spoken Chinese texts were infused with characteristics of real-world, unscripted spoken Chinese. In the…
Descriptors: Second Language Learning, Second Language Instruction, Listening Comprehension Tests, Chinese
Li, Jiuliang – Language Assessment Quarterly, 2018
In language testing programs, different test forms are often used to administer the same test. Demonstrating the comparability of these forms is essential to avoid criticisms of potential test unfairness. However, studies with this objective are scarce. This study aims to investigate the extent to which the picture-prompt writing tasks of three…
Descriptors: Writing Tests, Language Tests, Check Lists, Culture Fair Tests
Schissel, Jamie L.; Leung, Constant; López-Gopar, Mario; Davis, James R. – Language and Education, 2018
The assessments designed for and analyzed in this study used a task-based language design template rooted in theories of language reflecting heteroglossic language practices and funds of knowledge learning theories, which were understood as transforming classroom teaching, learning, and assessment through continua of biliteracy lenses. Using a…
Descriptors: Multilingualism, Spanish, Task Analysis, Preservice Teachers