ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	6

Descriptor

Comparative Analysis	6
Task Analysis	6
Test Construction	6
Language Tests	4
Second Language Learning	4
English (Second Language)	3
Evaluation Methods	3
Foreign Countries	3
Language Proficiency	3
Evaluators	2
Second Language Instruction	2
Teaching Methods	2
Writing Tests	2
Action Research	1
Artificial Intelligence	1
Authentic Learning	1
Bayesian Statistics	1
Check Lists	1
Chinese	1
College Students	1
Computer Mediated…	1
Computer Simulation	1
Computer Software	1
Cross Cultural Studies	1
Cultural Background	1
More ▼

Source

ETS Research Report Series	1
International Educational…	1
International Society for…	1
Language Assessment Quarterly	1
Language and Education	1
ProQuest LLC	1

Author

Davis, James R.	1
Garcia Gomez, Pablo	1
Han, Lu	1
Kolarec, Biserka	1
Leung, Constant	1
Li, Jiuliang	1
López-Gopar, Mario	1
Nincevic, Marina	1
Norris, John M.	1
Piech, Chris	1
Sasayama, Shoko	1
Schissel, Jamie L.	1
Tack, Anaïs	1
More ▼

Publication Type

Reports - Research	5
Journal Articles	3
Speeches/Meeting Papers	2
Dissertations/Theses -…	1
Tests/Questionnaires	1

Education Level

Higher Education	3
Postsecondary Education	2

Audience

Location

China	1
Colombia	1
Europe	1
Japan	1
Mexico (Oaxaca)	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Comparison of Two Exam Evaluation Methods for Objectivity

Peer reviewed
PDF on ERIC

Download full text

Kolarec, Biserka; Nincevic, Marina – International Society for Technology, Education, and Science, 2022

The object of research is a statistics exam that contains problem tasks. One examiner performed two exam evaluation methods to repeatedly evaluate the exam. The goal was to compare the methods for objectivity. One of the two exam evaluation methods we call a serial evaluation method. The serial evaluation method assumes evaluation of all exam…

Descriptors: Statistics Education, Mathematics Tests, Evaluation Methods, Test Construction

The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues

Peer reviewed
PDF on ERIC

Download full text

Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022

How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…

Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making

Designing Efficient L2 Writing Assessment Tasks for Low-Proficiency Learners of English. TOEFL® Research Report. RR-97. ETS RR-21-27

Peer reviewed
PDF on ERIC

Download full text

Sasayama, Shoko; Garcia Gomez, Pablo; Norris, John M. – ETS Research Report Series, 2021

This report describes the development of efficient second language (L2) writing assessment tasks designed specifically for low-proficiency learners of English to be included in the "TOEFL® Essentials"™ test. Based on the can-do descriptors of the Common European Framework of Reference for Languages for the A1 through B1 levels of…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Writing Tests

Assessing L2 Chinese Listening Using Authenticated Spoken Texts

Direct link

Han, Lu – ProQuest LLC, 2022

This dissertation study explored the feasibility of using authenticated spoken texts to test L2 Chinese listening comprehension. The spoken texts used in the study were created using an "authenticating" technique, in which scripted spoken Chinese texts were infused with characteristics of real-world, unscripted spoken Chinese. In the…

Descriptors: Second Language Learning, Second Language Instruction, Listening Comprehension Tests, Chinese

Establishing Comparability across Writing Tasks with Picture Prompts of Three Alternate Tests

Peer reviewed

Direct link

Li, Jiuliang – Language Assessment Quarterly, 2018

In language testing programs, different test forms are often used to administer the same test. Demonstrating the comparability of these forms is essential to avoid criticisms of potential test unfairness. However, studies with this objective are scarce. This study aims to investigate the extent to which the picture-prompt writing tasks of three…

Descriptors: Writing Tests, Language Tests, Check Lists, Culture Fair Tests

Multilingual Learners in Language Assessment: Assessment Design for Linguistically Diverse Communities

Peer reviewed

Direct link

Schissel, Jamie L.; Leung, Constant; López-Gopar, Mario; Davis, James R. – Language and Education, 2018

The assessments designed for and analyzed in this study used a task-based language design template rooted in theories of language reflecting heteroglossic language practices and funds of knowledge learning theories, which were understood as transforming classroom teaching, learning, and assessment through continua of biliteracy lenses. Using a…

Descriptors: Multilingualism, Spanish, Task Analysis, Preservice Teachers