ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	12

Descriptor

Computational Linguistics	12
Computer Assisted Testing	12
Correlation	12
English (Second Language)	7
Language Tests	7
Language Proficiency	6
Second Language Learning	6
Scoring	5
Comparative Analysis	4
Test Validity	4
Evaluators	3
Foreign Countries	3
Grammar	3
Language Usage	3
Regression (Statistics)	3
Scores	3
Scoring Rubrics	3
Second Language Instruction	3
Speech Communication	3
Statistical Analysis	3
Syntax	3
Writing Evaluation	3
Academic Discourse	2
Artificial Intelligence	2
Cognitive Processes	2
More ▼

Source

Grantee Submission	2
Language Testing	2
Advances in Physiology…	1
ETS Research Report Series	1
English Teaching	1
Journal of Experimental…	1
Language Learning & Technology	1
Modern Language Journal	1
Reading in a Foreign Language	1
Turkish Online Journal of…	1

Publication Type

Journal Articles	11
Reports - Research	10
Information Analyses	1
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Higher Education	3
Elementary Education	2
Postsecondary Education	2
Elementary Secondary Education	1
Grade 4	1
Grade 6	1
High Schools	1
Secondary Education	1

Audience

Location

Mexico	1
Singapore	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	5
Gates MacGinitie Reading Tests	1
Torrance Tests of Creative…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

Measuring Original Thinking in Elementary School: Development and Validation of a Computational Psychometric Approach

Peer reviewed

Direct link

Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024

Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…

Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Direct link

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Corpus Linguistics and Language Testing: Navigating Uncharted Waters

Peer reviewed

Direct link

Egbert, Jesse – Language Testing, 2017

The use of corpora and corpus linguistic methods in language testing research is increasing at an accelerated pace. The growing body of language testing research that uses corpus linguistic data is a testament to their utility in test development and validation. Although there are many reasons to be optimistic about the future of using corpus data…

Descriptors: Language Tests, Second Language Learning, Computational Linguistics, Best Practices

Distinguishing Discrete and Gradient Category Structure in Language: Insights from Verb-Particle Constructions

Peer reviewed

Direct link

Brehm, Laurel; Goldrick, Matthew – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2017

The current work uses memory errors to examine the mental representation of verb-particle constructions (VPCs; e.g., "make up" the story, "cut up the meat"). Some evidence suggests that VPCs are represented by a cline in which the relationship between the VPC and its component elements ranges from highly transparent ("cut…

Descriptors: Verbs, Form Classes (Languages), Regression (Statistics), Error Patterns

Automated Measurement of Syntactic Complexity in Corpus-Based L2 Writing Research and Implications for Writing Assessment

Peer reviewed

Direct link

Lu, Xiaofei – Language Testing, 2017

Research investigating corpora of English learners' language raises new questions about how syntactic complexity is defined theoretically and operationally for second language (L2) writing assessment. I show that syntactic complexity is important in construct definitions and L2 writing rating scales as well as in L2 writing research. I describe…

Descriptors: Syntax, Computational Linguistics, Second Language Learning, Writing Research

Using Corpus-Based Register Analysis to Explore the Authenticity of High-Stakes Language Exams: A Register Comparison of TOEFL iBT and Disciplinary Writing Tasks

Peer reviewed

Direct link

Staples, Shelley; Biber, Douglas; Reppen, Randi – Modern Language Journal, 2018

One of the central considerations in the validity argument for the TOEFL iBT is the relationship between the language on the exam and the language required for university courses. Corpus linguistics has recently been shown to be an effective way to explore this relationship, which can also be considered as an aspect of authenticity. Applying…

Descriptors: Computational Linguistics, Computer Assisted Testing, English (Second Language), Language Tests

What's so Simple about Simplified Texts? A Computational and Psycholinguistic Investigation of Text Comprehension and Text Processing

Peer reviewed
PDF on ERIC

Download full text

Crossley, Scott A.; Yang, Hae Sung; McNamara, Danielle S. – Reading in a Foreign Language, 2014

This study uses a moving windows self-paced reading task to assess both text comprehension and processing time of authentic texts and these same texts simplified to beginning and intermediate levels. Forty-eight second language learners each read 9 texts (3 different authentic, beginning, and intermediate level texts). Repeated measures ANOVAs…

Descriptors: Reading Comprehension, Reading Processes, Second Language Instruction, Difficulty Level

Applications of Text Analysis Tools for Spoken Response Grading

Peer reviewed
PDF on ERIC

Download full text

Direct link

Crossley, Scott; McNamara, Danielle – Language Learning & Technology, 2013

This study explores the potential for automated indices related to speech delivery, language use, and topic development to model human judgments of TOEFL speaking proficiency in second language (L2) speech samples. For this study, 244 transcribed TOEFL speech samples taken from 244 L2 learners were analyzed using automated indices taken from…

Descriptors: English (Second Language), Language Proficiency, Language Tests, Speech Communication

Applications of Text Analysis Tools for Spoken Response Grading

Peer reviewed
PDF on ERIC

Download full text

Direct link

Crossley, Scott; McNamara, Danielle – Grantee Submission, 2013

Descriptors: English (Second Language), Language Proficiency, Language Tests, Speech Communication

Discourse Characteristics of Writing and Speaking Task Types on the "TOEFL iBT"® Test: A Lexico-Grammatical Analysis. "TOEFL iBT"® Research Report. TOEFL iBT-19. Research Report. RR-13-04

Peer reviewed
PDF on ERIC

Download full text

Biber, Douglas; Gray, Bethany – ETS Research Report Series, 2013

One of the major innovations of the "TOEFL iBT"® test is the incorporation of integrated tasks complementing the independent tasks to which examinees respond. In addition, examinees must produce discourse in both modes (speech and writing). The validity argument for the TOEFL iBT includes the claim that examinees vary their discourse in…

Descriptors: Discourse Analysis, English (Second Language), Second Language Learning, Language Tests

Effectiveness of Automated Chinese Sentence Scoring with Latent Semantic Analysis

Peer reviewed
PDF on ERIC

Download full text

Liao, Chen-Huei; Kuo, Bor-Chen; Pai, Kai-Chih – Turkish Online Journal of Educational Technology - TOJET, 2012

Automated scoring by means of Latent Semantic Analysis (LSA) has been introduced lately to improve the traditional human scoring system. The purposes of the present study were to develop a LSA-based assessment system to evaluate children's Chinese sentence construction skills and to examine the effectiveness of LSA-based automated scoring function…

Descriptors: Foreign Countries, Program Effectiveness, Scoring, Personality

Biber, Douglas	2
Crossley, Scott	2
McNamara, Danielle	2
Amanda Huee-Ping Wong	1
Brehm, Laurel	1
Crossley, Scott A.	1
Denis Dumas	1
Egbert, Jesse	1
Goldrick, Matthew	1
Gray, Bethany	1
Ivan Cherh Chiet Low	1
Jiyeo Yun	1
Kelly Berthiaume	1
Kuo, Bor-Chen	1
Liao, Chen-Huei	1
Lu, Xiaofei	1
McNamara, Danielle S.	1
Nathasha Vihangi Luke	1
Pai, Kai-Chih	1
Peter Organisciak	1
Reppen, Randi	1
Selcuk Acar	1
Staples, Shelley	1
Swapna Haresh Teckwani	1
Yang, Hae Sung	1
More ▼