ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	7
Since 2017 (last 10 years)	15
Since 2007 (last 20 years)	29

Descriptor

Computer Assisted Testing	39
Validity	39
Scoring	35
Essays	15
Reliability	14
Comparative Analysis	13
Writing Evaluation	11
Models	10
Correlation	9
Evaluation Methods	8
Scoring Rubrics	8
Automation	7
Student Evaluation	7
Essay Tests	6
Psychometrics	6
Standardized Tests	5
Writing Tests	5
Accuracy	4
Educational Assessment	4
Elementary Secondary Education	4
Evaluators	4
Foreign Countries	4
Higher Education	4
Performance Based Assessment	4
Second Language Learning	4

Publication Type

Journal Articles	29
Reports - Research	19
Reports - Evaluative	11
Reports - Descriptive	5
Speeches/Meeting Papers	4
Dissertations/Theses -…	3
Information Analyses	1

Education Level

Higher Education	11
Postsecondary Education	10
Elementary Secondary Education	5
Secondary Education	4
Elementary Education	2
High Schools	2
Middle Schools	2
Grade 6	1
Grade 7	1
Grade 8	1
Junior High Schools	1

Audience

Policymakers

Location

Australia	2
Connecticut	2
New Hampshire	2
New York	2
Rhode Island	2
United Kingdom (England)	2
Vermont	2
Canada	1
China	1
North Carolina (Greensboro)	1
Singapore	1
United States	1

Laws, Policies, & Programs

Every Student Succeeds Act…	2
Elementary and Secondary…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

National Assessment of…	3
New York State Regents…	2
Test of English as a Foreign…	2
Dynamic Indicators of Basic…	1
Graduate Record Examinations	1
United States Medical…	1

Showing 1 to 15 of 39 results

Automatic Prompt Engineering for Automatic Scoring

Peer reviewed

Mingfeng Xue; Yunting Liu; Xingyao Xiao; Mark Wilson – Journal of Educational Measurement, 2025

Prompts play a crucial role in eliciting accurate outputs from large language models (LLMs). This study examines the effectiveness of an automatic prompt engineering (APE) framework for automatic scoring in educational measurement. We collected constructed-response data from 930 students across 11 items and used human scores as the true labels. A…

Descriptors: Computer Assisted Testing, Prompting, Educational Assessment, Automation

On the Limitations of Human-Computer Agreement in Automated Essay Scoring

Peer reviewed
PDF on ERIC

Doewes, Afrizal; Pechenizkiy, Mykola – International Educational Data Mining Society, 2021

Scoring essays is generally an exhausting and time-consuming task for teachers. Automated Essay Scoring (AES) facilitates the scoring process to be faster and more consistent. The most logical way to assess the performance of an automated scorer is by measuring the score agreement with the human raters. However, we provide empirical evidence that…

Descriptors: Man Machine Systems, Automation, Computer Assisted Testing, Scoring

Reflections on the Application and Validation of Technology in Language Testing

Peer reviewed

Barry O'Sullivan – Language Assessment Quarterly, 2023

This paper highlights as issues of concern the rapid changes in technology and the tendency to report on partial validation efforts where the work is not identified as forming part of a larger validation project. With close human supervision emerging technologies can have a significant and positive impact on language testing. While technology…

Descriptors: Technology Uses in Education, Computer Assisted Testing, Language Tests, Supervision

Validity Arguments Meet Artificial Intelligence in Innovative Educational Assessment

Peer reviewed

Dorsey, David W.; Michaels, Hillary R. – Journal of Educational Measurement, 2022

We have dramatically advanced our ability to create rich, complex, and effective assessments across a range of uses through technology advancement. Artificial Intelligence (AI) enabled assessments represent one such area of advancement--one that has captured our collective interest and imagination. Scientists and practitioners within the domains…

Descriptors: Validity, Ethics, Artificial Intelligence, Evaluation Methods

Online Assessment of Students' Reasoning When Solving Example-Eliciting Tasks: Using Conjunction and Disjunction to Increase the Power of Examples

Peer reviewed

Yerushalmy, Michal; Olsher, Shai – ZDM: The International Journal on Mathematics Education, 2020

We argue that examples can do more than serve the purpose of illustrating the truth of an existential statement or disconfirming the truth of a universal statement. Our argument is relevant to the use of technology in classroom assessment. A central challenge of computer-assisted assessment is to develop ways of collecting rich and complex data…

Descriptors: Computer Assisted Testing, Student Evaluation, Problem Solving, Thinking Skills

Semantic Distance and the Alternate Uses Task: Recommendations for Reliable Automated Assessment of Originality

Peer reviewed

Beaty, Roger E.; Johnson, Dan R.; Zeitlen, Daniel C.; Forthmann, Boris – Creativity Research Journal, 2022

Semantic distance is increasingly used for automated scoring of originality on divergent thinking tasks, such as the Alternate Uses Task (AUT). Despite some psychometric support for semantic distance -- including positive correlations with human creativity ratings -- additional work is needed to optimize its reliability and validity, including…

Descriptors: Semantics, Scoring, Creative Thinking, Creativity

Using Latent Semantic Analysis to Score Short Answer Constructed Responses: Automated Scoring of the Consequences Test

Peer reviewed

LaVoie, Noelle; Parker, James; Legree, Peter J.; Ardison, Sharon; Kilcullen, Robert N. – Educational and Psychological Measurement, 2020

Automated scoring based on Latent Semantic Analysis (LSA) has been successfully used to score essays and constrained short answer responses. Scoring tests that capture open-ended, short answer responses poses some challenges for machine learning approaches. We used LSA techniques to score short answer responses to the Consequences Test, a measure…

Descriptors: Semantics, Evaluators, Essays, Scoring

Toward Culturally Responsive and Equitable Testing: Innovative Psychometric Analyses on Contextualized Measurement and Adaptive Testing

Nixi Wang – ProQuest LLC, 2022

Measurement errors attributable to cultural issues are complex and challenging for educational assessments. We need assessment tests sensitive to the cultural heterogeneity of populations, and psychometric methods appropriate to address fairness and equity concerns. Built on the research of culturally responsive assessment, this dissertation…

Descriptors: Culturally Relevant Education, Testing, Equal Education, Validity

Validation of an Automated Procedure for Calculating Core Lexicon from Transcripts

Peer reviewed

Dalton, Sarah Grace; Stark, Brielle C.; Fromm, Davida; Apple, Kristen; MacWhinney, Brian; Rensch, Amanda; Rowedder, Madyson – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The aim of this study was to advance the use of structured, monologic discourse analysis by validating an automated scoring procedure for core lexicon (CoreLex) using transcripts. Method: Forty-nine transcripts from persons with aphasia and 48 transcripts from persons with no brain injury were retrieved from the AphasiaBank database. Five…

Descriptors: Validity, Discourse Analysis, Databases, Scoring

A Historical Analysis of Technological Advances to Educational Testing: A Drive for Efficiency and the Interplay with Validity

Peer reviewed

Moncaleano, Sebastian; Russell, Michael – Journal of Applied Testing Technology, 2018

2017 marked a century since the development and administration of the first large-scale group administered standardized test. Since that time, both the importance of testing and the technology of testing have advanced significantly. This paper traces the technological advances that have led to the large-scale administration of educational tests in…

Descriptors: Technological Advancement, Standardized Tests, Computer Assisted Testing, Automation

Automated L2 Writing Performance Assessment: A Literature Review

Peer reviewed

Sari, Elif; Han, Turgay – Reading Matrix: An International Online Journal, 2021

Providing both effective feedback applications and reliable assessment practices are two central issues in ESL/EFL writing instruction contexts. Giving individual feedback is very difficult in crowded classes as it requires a great amount of time and effort for instructors. Moreover, instructors likely employ inconsistent assessment procedures,…

Descriptors: Automation, Writing Evaluation, Artificial Intelligence, Natural Language Processing

A Comparative Judgment Approach to Assessing Chinese Sign Language Interpreting

Peer reviewed

Han, Chao; Xiao, Xiaoyan – Language Testing, 2022

The quality of sign language interpreting (SLI) is a gripping construct among practitioners, educators and researchers, calling for reliable and valid assessment. There has been a diverse array of methods in the extant literature to measure SLI quality, ranging from traditional error analysis to recent rubric scoring. In this study, we want to…

Descriptors: Comparative Analysis, Sign Language, Deaf Interpreting, Evaluators

Objective Intelligibility Assessment by Automated Segmental and Suprasegmental Listening Error Analysis

Peer reviewed

Jiao, Yishan; LaCross, Amy; Berisha, Visar; Liss, Julie – Journal of Speech, Language, and Hearing Research, 2019

Purpose: Subjective speech intelligibility assessment is often preferred over more objective approaches that rely on transcript scoring. This is, in part, because of the intensive manual labor associated with extracting objective metrics from transcribed speech. In this study, we propose an automated approach for scoring transcripts that provides…

Descriptors: Suprasegmentals, Phonemes, Error Patterns, Scoring

Developing and Measuring Higher Order Skills: Models for State Performance Assessment Systems. Research Brief

Peer reviewed
PDF on ERIC

Darling-Hammond, Linda – Learning Policy Institute, 2017

After passage of the Every Student Succeeds Act (ESSA) in 2015, states assumed greater responsibility for designing their own accountability and assessment systems. ESSA requires states to measure "higher order thinking skills and understanding" and encourages the use of open-ended performance assessments, which are essential for…

Descriptors: Performance Based Assessment, Accountability, Portfolios (Background Materials), Task Analysis

Developing and Measuring Higher Order Skills: Models for State Performance Assessment Systems

Darling-Hammond, Linda – Council of Chief State School Officers, 2017

The Every Student Succeeds Act (ESSA) opened up new possibilities for how student and school success are defined and supported in American public education. States have greater responsibility for designing and building their assessment and accountability systems. These new opportunities to develop performance assessments are critically important…

Descriptors: Performance Based Assessment, Accountability, Portfolios (Background Materials), Task Analysis

Source

Assessing Writing	5
Journal of Educational…	3
ProQuest LLC	3
Educational and Psychological…	2
Journal of Applied Testing…	2
Journal of Speech, Language,…	2
Journal of Technology,…	2
Applied Psychological…	1
Assessment & Evaluation in…	1
Center for American Progress	1
Council of Chief State School…	1
Creativity Research Journal	1
ETS Research Report Series	1
Higher Education Quarterly	1
International Educational…	1
Journal of Educational…	1
Journal of Outcome Measurement	1
Journal of Psychoeducational…	1
Language Assessment Quarterly	1
Language Testing	1
Learning Policy Institute	1
Reading Matrix: An…	1
Theory Into Practice	1
ZDM: The International…	1

Author

Attali, Yigal	2
Darling-Hammond, Linda	2
Kelly, P. Adam	2
Ramineni, Chaitanya	2
Williamson, David M.	2
Apple, Kristen	1
Ardison, Sharon	1
Baldwin, Peter	1
Barry O'Sullivan	1
Beaty, Roger E.	1
Bejar, Isaac I.	1
Bergstrom, Betty	1
Berisha, Visar	1
Bhola, Dennison S.	1
Breyer, F. Jay	1
Brown, Gavin T. L.	1
Buckendahl, Chad W.	1
Burstein, Jill	1
Clauser, Brian	1
Condon, William	1
Dalton, Sarah Grace	1
Davis, Lawrence Edward	1
Deane, Paul	1
Doewes, Afrizal	1