ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	8

Descriptor

Computer Assisted Testing	9
Interrater Reliability	9
Foreign Countries	6
Scoring	5
Secondary School Students	4
Correlation	3
English (Second Language)	3
Second Language Learning	3
Test Construction	3
Test Reliability	3
Adolescents	2
Comparative Analysis	2
Computer Software	2
Construct Validity	2
Educational Technology	2
Essays	2
Evaluation Methods	2
Evaluators	2
Grading	2
High School Students	2
Language Tests	2
Test Items	2
Test Scoring Machines	2
Test Validity	2
Accuracy	1
More ▼

Source

ETS Research Report Series	2
American College Testing…	1
Computers & Education	1
E-Learning	1
English Language Teaching	1
European Journal of Science…	1
Journal of Mental Health…	1
ReCALL	1

Publication Type

Journal Articles	8
Reports - Research	7
Reports - Evaluative	2
Tests/Questionnaires	1

Education Level

Secondary Education	9
High Schools	3
Elementary Secondary Education	2
Higher Education	2
Postsecondary Education	2

Audience

Location

China	2
Germany	2
Australia	1
France	1
Hong Kong	1
Japan	1
Netherlands	1
South Korea	1
Sweden	1
Switzerland	1
United Kingdom	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	1
Strengths and Difficulties…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed
PDF on ERIC

Download full text

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

Automated Essay Scoring at Scale: A Case Study in Switzerland and Germany. TOEFL® Research Report. RR-86. ETS RR-19-12

Peer reviewed
PDF on ERIC

Download full text

Rupp, André A.; Casabianca, Jodi M.; Krüger, Maleika; Keller, Stefan; Köller, Olaf – ETS Research Report Series, 2019

In this research report, we describe the design and empirical findings for a large-scale study of essay writing ability with approximately 2,500 high school students in Germany and Switzerland on the basis of 2 tasks with 2 associated prompts, each from a standardized writing assessment whose scoring involved both human and automated components.…

Descriptors: Automation, Foreign Countries, English (Second Language), Language Tests

Comparison of Automatic and Expert Teachers' Rating of Computerized English Listening-Speaking Test

Peer reviewed
PDF on ERIC

Download full text

Linlin, Cao – English Language Teaching, 2020

Through Many-Facet Rasch analysis, this study explores the rating differences between 1 computer automatic rater and 5 expert teacher raters on scoring 119 students in a computerized English listening-speaking test. Results indicate that both automatic and the teacher raters demonstrate good inter-rater reliability, though the automatic rater…

Descriptors: Language Tests, Computer Assisted Testing, English (Second Language), Second Language Learning

Developing a Machine-Supported Coding System for Constructed-Response Items in PISA. Research Report. ETS RR-17-47

Peer reviewed
PDF on ERIC

Download full text

Yamamoto, Kentaro; He, Qiwei; Shin, Hyo Jeong; von Davier, Mattias – ETS Research Report Series, 2017

Approximately a third of the Programme for International Student Assessment (PISA) items in the core domains (math, reading, and science) are constructed-response items and require human coding (scoring). This process is time-consuming, expensive, and prone to error as often (a) humans code inconsistently, and (b) coding reliability in…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

Subjective Mental Health, Peer Relations, Family, and School Environment in Adolescents with Intellectual Developmental Disorder: A First Report of a New Questionnaire Administered on Tablet PCs

Peer reviewed

Direct link

Boström, Petra; Johnels, Jakob Åsberg; Thorson, Maria; Broberg, Malin – Journal of Mental Health Research in Intellectual Disabilities, 2016

Few studies have explored the subjective mental health of adolescents with intellectual disabilities, while proxy ratings indicate an overrepresentation of mental health problems. The present study reports on the design and an initial empirical evaluation of the Well-being in Special Education Questionnaire (WellSEQ). Questions, response scales,…

Descriptors: Mental Health, Peer Relationship, Family Environment, Educational Environment

Experimenting with a Computer Essay-Scoring Program Based on ESL Student Writing Scripts

Peer reviewed

Direct link

Coniam, David – ReCALL, 2009

This paper describes a study of the computer essay-scoring program BETSY. While the use of computers in rating written scripts has been criticised in some quarters for lacking transparency or lack of fit with how human raters rate written scripts, a number of essay rating programs are available commercially, many of which claim to offer comparable…

Descriptors: Writing Tests, Scoring, Foreign Countries, Interrater Reliability

Essay Marking On-Screen: Implications for Assessment Validity

Peer reviewed

Direct link

Shaw, Stuart – E-Learning, 2008

Computer-assisted assessment offers many benefits over traditional paper methods. However, in transferring from one medium to another, it is crucial to ascertain the extent to which the new medium may alter the nature of traditional assessment practice or affect marking reliability. Whilst there is a substantial body of research comparing marking…

Descriptors: Construct Validity, Writing Instruction, Computer Assisted Testing, Student Evaluation

Assessing Creative Problem-Solving with Automated Text Grading

Peer reviewed

Direct link

Wang, Hao-Chuan; Chang, Chun-Yen; Li, Tsai-Yen – Computers & Education, 2008

The work aims to improve the assessment of creative problem-solving in science education by employing language technologies and computational-statistical machine learning methods to grade students' natural language responses automatically. To evaluate constructs like creative problem-solving with validity, open-ended questions that elicit…

Descriptors: Interrater Reliability, Earth Science, Problem Solving, Grading

Inventory of Work-Relevant Values: 2001 Revision. ACT Research Report Series, 2004-03

Download full text

Bobek, Becky L.; Gore, Paul A. – American College Testing (ACT), Inc., 2004

This research report describes changes made to the Inventory of Work-Relevant Values when it was revised for online use as a part of the Internet version of DISCOVER. Users will see the following differences between the online and CD-ROM versions of the inventory: 22 items rather than 61, simplified presentation, and the contribution of all items…

Descriptors: Interrater Reliability, Field Tests, Internet, Test Construction

Bobek, Becky L.	1
Boström, Petra	1
Braithwaite, Nicholas St. J.	1
Broberg, Malin	1
Casabianca, Jodi M.	1
Chang, Chun-Yen	1
Coniam, David	1
Gore, Paul A.	1
He, Qiwei	1
Hedgeland, Holly	1
Johnels, Jakob Åsberg	1
Jordan, Sally E.	1
Keller, Stefan	1
Krüger, Maleika	1
Köller, Olaf	1
Li, Tsai-Yen	1
Linlin, Cao	1
Parker, Mark A. J.	1
Rupp, André A.	1
Shaw, Stuart	1
Shin, Hyo Jeong	1
Thorson, Maria	1
Wang, Hao-Chuan	1
Yamamoto, Kentaro	1
von Davier, Mattias	1
More ▼