ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	6

Descriptor

Computer Assisted Testing	9
Interrater Reliability	9
Test Items	9
Difficulty Level	6
English (Second Language)	4
Language Tests	4
Scores	4
Scoring	4
Second Language Learning	4
Foreign Countries	3
Comparative Analysis	2
Computer Software	2
Evaluators	2
Higher Education	2
Item Analysis	2
Language Proficiency	2
Models	2
Reading Tests	2
Statistical Analysis	2
Test Construction	2
Test Reliability	2
Test Validity	2
Testing	2
Accuracy	1
Achievement Tests	1
More ▼

Source

ETS Research Report Series	2
European Journal of Science…	1
Language Assessment Quarterly	1
New Directions for Teaching…	1
Online Submission	1
Smarter Balanced Assessment…	1

Publication Type

Journal Articles	5
Reports - Research	5
Reports - Evaluative	2
Tests/Questionnaires	2
Books	1
Collected Works - General	1
Dissertations/Theses -…	1
Numerical/Quantitative Data	1

Education Level

Higher Education	2
Secondary Education	2
Elementary Education	1
Elementary Secondary Education	1
High Schools	1
Postsecondary Education	1

Audience

Practitioners	1
Teachers	1

Location

Australia	1
China	1
France	1
Germany	1
Japan	1
Netherlands	1
South Korea	1
Washington	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	2
Program for International…	1

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed
PDF on ERIC

Download full text

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

Developing a Machine-Supported Coding System for Constructed-Response Items in PISA. Research Report. ETS RR-17-47

Peer reviewed
PDF on ERIC

Download full text

Yamamoto, Kentaro; He, Qiwei; Shin, Hyo Jeong; von Davier, Mattias – ETS Research Report Series, 2017

Approximately a third of the Programme for International Student Assessment (PISA) items in the core domains (math, reading, and science) are constructed-response items and require human coding (scoring). This process is time-consuming, expensive, and prone to error as often (a) humans code inconsistently, and (b) coding reliability in…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

Age, Task Characteristics, and Acoustic Indicators of Engagement: Investigations into the Validity of a Technology-Enhanced Speaking Test for Young Language Learners

Download full text

Edward Paul Getman – Online Submission, 2020

Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement

Smarter Balanced Assessment Consortium: Alignment Study Report. Revised

Download full text

Smarter Balanced Assessment Consortium, 2016

The goal of this study was to gather comprehensive evidence about the alignment of the Smarter Balanced summative assessments to the Common Core State Standards (CCSS). Alignment of the Smarter Balanced summative assessments to the CCSS is a critical piece of evidence regarding the validity of inferences students, teachers and policy makers can…

Descriptors: Alignment (Education), Summative Evaluation, Common Core State Standards, Test Content

The Role of Lexical Properties and Cohesive Devices in Text Integration and Their Effect on Human Ratings of Speaking Proficiency

Peer reviewed

Direct link

Crossley, Scott; Clevinger, Amanda; Kim, YouJin – Language Assessment Quarterly, 2014

There has been a growing interest in the use of integrated tasks in the field of second language testing to enhance the authenticity of language tests. However, the role of text integration in test takers' performance has not been widely investigated. The purpose of the current study is to examine the effects of text-based relational (i.e.,…

Descriptors: Language Proficiency, Connected Discourse, Language Tests, English (Second Language)

Investigating the Suitability of Implementing the "e-rater"® Scoring Engine in a Large-Scale English Language Testing Program. Research Report. ETS RR-13-36

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013

In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…

Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests

Generalizability, Validity, and Examinee Perceptions of a Computer-Delivered Formulating-Hypotheses Test. GRE Board Professional Report No. 90-02aP.

Download full text

Bennett, Randy Elliot; Rock, Donald A. – 1993

Formulating-Hypotheses (F-H) items present a situation and ask the examinee to generate as many explanations for it as possible. This study examined the generalizability, validity, and examinee perceptions of a computer-delivered version of the task. Eight F-H questions were administered to 192 graduate students. Half of the items restricted…

Descriptors: Computer Assisted Testing, Difficulty Level, Generalizability Theory, Graduate Students

Psychometric Properties of Student Ratings of Instruction in Online and On-Campus Courses

Peer reviewed

Direct link

McGhee, Debbie E.; Lowell, Nana – New Directions for Teaching and Learning, 2003

This study compares mean ratings, inter-rater reliabilities, and the factor structure of items for online and paper student-rating forms from the University of Washington's Instructional Assessment System. (Contains 3 figures and 2 tables.)

Descriptors: Psychometrics, Factor Structure, Student Evaluation of Teacher Performance, Test Items

Technology and Language Testing. A Collection of Papers from the Annual Colloquium on Language Testing Research (7th, Princeton, New Jersey, April 6-9, 1985).

Stansfield, Charles W., Ed. – 1986

This collection of essays on measurement theory and language testing includes: "Computerized Adaptive Testing: Implications for Language Test Developers" (Peter Tung); "The Promise and Threat of Computerized Adaptive Assessment of Reading Comprehension" (Michael Canale); "Computerized Rasch Analysis of Item Bias in ESL…

Descriptors: Chinese, Cloze Procedure, Computer Assisted Testing, Computer Software

Bennett, Randy Elliot	1
Braithwaite, Nicholas St. J.	1
Breyer, F. Jay	1
Clevinger, Amanda	1
Crossley, Scott	1
Edward Paul Getman	1
He, Qiwei	1
Hedgeland, Holly	1
Jordan, Sally E.	1
Kim, YouJin	1
Lorenz, Florian	1
Lowell, Nana	1
McGhee, Debbie E.	1
Parker, Mark A. J.	1
Rock, Donald A.	1
Shin, Hyo Jeong	1
Stansfield, Charles W., Ed.	1
Yamamoto, Kentaro	1
Zhang, Mo	1
von Davier, Mattias	1
More ▼