Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 8 |
| Since 2007 (last 20 years) | 21 |
Descriptor
| Computer Assisted Testing | 23 |
| Test Reliability | 12 |
| English (Second Language) | 10 |
| Language Tests | 10 |
| Scores | 10 |
| Second Language Learning | 10 |
| Correlation | 9 |
| Scoring | 9 |
| Interrater Reliability | 7 |
| Evaluators | 6 |
| Factor Analysis | 6 |
Source
| ETS Research Report Series | 23 |
Author
| Attali, Yigal | 3 |
| Ackerman, Debra J. | 2 |
| Casabianca, Jodi M. | 2 |
| Sinharay, Sandip | 2 |
| Zhang, Mo | 2 |
| Sevak, Amit | 1 |
| Becker, Dovid | 1 |
| Bejar, Isaac I. | 1 |
| Brenneman, Meghan | 1 |
| Brenneman, Meghan W. | 1 |
| Breyer, F. Jay | 1 |
Publication Type
| Journal Articles | 23 |
| Reports - Research | 21 |
| Tests/Questionnaires | 3 |
| Reports - Descriptive | 1 |
| Reports - Evaluative | 1 |
Education Level
| Higher Education | 5 |
| Postsecondary Education | 5 |
| Secondary Education | 5 |
| High Schools | 3 |
| Early Childhood Education | 2 |
| Elementary Education | 2 |
| Kindergarten | 2 |
| Primary Education | 2 |
| Adult Education | 1 |
| Grade 6 | 1 |
| Grade 7 | 1 |
Location
| Germany | 2 |
| Australia | 1 |
| China | 1 |
| Delaware | 1 |
| France | 1 |
| Illinois | 1 |
| Japan | 1 |
| Maryland | 1 |
| Netherlands | 1 |
| North Carolina | 1 |
| Ohio | 1 |
Assessments and Surveys
| Test of English as a Foreign Language | 9 |
| Graduate Record Examinations | 2 |
| Praxis Series | 1 |
| Program for International Student Assessment | 1 |
McCaffrey, Daniel F.; Casabianca, Jodi M.; Ricker-Pedley, Kathryn L.; Lawless, René R.; Wendler, Cathy – ETS Research Report Series, 2022
This document describes a set of best practices for developing, implementing, and maintaining the critical process of scoring constructed-response tasks. These practices address both the use of human raters and automated scoring systems as part of the scoring process and cover the scoring of written, spoken, performance, or multimodal responses.…
Descriptors: Best Practices, Scoring, Test Format, Computer Assisted Testing
Seybert, Jacob; Becker, Dovid – ETS Research Report Series, 2019
Forced-choice (FC) measures are becoming increasingly common in the assessment of personality for high-stakes testing purposes in both educational and organizational settings. Despite this, there has been relatively little research into the reliability of scores obtained from these measures, particularly when administered as a computerized…
Descriptors: Test Reliability, Personality Measures, Measurement Techniques, Computer Assisted Testing
Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction
Choi, Ikkyu; Hao, Jiangang; Deane, Paul; Zhang, Mo – ETS Research Report Series, 2021
"Biometrics" are physical or behavioral human characteristics that can be used to identify a person. It is widely known that keystroke or typing dynamics for short, fixed texts (e.g., passwords) could serve as a behavioral biometric. In this study, we investigate whether keystroke data from essay responses can lead to a reliable…
Descriptors: Accuracy, High Stakes Tests, Writing Tests, Benchmarking
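To make the idea of keystroke dynamics concrete, here is a minimal sketch of the kind of timing features such work can build on, assuming only a list of key-press timestamps per session. The feature set, distance measure, and sample timestamps are illustrative assumptions, not the study's actual model.

```python
# Hypothetical sketch: summarize inter-key intervals (ms) from two typing
# sessions and compare them with a crude relative-difference score.
import statistics

def interval_features(press_times_ms):
    """Summary statistics of the gaps between consecutive key presses."""
    gaps = [b - a for a, b in zip(press_times_ms, press_times_ms[1:])]
    return {
        "mean_gap": statistics.mean(gaps),
        "sd_gap": statistics.stdev(gaps),
        "median_gap": statistics.median(gaps),
    }

def feature_distance(f1, f2):
    """Mean relative difference across features; smaller suggests the same typist."""
    return sum(abs(f1[k] - f2[k]) / max(f1[k], 1e-9) for k in f1) / len(f1)

session_a = interval_features([0, 180, 350, 560, 700, 910, 1100])
session_b = interval_features([0, 190, 360, 540, 720, 905, 1080])
print(feature_distance(session_a, session_b))  # small value -> similar typing rhythm
```

In practice such systems use far richer features (digraph latencies, hold times) and a trained classifier rather than a fixed threshold; this sketch only shows the basic feature-extraction step.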
Rupp, André A.; Casabianca, Jodi M.; Krüger, Maleika; Keller, Stefan; Köller, Olaf – ETS Research Report Series, 2019
In this research report, we describe the design and empirical findings for a large-scale study of essay writing ability with approximately 2,500 high school students in Germany and Switzerland on the basis of 2 tasks with 2 associated prompts, each from a standardized writing assessment whose scoring involved both human and automated components.…
Descriptors: Automation, Foreign Countries, English (Second Language), Language Tests
Ackerman, Debra J. – ETS Research Report Series, 2020
Over the past 8 years, U.S. kindergarten classrooms have been impacted by policies mandating or recommending the administration of a specific kindergarten entry assessment (KEA) in the initial months of school as well as the increasing reliance on digital technology in the form of mobile apps, touchscreen devices, and online data platforms. Using…
Descriptors: Kindergarten, School Readiness, Computer Assisted Testing, Preschool Teachers
Yamamoto, Kentaro; He, Qiwei; Shin, Hyo Jeong; von Davier, Matthias – ETS Research Report Series, 2017
Approximately a third of the Programme for International Student Assessment (PISA) items in the core domains (math, reading, and science) are constructed-response items and require human coding (scoring). This process is time-consuming, expensive, and prone to error, as (a) humans often code inconsistently, and (b) coding reliability in…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
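As a concrete illustration of the coding-reliability issue this report addresses, the sketch below computes Cohen's kappa, a standard chance-corrected agreement index, between two sets of codes. The data are fabricated and this does not reproduce the report's own analyses.

```python
# Cohen's kappa between two coders (e.g., human vs. automated codes).
# kappa = (observed agreement - chance agreement) / (1 - chance agreement)
from collections import Counter

def cohen_kappa(rater_a, rater_b):
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    ca, cb = Counter(rater_a), Counter(rater_b)
    chance = sum(ca[c] * cb[c] for c in set(ca) | set(cb)) / (n * n)
    return (observed - chance) / (1 - chance)

human   = [1, 0, 2, 1, 1, 0, 2, 2, 1, 0]  # made-up item codes
machine = [1, 0, 2, 1, 0, 0, 2, 1, 1, 0]
print(round(cohen_kappa(human, machine), 3))  # 0.697
```

Weighted variants (e.g., quadratically weighted kappa) are common when codes are ordered scores rather than nominal categories.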
Ackerman, Debra J. – ETS Research Report Series, 2018
Kindergarten entry assessments (KEAs) have increasingly been incorporated into state education policies over the past 5 years, with much of this interest stemming from Race to the Top--Early Learning Challenge (RTT-ELC) awards, Enhanced Assessment Grants, and nationwide efforts to develop common K-12 state learning standards. Drawing on…
Descriptors: Screening Tests, Kindergarten, Test Validity, Test Reliability
Petway, Kevin T., II; Rikoon, Samuel H.; Brenneman, Meghan W.; Burrus, Jeremy; Roberts, Richard D. – ETS Research Report Series, 2016
The Mission Skills Assessment (MSA) is an online assessment that targets 6 noncognitive constructs: creativity, curiosity, ethics, resilience, teamwork, and time management. Each construct is measured by means of a student self-report scale, a student alternative scale (e.g., situational judgment test), and a teacher report scale. Use of the MSA…
Descriptors: Test Construction, Computer Assisted Testing, Creativity, Imagination
Markle, Ross; Olivera-Aguilar, Margarita; Jackson, Teresa; Noeth, Richard; Robbins, Steven – ETS Research Report Series, 2013
The "SuccessNavigator"™ assessment is an online, 30 minute self-assessment of psychosocial and study skills designed for students entering postsecondary education. In addition to providing feedback in areas such as classroom and study behaviors, commitment to educational goals, management of academic stress, and connection to social…
Descriptors: Self Evaluation (Individuals), Computer Assisted Testing, Test Reliability, Test Validity
Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015
The "e-rater"® automated essay scoring system is used operationally in the scoring of the argument and issue tasks that form the Analytical Writing measure of the "GRE"® General Test. For each of these tasks, this study explored the value added of reporting 4 trait scores for each of these 2 tasks over the total e-rater score.…
Descriptors: Scores, Computer Assisted Testing, Computer Software, Grammar
Steinberg, Jonathan; Brenneman, Meghan; Castellano, Karen; Lin, Peng; Miller, Susanne – ETS Research Report Series, 2014
Test providers are increasingly moving toward exclusively administering assessments by computer. Computerized testing is becoming more desirable for test takers because of increased opportunities to test, faster turnaround of individual scores, or perhaps other factors, offering potential benefits for those who may be struggling to pass licensure…
Descriptors: Comparative Analysis, Achievement Gap, Academic Achievement, Test Format
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Sawaki, Yasuyo; Sinharay, Sandip – ETS Research Report Series, 2013
This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the "TOEFL iBT"® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…
Descriptors: Scores, Computer Assisted Testing, Factor Analysis, Correlation
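For readers unfamiliar with section-score reliability, the sketch below computes Cronbach's alpha, one conventional internal-consistency index of the general kind such psychometric analyses report. The score matrix is fabricated, and this is not the study's actual psychometric model, which also involves dimensionality and profile analyses.

```python
# Cronbach's alpha: alpha = k/(k-1) * (1 - sum of item variances / variance of totals)
import numpy as np

def cronbach_alpha(scores):
    """scores: 2-D array-like, rows = test takers, columns = items or sections."""
    scores = np.asarray(scores, dtype=float)
    k = scores.shape[1]
    item_vars = scores.var(axis=0, ddof=1).sum()
    total_var = scores.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1 - item_vars / total_var)

# Fabricated scores for 5 test takers on 4 sections.
scores = [[22, 24, 20, 23],
          [15, 14, 16, 13],
          [28, 27, 29, 30],
          [18, 20, 17, 19],
          [25, 23, 24, 26]]
print(round(cronbach_alpha(scores), 3))  # prints the alpha estimate
```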
Jamieson, Joan; Poonpon, Kornwipa – ETS Research Report Series, 2013
Research and development of a new type of scoring rubric for the integrated speaking tasks of "TOEFL iBT"® are described. These "analytic rating guides" could be helpful if tasks modeled after those in TOEFL iBT were used for formative assessment, a purpose which is different from TOEFL iBT's primary use for admission…
Descriptors: Oral Language, Language Proficiency, Scaling, Scores