Showing all 13 results
Deane, Paul; Quinlan, Thomas; Kostin, Irene – Educational Testing Service, 2011
ETS has recently instituted the Cognitively Based Assessments of, for, and as Learning (CBAL) research initiative to create a new generation of assessment designed from the ground up to enhance learning. It is intended as a general approach, covering multiple subject areas including reading, writing, and math. This paper is concerned with the…
Descriptors: Automation, Scoring, Educational Assessment, Writing Tests
Attali, Yigal – Educational Testing Service, 2011
The e-rater[R] automated essay scoring system is used operationally in the scoring of TOEFL iBT[R] independent essays. Previous research has found support for a 3-factor structure of the e-rater features. This 3-factor structure has an attractive hierarchical linguistic interpretation with a word choice factor, a grammatical convention within a…
Descriptors: Essay Tests, Language Tests, Test Scoring Machines, Automation
Attali, Yigal – Educational Testing Service, 2011
This paper proposes an alternative content measure for essay scoring, based on the "difference" in the relative frequency of a word in high-scored versus low-scored essays. The "differential word use" (DWU) measure is the average of these differences across all words in the essay. A positive value indicates the essay is using…
Descriptors: Scoring, Essay Tests, Word Frequency, Content Analysis
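The Attali (2011) entry above defines the differential word use (DWU) measure only informally. Below is a minimal sketch of one plausible reading, in which relative word frequencies estimated from high- and low-scored training essays are differenced and averaged over an essay's tokens; the function names, tokenization, and frequency normalization are assumptions for illustration, not ETS's implementation.

```python
from collections import Counter

def relative_freqs(essays):
    """Relative frequency of each word across a set of essays (assumed normalization)."""
    counts = Counter(w for essay in essays for w in essay.lower().split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}

def dwu(essay, high_freqs, low_freqs):
    """Average, over the essay's tokens, of (frequency among high-scored essays
    minus frequency among low-scored essays); positive values suggest vocabulary
    more typical of high-scored essays."""
    tokens = essay.lower().split()
    if not tokens:
        return 0.0
    diffs = [high_freqs.get(w, 0.0) - low_freqs.get(w, 0.0) for w in tokens]
    return sum(diffs) / len(diffs)

# Hypothetical usage with toy training essays
high = relative_freqs(["the evidence supports a nuanced and coherent argument",
                       "the author develops a persuasive claim with examples"])
low = relative_freqs(["i think it is good", "it is very good and i like it"])
print(dwu("the author supports a coherent claim", high, low))
```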
Dorans, Neil J.; Liang, Longjuan; Puhan, Gautam – Educational Testing Service, 2010
Scores are the most visible and widely used products of a testing program. The choice of score scale has implications for test specifications, equating, and test reliability and validity, as well as for test interpretation. At the same time, the score scale should be viewed as infrastructure likely to require repair at some point. In this report…
Descriptors: Testing Programs, Standard Setting (Scoring), Test Interpretation, Certification
Ricker-Pedley, Kathryn L. – Educational Testing Service, 2011
A pseudo-experimental study was conducted to examine the link between rater accuracy calibration performances and subsequent accuracy during operational scoring. The study asked 45 raters to score a 75-response calibration set and then a 100-response (operational) set of responses from a retired Graduate Record Examinations[R] (GRE[R]) writing…
Descriptors: Scoring, Accuracy, College Entrance Examinations, Writing Tests
Tan, Xuan; Ricker, Kathryn L.; Puhan, Gautam – Educational Testing Service, 2010
This study examines the differences in equating outcomes between two trend score equating designs resulting from two different scoring strategies for trend scoring when operational constructed-response (CR) items are double-scored--the single group (SG) design, where each trend CR item is double-scored, and the nonequivalent groups with anchor…
Descriptors: Equated Scores, Scoring, Responses, Test Items
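As background for the single-group (SG) design mentioned in the Tan, Ricker, and Puhan (2010) entry, the sketch below shows generic linear equating when the same examinees take both forms; it matches means and standard deviations to express a form-X score on the form-Y scale. The synthetic scores and the linear method are assumptions for illustration, not the study's trend-scoring procedure.

```python
import numpy as np

rng = np.random.default_rng(3)
# Single-group design: the same examinees take old form X and new form Y (synthetic scores).
x = rng.normal(50, 10, 1000)
y = 0.9 * x + 8 + rng.normal(0, 3, 1000)

def linear_equate(score_x, x, y):
    """Map a form-X score onto the form-Y scale by matching means and standard deviations."""
    return y.mean() + (y.std() / x.std()) * (score_x - x.mean())

print(linear_equate(55.0, x, y))   # a form-X score of 55 expressed on the form-Y scale
```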
DeCarlo, Lawrence T. – Educational Testing Service, 2010
A basic consideration in large-scale assessments that use constructed response (CR) items, such as essays, is how to allocate the essays to the raters that score them. Designs that are used in practice are incomplete, in that each essay is scored by only a subset of the raters, and also unbalanced, in that the number of essays scored by each rater…
Descriptors: Test Items, Responses, Essay Tests, Scoring
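The DeCarlo (2010) entry above describes rater allocation designs that are incomplete (each essay is scored by only a subset of raters) and unbalanced (raters end up with unequal numbers of essays). A minimal sketch of what such an essay-by-rater incidence matrix can look like follows; the sizes and the random assignment rule are assumptions, purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
n_essays, n_raters, raters_per_essay = 12, 5, 2

# Incomplete design: each essay is scored by only a small random subset of raters.
design = np.zeros((n_essays, n_raters), dtype=int)
for e in range(n_essays):
    design[e, rng.choice(n_raters, size=raters_per_essay, replace=False)] = 1

print(design)                                    # 1 = this rater scored this essay
print("essays per rater:", design.sum(axis=0))   # typically unequal, i.e. unbalanced
```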
Bennett, Randy Elliot – Educational Testing Service, 2011
CBAL, an acronym for Cognitively Based Assessment of, for, and as Learning, is a research initiative intended to create a model for an innovative K-12 assessment system that provides summative information for policy makers, as well as formative information for classroom instructional purposes. This paper summarizes empirical results from 16 CBAL…
Descriptors: Educational Assessment, Elementary Secondary Education, Summative Evaluation, Formative Evaluation
Santelices, Maria Veronica; Ugarte, Juan Jose; Flotts, Paulina; Radovic, Darinka; Kyllonen, Patrick – Educational Testing Service, 2011
This paper presents the development and initial validation of new measures of critical thinking and noncognitive attributes that were designed to supplement existing standardized tests used in the admissions system for higher education in Chile. The importance of various facets of this process, including the establishment of technical rigor and…
Descriptors: Foreign Countries, College Entrance Examinations, Test Construction, Test Validity
Deane, Paul – Educational Testing Service, 2011
This paper presents a socio-cognitive framework for connecting writing pedagogy and writing assessment with modern social and cognitive theories of writing. It focuses on providing a general framework that highlights the connections between writing competency and other literacy skills; identifies key connections between literacy instruction,…
Descriptors: Writing (Composition), Writing Evaluation, Writing Tests, Cognitive Ability
Dorans, Neil J.; Liu, Jinghua – Educational Testing Service, 2009
The equating process links scores from different editions of the same test. For testing programs that build nearly parallel forms to the same explicit content and statistical specifications and administer forms under the same conditions, the linkings between the forms are expected to be equatings. Score equity assessment (SEA) provides a useful…
Descriptors: Testing Programs, Mathematics Tests, Quality Control, Psychometrics
Chodorow, Martin; Burstein, Jill – Educational Testing Service, 2004
This study examines the relation between essay length and holistic scores assigned to Test of English as a Foreign Language[TM] (TOEFL[R]) essays by e-rater[R], the automated essay scoring system developed by ETS. Results show that an early version of the system, e-rater99, accounted for little variance in human reader scores beyond that which…
Descriptors: Essays, Test Scoring Machines, English (Second Language), Student Evaluation
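The Chodorow and Burstein (2004) entry above turns on how much variance in human scores an automated feature explains beyond essay length. The sketch below illustrates that kind of incremental R-squared comparison with two nested linear regressions on synthetic data; the variable names, data-generating choices, and use of scikit-learn are assumptions, not the study's method.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
n = 500
length = rng.normal(300, 80, n)            # essay length in words (synthetic)
quality = rng.normal(0, 1, n)              # latent writing quality (synthetic)
human_score = 0.01 * length + 0.5 * quality + rng.normal(0, 0.5, n)
auto_feature = 0.008 * length + 0.1 * quality + rng.normal(0, 0.3, n)

# R^2 from essay length alone
X_len = length.reshape(-1, 1)
r2_length = LinearRegression().fit(X_len, human_score).score(X_len, human_score)

# R^2 when the automated feature is added to length
X_both = np.column_stack([length, auto_feature])
r2_both = LinearRegression().fit(X_both, human_score).score(X_both, human_score)

print(f"R^2 (length only): {r2_length:.3f}")
print(f"Incremental R^2 from the automated feature: {r2_both - r2_length:.3f}")
```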
Rizavi, Saba; Way, Walter D.; Davey, Tim; Herbert, Erin – Educational Testing Service, 2004
Item parameter estimates vary for a variety of reasons, including estimation error, characteristics of the examinee samples, and context effects (e.g., item location effects, section location effects, etc.). Although we expect variation based on theory, there is reason to believe that observed variation in item parameter estimates exceeds what…
Descriptors: Adaptive Testing, Test Items, Computation, Context Effect
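The Rizavi, Way, Davey, and Herbert (2004) entry above distinguishes the variation in item parameter estimates we expect from estimation error alone from the excess variation attributed to sample characteristics and context effects. The simulation below shows only the estimation-error component for a single Rasch item, re-estimated across repeated examinee samples with a deliberately crude difficulty index; the model, sample sizes, and estimator are assumptions for illustration, not the paper's method.

```python
import numpy as np

rng = np.random.default_rng(2)
true_b = 0.5                        # true Rasch item difficulty (assumed)
n_examinees, n_replications = 2000, 200

estimates = []
for _ in range(n_replications):
    theta = rng.normal(0, 1, n_examinees)          # examinee abilities
    p = 1 / (1 + np.exp(-(theta - true_b)))        # Rasch response probabilities
    responses = rng.random(n_examinees) < p
    p_hat = responses.mean()
    # Crude difficulty index: negative logit of the observed item p-value.
    estimates.append(-np.log(p_hat / (1 - p_hat)))

print("SD of estimates across samples (estimation/sampling error only):",
      round(float(np.std(estimates)), 3))
```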