Weeks, Jonathan; Baron, Patricia – Educational Testing Service, 2021
The current project, Exploring Math Education Relations by Analyzing Large Data Sets (EMERALDS) II, is an attempt to identify specific Common Core State Standards procedural, conceptual, and problem-solving competencies in earlier grades that best predict success in algebraic areas in later grades. The data for this study include two cohorts of…
Descriptors: Mathematics Education, Common Core State Standards, Problem Solving, Mathematics Tests
Sinharay, Sandip; Haberman, Shelby J.; Jia, Helena – Educational Testing Service, 2011
Standard 3.9 of the "Standards for Educational and Psychological Testing" (American Educational Research Association, American Psychological Association, & National Council on Measurement in Education, 1999) demands evidence of model fit when an item response theory (IRT) model is used to make inferences from a data set. We applied two recently…
Descriptors: Item Response Theory, Goodness of Fit, Statistical Analysis, Language Tests
Fife, James H.; Graf, Edith Aurora; Ohls, Sarah – Educational Testing Service, 2011
Six tasks, selected from assessments administered in 2007 as part of the Cognitively-Based Assessments of, for, and as Learning (CBAL) project, were revised in an effort to remove difficulties with the tasks that were unrelated to the construct being assessed. Because the revised tasks were piloted on a different population from the original…
Descriptors: Mathematics Tests, Responses, Test Construction, Construct Validity
Santelices, Maria Veronica; Ugarte, Juan Jose; Flotts, Paulina; Radovic, Darinka; Kyllonen, Patrick – Educational Testing Service, 2011
This paper presents the development and initial validation of new measures of critical thinking and noncognitive attributes that were designed to supplement existing standardized tests used in the admissions system for higher education in Chile. The importance of various facets of this process, including the establishment of technical rigor and…
Descriptors: Foreign Countries, College Entrance Examinations, Test Construction, Test Validity
Tan, Xuan; Xiang, Bihua; Dorans, Neil J.; Qu, Yanxuan – Educational Testing Service, 2010
The nature of the matching criterion (usually the total score) in the study of differential item functioning (DIF) has been shown to impact the accuracy of different DIF detection procedures. One of the topics related to the nature of the matching criterion is whether the studied item should be included. Although many studies exist that suggest…
Descriptors: Test Bias, Test Items, Item Response Theory
Sinharay, Sandip; Haberman, Shelby – Educational Testing Service, 2011
Recently, the literature has seen increasing interest in subscores for their potential diagnostic values; for example, one study suggested the report of weighted averages of a subscore and the total score, whereas others showed, for various operational and simulated data sets, that weighted averages, as compared to subscores, lead to more accurate…
Descriptors: Equated Scores, Weighted Scores, Tests, Statistical Analysis
Deane, Paul; Quinlan, Thomas; Kostin, Irene – Educational Testing Service, 2011
ETS has recently instituted the Cognitively Based Assessments of, for, and as Learning (CBAL) research initiative to create a new generation of assessments designed from the ground up to enhance learning. It is intended as a general approach, covering multiple subject areas including reading, writing, and math. This paper is concerned with the…
Descriptors: Automation, Scoring, Educational Assessment, Writing Tests
Steinberg, Jonathan; Cline, Frederick; Sawaki, Yasuyo – Educational Testing Service, 2011
This study examined the scores on a state standards-based Grade 5 Science assessment obtained by a group of students without learning disabilities who took the standard form of the test and by three groups of students with learning disabilities: one taking the standard form of the test without accommodations or modifications, a second taking the…
Descriptors: Learning Disabilities, State Standards, Educational Improvement, Science Tests
Moses, Tim; Deng, Weiling; Zhang, Yu-Li – Educational Testing Service, 2010
In the equating literature, a recurring concern is that equating functions that utilize a single anchor to account for examinee groups' nonequivalence are biased when the groups are extremely different and/or when the anchor only weakly measures what the tests measure. Several proposals have been made to address this equating bias by incorporating…
Descriptors: Equated Scores, Data Collection, Statistical Analysis, Differences
Coley, Richard J.; Sum, Andrew – Educational Testing Service, 2012
As the 21st century unfolds, the United States faces historic challenges, including a struggling economy, an aging infrastructure and global terrorism. Solutions will have to come from educated, skilled citizens who understand and believe in our democratic system and are civically engaged. This incisive new report examines these fault lines and…
Descriptors: Citizen Participation, Democracy, Citizenship Education, Civics
Bridgeman, Brent; McBride, Amanda; Monaghan, William – Educational Testing Service, 2004
Imposing time limits on tests can serve a range of important functions. Time limits are essential, for example, if speed of performance is an integral component of what is being measured, as would be the case when testing such skills as how quickly someone can type. Limiting testing time also helps contain expenses associated with test…
Descriptors: Computer Assisted Testing, Timed Tests, Test Results, Aptitude Tests
Educational Testing Service, 2008
The Test of English as a Foreign Language™, better known as TOEFL®, is designed to measure the English-language proficiency of people whose native language is not English. TOEFL scores are accepted by more than 6,000 colleges, universities, and licensing agencies in 130 countries. The test is also used by governments, and scholarship and…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Computer Assisted Testing
Chodorow, Martin; Burstein, Jill – Educational Testing Service, 2004
This study examines the relation between essay length and holistic scores assigned to Test of English as a Foreign Language™ (TOEFL®) essays by e-rater®, the automated essay scoring system developed by ETS. Results show that an early version of the system, e-rater99, accounted for little variance in human reader scores beyond that which…
Descriptors: Essays, Test Scoring Machines, English (Second Language), Student Evaluation
Breland, Hunter; Lee, Yong-Won; Najarian, Michelle; Muraki, Eiji – Educational Testing Service, 2004
This investigation of the comparability of writing assessment prompts was conducted in two phases. In an exploratory Phase I, 47 writing prompts administered in the computer-based Test of English as a Foreign Language™ (TOEFL® CBT) from July through December 1998 were examined. Logistic regression procedures were used to estimate prompt…
Descriptors: Writing Evaluation, Quality Control, Gender Differences, Writing Tests