ERIC - Search Results

Publication Date

In 2025	0
Since 2024	4
Since 2021 (last 5 years)	30
Since 2016 (last 10 years)	104
Since 2006 (last 20 years)	260

Descriptor

Scores	195
Statistical Analysis	94
Equated Scores	78
Test Items	77
Correlation	72
English (Second Language)	69
Language Tests	69
Second Language Learning	67
Comparative Analysis	65
Item Response Theory	55
College Entrance Examinations	52
Computer Assisted Testing	49
Regression (Statistics)	41
Scoring	40
Foreign Countries	39
Accuracy	38
Computation	37
Error of Measurement	37
Test Validity	35
Test Construction	34
Graduate Study	31
Difficulty Level	30
Language Proficiency	28
Models	28
Sample Size	25
More ▼

Source

ETS Research Report Series

285

Publication Type

Journal Articles	285
Reports - Research	275
Tests/Questionnaires	34
Reports - Descriptive	9
Numerical/Quantitative Data	7
Information Analyses	4
Speeches/Meeting Papers	4
Collected Works - General	2

Education Level

Higher Education	86
Postsecondary Education	79
Secondary Education	37
Elementary Education	28
Middle Schools	19
Junior High Schools	18
High Schools	16
Grade 8	11
Intermediate Grades	10
Elementary Secondary Education	9
Grade 4	8
Grade 6	6
Early Childhood Education	5
Primary Education	5
Grade 3	4
Grade 5	4
Grade 7	4
Adult Education	2
Grade 1	2
Kindergarten	2
Adult Basic Education	1
Grade 10	1
Grade 12	1
Grade 9	1
High School Equivalency…	1
More ▼

Audience

Location

China	8
Japan	7
New Jersey	7
United States	6
South Korea	5
Taiwan	5
Canada	4
Asia	3
Indiana	3
Mexico	3
Arizona	2
Australia	2
Chile	2
Colombia	2
Florida	2
France	2
India	2
Latin America	2
Massachusetts	2
Thailand	2
United Kingdom	2
Vietnam	2
Armenia	1
Brazil	1
California (Los Angeles)	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

What Works Clearinghouse Rating

ETS Research Report Series X

Showing 1 to 15 of 285 results Save | Export

Effect of Statistically Matching Equating Samples for Common-Item Equating. Research Report. ETS RR-21-02

Peer reviewed
PDF on ERIC

Download full text

Lu, Ru; Kim, Sooyeon – ETS Research Report Series, 2021

This study evaluated the impact of subgroup weighting for equating through a common-item anchor. We used data from a single test form to create two research forms for which the equating relationship was known. The results showed that equating was most accurate when the new form and reference form samples were weighted to be similar to the target…

Descriptors: Equated Scores, Weighted Scores, Raw Scores, Test Items

Comparisons among Approaches to Link Tests Using Random Samples Selected under Suboptimal Conditions. Research Report. ETS RR-21-14

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Walker, Michael E. – ETS Research Report Series, 2021

Equating the scores from different forms of a test requires collecting data that link the forms. Problems arise when the test forms to be linked are given to groups that are not equivalent and the forms share no common items by which to measure or adjust for this group nonequivalence. We compared three approaches to adjusting for group…

Descriptors: Equated Scores, Weighted Scores, Sampling, Multiple Choice Tests

Mapping "TOEFL® Essentials"™ Test Scores to the Canadian Language Benchmarks. "TOEFL"® Research Report. TOEFL-RR-100. ETS Research Report No. RR-22-16

Peer reviewed
PDF on ERIC

Download full text

Papageorgiou, Spiros; Davis, Larry; Ohta, Renka; Gomez, Pablo Garcia – ETS Research Report Series, 2022

In this research report, we describe a study to map the scores of the "TOEFL® Essentials"™ test to the Canadian Language Benchmarks (CLB). The TOEFL Essentials test is a four-skills assessment of foundational English language skills and communication abilities in academic and general (daily life) contexts. At the time of writing this…

Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning

Methods for Imputing Scores When All Responses Are Missing for One or More Polytomous Items: Accuracy and Impact on Psychometric Property. Research Report. ETS RR-23-07

Peer reviewed
PDF on ERIC

Download full text

Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2023

Though a substantial amount of research exists on imputing missing scores in educational assessments, there is little research on cases where responses or scores to an item are missing for all test takers. In this paper, we tackled the problem of imputing missing scores for tests for which the responses to an item are missing for all test takers.…

Descriptors: Scores, Test Items, Accuracy, Psychometrics

Detecting the Impact of Remote Proctored At-Home Testing Using Propensity Score Weighting. Research Report. ETS RR-24-11

Peer reviewed
PDF on ERIC

Download full text

Jing Miao; Yi Cao; Michael E. Walker – ETS Research Report Series, 2024

Studies of test score comparability have been conducted at different stages in the history of testing to ensure that test results carry the same meaning regardless of test conditions. The expansion of at-home testing via remote proctoring sparked another round of interest. This study uses data from three licensure tests to assess potential mode…

Descriptors: Testing, Test Format, Computer Assisted Testing, Home Study

Practical Considerations in Item Calibration with Small Samples under Multistage Test Design: A Case Study. Research Report. ETS RR-24-03

Peer reviewed
PDF on ERIC

Download full text

Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixong Gu – ETS Research Report Series, 2024

The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…

Descriptors: Test Items, Test Construction, Sample Size, Scaling

Evaluating Targeted Double Scoring for the Performance Assessment for School Leaders Using Imputation and Decision Theory. Research Report. ETS RR-23-01

Peer reviewed
PDF on ERIC

Download full text

Jing Miao; Sandip Sinharay; Chris Kelbaugh; Yi Cao; Wei Wang – ETS Research Report Series, 2023

In a targeted double-scoring procedure for performance assessments that are used for licensure and certification purposes, a subset of responses receives an independent second rating if the first rating falls into a preidentified critical score range (CSR) where an additional rating would lead to considerably more reliable pass-fail decisions.…

Descriptors: Scoring, Performance Based Assessment, Licensing Examinations (Professions), Certification

Scoring Essays on an iPad Versus a Desktop Computer: An Exploratory Study. Research Report. ETS RR-22-08

Peer reviewed
PDF on ERIC

Download full text

Ling, Guangming; Williams, Jean; O'Brien, Sue; Cavalie, Carlos F. – ETS Research Report Series, 2022

Recognizing the appealing features of a tablet (e.g., an iPad), including size, mobility, touch screen display, and virtual keyboard, more educational professionals are moving away from larger laptop and desktop computers and turning to the iPad for their daily work, such as reading and writing. Following the results of a recent survey of…

Descriptors: Tablet Computers, Computers, Essays, Scoring

Building a Validity Argument for the TOEFL Junior® Tests. TOEFL® Research Report. RR-102. ETS RR-24-05

Peer reviewed
PDF on ERIC

Download full text

Ching-Ni Hsieh – ETS Research Report Series, 2024

The TOEFL Junior® tests are designed to evaluate young language students' English reading, listening, speaking, and writing skills in an English-medium secondary instructional context. This paper articulates a validity argument constructed to support the use and interpretation of the TOEFL Junior test scores for the purpose of placement, progress…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

Observed Scores as Matching Variables in Differential Item Functioning under the One- and Two-Parameter Logistic Models: Population Results. Research Report. ETS RR-19-06

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2019

We derive formulas for the differential item functioning (DIF) measures that two routinely used DIF statistics are designed to estimate. The DIF measures that match on observed scores are compared to DIF measures based on an unobserved ability (theta or true score) for items that are described by either the one-parameter logistic (1PL) or…

Descriptors: Scores, Test Bias, Statistical Analysis, Item Response Theory

Alternative Methods for Item Parameter Estimation: From CTT to IRT. Research Report. ETS RR-22-12

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Lu, Ru; Johnson, Matthew S.; McCaffrey, Dan F. – ETS Research Report Series, 2022

It is desirable for an educational assessment to be constructed of items that can differentiate different performance levels of test takers, and thus it is important to estimate accurately the item discrimination parameters in either classical test theory or item response theory. It is particularly challenging to do so when the sample sizes are…

Descriptors: Test Items, Item Response Theory, Item Analysis, Educational Assessment

Studies of Possible Effects of GRE® ScoreSelect® on Subgroup Differences in GRE® General Test Scores. Research Report. ETS RR-22-13

Peer reviewed
PDF on ERIC

Download full text

Klieger, David M.; Kotloff, Lauren J.; Belur, Vinetha; Schramm-Possinger, Megan E.; Holtzman, Steven L.; Bunde, Hezekiah – ETS Research Report Series, 2022

Intended consequences of giving applicants the option to select which test scores to report include potentially reducing measurement error and inequity in applicants' prior test familiarity. Our first study determined whether score choice options resulted in unintended consequences for lower performing subgroups by detrimentally increasing score…

Descriptors: College Entrance Examinations, Graduate Study, Scores, High Stakes Tests

Investigating Constructed-Response Scoring over Time: The Effects of Study Design on Trend Rescore Statistics. Research Report. ETS RR-22-15

Peer reviewed
PDF on ERIC

Download full text

Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022

When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…

Descriptors: Item Response Theory, Test Construction, Scoring, Testing

Effect of Immediate Elaborated Feedback on Rater Accuracy. Research Report. ETS RR-20-09

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal – ETS Research Report Series, 2020

Principles of skill acquisition dictate that raters should be provided with frequent feedback about their ratings. However, in current operational practice, raters rarely receive immediate feedback about their scores owing to the prohibitive effort required to generate such feedback. An approach for generating and administering feedback responses…

Descriptors: Feedback (Response), Evaluators, Accuracy, Scores

Interpretation and Use of a Workplace English Language Proficiency Test Score Report: Perspectives of TOEIC[R] Test Takers and Score Users in Taiwan. RR-23-10

Peer reviewed
PDF on ERIC

Download full text

Ching-Ni Hsieh – ETS Research Report Series, 2023

Research in validity suggests that stakeholders' interpretation and use of test results should be an aspect of validity. Claims about the meaningfulness of test score interpretations and consequences of test use should be backed by evidence that stakeholders understand the definition of the construct assessed and the score report information. The…

Descriptors: Foreign Countries, Language Proficiency, English (Second Language), Language Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 19

Test of English as a Foreign…	49
Graduate Record Examinations	29
SAT (College Admission Test)	19
Test of English for…	15
Praxis Series	13
ACT Assessment	6
National Assessment of…	5
College Level Examination…	3
Early Childhood Longitudinal…	3
Graduate Management Admission…	3
Advanced Placement…	2
Major Field Achievement Test…	2
National Merit Scholarship…	2
Preliminary Scholastic…	2
Woodcock Johnson Tests of…	2
Collegiate Assessment of…	1
Gates MacGinitie Reading Tests	1
General Educational…	1
International English…	1
Law School Admission Test	1
Massachusetts Comprehensive…	1
National Assessment of Adult…	1
Program for International…	1
More ▼

Kim, Sooyeon	20
Haberman, Shelby J.	18
Guo, Hongwen	16
von Davier, Alina A.	16
Moses, Tim	13
Dorans, Neil J.	12
Liu, Jinghua	11
Livingston, Samuel A.	11
Liu, Ou Lydia	9
Puhan, Gautam	9
Qian, Jiahe	9
Sinharay, Sandip	9
Attali, Yigal	8
Bridgeman, Brent	7
Holland, Paul	7
Lee, Yi-Hsuan	7
Lu, Ru	7
Walker, Michael E.	7
Haberman, Shelby	6
Klieger, David M.	6
Ling, Guangming	6
Steinberg, Jonathan	6
Tannenbaum, Richard J.	6
Zhang, Mo	6
Belur, Vinetha	5
More ▼