Publication Date
  In 2025: 0
  Since 2024: 0
  Since 2021 (last 5 years): 5
  Since 2016 (last 10 years): 17
  Since 2006 (last 20 years): 34
Source
  ETS Research Report Series: 40
Author
  Guo, Hongwen: 5
  Deane, Paul: 3
  Dorans, Neil J.: 3
  Futagi, Yoko: 3
  Graf, Edith Aurora: 3
  Holland, Paul: 3
  Kim, Sooyeon: 3
  Sinharay, Sandip: 3
  Attali, Yigal: 2
  Bejar, Isaac I.: 2
  Higgins, Derrick: 2
Publication Type
  Journal Articles: 40
  Reports - Research: 39
  Tests/Questionnaires: 4
  Numerical/Quantitative Data: 1
  Reports - Evaluative: 1
Education Level
  Higher Education: 12
  Postsecondary Education: 11
  Elementary Education: 5
  Secondary Education: 5
  Grade 8: 4
  Junior High Schools: 3
  Middle Schools: 3
  Grade 7: 2
  Grade 4: 1
  Grade 5: 1
  Grade 6: 1
Location
  Alabama: 1
  Arizona: 1
  Arkansas: 1
  California: 1
  Canada: 1
  Connecticut: 1
  France: 1
  Georgia: 1
  Greece: 1
  Idaho: 1
  Illinois: 1
Assessments and Surveys
  Graduate Record Examinations: 7
  SAT (College Admission Test): 3
  Test of English as a Foreign…: 3
  National Assessment of…: 1
  Praxis Series: 1
  Stanford Achievement Tests: 1
  Trends in International…: 1
Guo, Hongwen; Rios, Joseph A.; Ling, Guangming; Wang, Zhen; Gu, Lin; Yang, Zhitong; Liu, Lydia O. – ETS Research Report Series, 2022
Different variants of the selected-response (SR) item type have been developed for various reasons (e.g., simulating realistic situations or examining critical-thinking and/or problem-solving skills). Generally, variants of the SR item format are more complex than traditional multiple-choice (MC) items, which may be more challenging to test…
Descriptors: Test Format, Test Wiseness, Test Items, Item Response Theory
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
Guo, Hongwen; Ercikan, Kadriye – ETS Research Report Series, 2021
In this report, we demonstrate use of differential response time (DRT) methodology, an extension of differential item functioning methodology, for examining differences in how students from different backgrounds engage with assessment tasks. We analyze response time data from a digitally delivered mathematics assessment to examine timing…
Descriptors: Test Wiseness, English Language Learners, Reaction Time, Mathematics Tests
Choi, Ikkyu; Zu, Jiyun – ETS Research Report Series, 2022
Synthetically generated speech (SGS) has become an integral part of our oral communication in a wide variety of contexts. It can be generated instantly at a low cost and allows precise control over multiple aspects of output, all of which can be highly appealing to second language (L2) assessment developers who have traditionally relied upon human…
Descriptors: Test Wiseness, Multiple Choice Tests, Test Items, Difficulty Level
Lu, Ru; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2021
Two families of analysis methods can be used for differential item functioning (DIF) analysis. One family is DIF analysis based on observed scores, such as the Mantel-Haenszel (MH) and the standardized proportion-correct metric for DIF procedures; the other is analysis based on latent ability, in which the statistic is a measure of departure from…
Descriptors: Robustness (Statistics), Weighted Scores, Test Items, Item Analysis
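The observed-score family named in this abstract can be illustrated with a minimal sketch of the Mantel-Haenszel procedure on ETS's delta scale; this is not the report's code, and the function name and counts below are hypothetical.

```python
# Illustrative Mantel-Haenszel DIF sketch for one item. Examinees are
# stratified by matched total score; each stratum contributes a 2x2 table:
# A (reference correct), B (reference incorrect), C (focal correct),
# D (focal incorrect). MH D-DIF = -2.35 * ln(alpha_MH), the ETS delta metric.
import math

def mh_ddif(strata):
    """strata: list of (A, B, C, D) counts per matched score level."""
    num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    alpha_mh = num / den               # common odds ratio across strata
    return -2.35 * math.log(alpha_mh)  # negative values favor the reference group

# Hypothetical counts: the reference group does slightly better at each level.
tables = [(40, 10, 30, 20), (60, 5, 50, 15), (80, 2, 70, 10)]
ddif = mh_ddif(tables)
```

A negative MH D-DIF here signals that, conditional on matched score, the item is harder for the focal group.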
Carlson, James E. – ETS Research Report Series, 2017
In this paper, I consider a set of test items that are located along a curved line in a multidimensional space, S_M, and can be scaled unidimensionally. Furthermore, I demonstrate a case in which the test items are administered across 6 levels, such as occurs in K-12 assessment across 6 grade…
Descriptors: Test Items, Item Response Theory, Difficulty Level, Scoring
Kim, Sooyeon; Lu, Ru – ETS Research Report Series, 2018
The purpose of this study was to evaluate the effectiveness of linking test scores by using test takers' background data to form pseudo-equivalent groups (PEG) of test takers. Using 4 operational test forms that each included 100 items and were taken by more than 30,000 test takers, we created 2 half-length research forms that had either 20…
Descriptors: Test Items, Item Banks, Difficulty Level, Comparative Analysis
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick – ETS Research Report Series, 2018
For a multiple-choice test under development or redesign, it is important to choose the optimal number of options per item so that the test possesses the desired psychometric properties. On the basis of available data for a multiple-choice assessment with 8 options, we evaluated the effects of changing the number of options on test properties…
Descriptors: Multiple Choice Tests, Test Items, Simulation, Test Construction
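One way the number of options affects an item's psychometric properties can be sketched under a common modeling assumption (not a claim about this report's method): in a 3PL item response model, random guessing fixes the lower asymptote at c = 1/k for k options, so collapsing options raises chance-level performance. The function and values below are hypothetical.

```python
# Illustrative 3PL sketch: probability of a correct response as a function
# of ability (theta) and option count (k), assuming guessing c = 1/k.
import math

def p_correct(theta, a=1.0, b=0.0, k=4):
    c = 1.0 / k  # lower asymptote implied by random guessing over k options
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# Same low-ability examinee, same item, different option counts:
low_8 = p_correct(theta=-3.0, k=8)  # 8-option version, near chance level
low_4 = p_correct(theta=-3.0, k=4)  # 4-option version, higher chance level
```

Fewer options means more score noise from guessing at the low end, which is one reason option-count changes must be evaluated against the test's intended properties.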
Lopez, Alexis A.; Tolentino, Florencia – ETS Research Report Series, 2020
In this study we investigated how English learners (ELs) interacted with "®" summative English language arts (ELA) and mathematics items, the embedded online tools, and accessibility features. We focused on how EL students navigated the assessment items; how they selected or constructed their responses; how they interacted with the…
Descriptors: English Language Learners, Student Evaluation, Language Arts, Summative Evaluation
van Rijn, Peter; Graf, Edith Aurora; Arieli-Attali, Meirav; Song, Yi – ETS Research Report Series, 2018
In this study, we explored the extent to which teachers agree on the ordering and separation of levels of two different learning progressions (LPs) in English language arts (ELA) and mathematics. In a panel meeting akin to a standard-setting procedure, we asked teachers to link the items and responses of summative educational assessments to LP…
Descriptors: Teacher Attitudes, Student Evaluation, Summative Evaluation, Language Arts
Mikeska, Jamie N.; Kurzum, Christopher; Steinberg, Jonathan H.; Xu, Jun – ETS Research Report Series, 2018
The purpose of this report is to examine the performance of assessment items designed to measure elementary teachers' content knowledge for teaching (CKT) science as part of the ETS® Educator Series. The Elementary Education: CKT Science assessment is one component of the licensure examination through the PRAXIS® assessments. The Elementary Education:…
Descriptors: Elementary School Teachers, Pedagogical Content Knowledge, Elementary School Science, Preservice Teachers
Mikeska, Jamie N.; Phelps, Geoffrey; Croft, Andrew J. – ETS Research Report Series, 2017
This report describes efforts by a group of science teachers, teacher educators, researchers, and content specialists to conceptualize, develop, and pilot practice-based assessment items designed to measure elementary science teachers' content knowledge for teaching (CKT). The report documents the framework used to specify the content-specific…
Descriptors: Elementary School Teachers, Science Teachers, Knowledge Base for Teaching, Test Items
Rahman, Taslima; Mislevy, Robert J. – ETS Research Report Series, 2017
To demonstrate how methodologies for assessing reading comprehension can grow out of views of the construct suggested in the reading research literature, we constructed tasks and carried out psychometric analyses that were framed in accordance with 2 leading reading models. In estimating item difficulty and subsequently, examinee proficiency, an…
Descriptors: Reading Tests, Reading Comprehension, Psychometrics, Test Items
Ali, Usama S.; Walker, Michael E. – ETS Research Report Series, 2014
Two methods are currently in use at Educational Testing Service (ETS) for equating observed item difficulty statistics. The first method involves the linear equating of item statistics in an observed sample to reference statistics on the same items. The second method, or the item response curve (IRC) method, involves the summation of conditional…
Descriptors: Difficulty Level, Test Items, Equated Scores, Causal Models
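The first method described, linear equating of item statistics, can be sketched as a mean-sigma transformation that places new-sample difficulty values on the reference scale; this is a sketch under that assumption, not ETS's operational code, and the names and values are hypothetical.

```python
# Illustrative mean-sigma linear equating of item difficulty statistics:
# new-sample values x map to the reference scale via y = A*x + B, where
# A matches the spreads and B matches the means of the two sets.
from statistics import mean, pstdev

def linear_equate(new_stats, ref_stats):
    A = pstdev(ref_stats) / pstdev(new_stats)
    B = mean(ref_stats) - A * mean(new_stats)
    return [A * x + B for x in new_stats]

new = [9.0, 10.5, 12.0, 13.5]   # difficulty values observed in the new sample
ref = [10.0, 11.0, 13.0, 14.0]  # reference-scale values for the same items
eq = linear_equate(new, ref)
```

By construction the equated values reproduce the reference mean and standard deviation, which is the defining property of this family of linear methods.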
Attali, Yigal; Saldivia, Luis; Jackson, Carol; Schuppan, Fred; Wanamaker, Wilbur – ETS Research Report Series, 2014
Previous investigations of the ability of content experts and test developers to estimate item difficulty have, for the most part, produced disappointing results. These investigations were based on a noncomparative method of independently rating the difficulty of items. In this article, we argue that, by eliciting comparative judgments of…
Descriptors: Test Items, Difficulty Level, Comparative Analysis, College Entrance Examinations
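The comparative-judgment approach this abstract argues for is commonly scaled with a Bradley-Terry model; the sketch below fits one with standard minorization-maximization updates. This is an assumption about how such judgments could be analyzed, not the article's method, and the data are hypothetical.

```python
# Illustrative Bradley-Terry fit for "which item is harder?" judgments.
# wins[i][j] = number of times item i was judged harder than item j.
import math

def bradley_terry(wins, n_items, iters=200):
    p = [1.0] * n_items
    for _ in range(iters):
        for i in range(n_items):
            total_wins = sum(wins[i][j] for j in range(n_items) if j != i)
            denom = sum((wins[i][j] + wins[j][i]) / (p[i] + p[j])
                        for j in range(n_items) if j != i)
            p[i] = total_wins / denom          # MM update for item strength
        s = sum(p)
        p = [x * n_items / s for x in p]       # fix the scale each pass
    return [math.log(x) for x in p]            # log-strengths as difficulties

# Three hypothetical items; item 2 is consistently judged hardest.
wins = [[0, 2, 1],
        [8, 0, 3],
        [9, 7, 0]]
diff = bradley_terry(wins, 3)
```

The recovered ordering of the log-strengths mirrors the judges' pairwise preferences, which is what makes comparative elicitation attractive when absolute ratings are unreliable.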