Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 10 |
Descriptor
Difficulty Level | 10 |
Simulation | 10 |
Test Items | 7 |
Correlation | 4 |
Equated Scores | 3 |
Scores | 3 |
Statistical Analysis | 3 |
Comparative Analysis | 2 |
Error of Measurement | 2 |
Item Analysis | 2 |
Item Response Theory | 2 |
More ▼ |
Source
ETS Research Report Series | 10 |
Author
Guo, Hongwen | 2 |
Holland, Paul | 2 |
Sinharay, Sandip | 2 |
Andrews-Todd, Jessica | 1 |
Attali, Yigal | 1 |
Chamberlain, John | 1 |
Cohen, Andrew D. | 1 |
Dorans, Neil J. | 1 |
Forsyth, Carolyn | 1 |
Horwitz, Paul | 1 |
Isham, Steven | 1 |
More ▼ |
Publication Type
Journal Articles | 10 |
Reports - Research | 9 |
Numerical/Quantitative Data | 1 |
Reports - Evaluative | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 2 |
Postsecondary Education | 2 |
Two Year Colleges | 1 |
Audience
Location
Minnesota | 1 |
Laws, Policies, & Programs
Assessments and Surveys
SAT (College Admission Test) | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
John M. Norris; Shoko Sasayama; Michelle Kim – ETS Research Report Series, 2023
Accomplishing a communication task in the real world requires the ability not only to do the task per se but also to manage aspects of the context in which it occurs. For this reason, simulations of target language use contexts have been incorporated into the design of communicative language tests as a way of enhancing the authenticity of…
Descriptors: Electronic Mail, Writing (Composition), Task Analysis, Student Evaluation
Lu, Ru; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2021
Two families of analysis methods can be used for differential item functioning (DIF) analysis. One family is DIF analysis based on observed scores, such as the Mantel-Haenszel (MH) and the standardized proportion-correct metric for DIF procedures; the other is analysis based on latent ability, in which the statistic is a measure of departure from…
Descriptors: Robustness (Statistics), Weighted Scores, Test Items, Item Analysis
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick – ETS Research Report Series, 2018
For a multiple-choice test under development or redesign, it is important to choose the optimal number of options per item so that the test possesses the desired psychometric properties. On the basis of available data for a multiple-choice assessment with 8 options, we evaluated the effects of changing the number of options on test properties…
Descriptors: Multiple Choice Tests, Test Items, Simulation, Test Construction
Steinberg, Jonathan; Andrews-Todd, Jessica; Forsyth, Carolyn; Chamberlain, John; Horwitz, Paul; Koon, Al; Rupp, Andre; McCulla, Laura – ETS Research Report Series, 2020
This study discusses the development of a basic electronics knowledge (BEK) assessment as a pretest activity for undergraduate students in engineering and related fields. The 28 BEK items represent 12 key concepts, including properties of serial circuits, knowledge of electrical laws (e.g., Kirchhoff 's and Ohm's laws), and properties of digital…
Descriptors: Knowledge Level, Skill Development, Psychometrics, Student Evaluation
Attali, Yigal; Saldivia, Luis; Jackson, Carol; Schuppan, Fred; Wanamaker, Wilbur – ETS Research Report Series, 2014
Previous investigations of the ability of content experts and test developers to estimate item difficulty have, for themost part, produced disappointing results. These investigations were based on a noncomparative method of independently rating the difficulty of items. In this article, we argue that, by eliciting comparative judgments of…
Descriptors: Test Items, Difficulty Level, Comparative Analysis, College Entrance Examinations
Zwick, Rebecca; Ye, Lei; Isham, Steven – ETS Research Report Series, 2013
Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. Although it is often assumed that refinement of the matching criterion always provides more accurate DIF results, the actual situation proves to be more complex. To explore the effectiveness of refinement, we…
Descriptors: Test Bias, Statistical Analysis, Simulation, Educational Testing
Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2014
The purpose of this study was to investigate the potential impact of misrouting under a 2-stage multistage test (MST) design, which includes 1 routing and 3 second-stage modules. Simulations were used to create a situation in which a large group of examinees took each of the 3 possible MST paths (high, middle, and low). We compared differences in…
Descriptors: Comparative Analysis, Difficulty Level, Scores, Test Wiseness
Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006
It is a widely held belief that an anchor test used in equating should be a miniature version (or "minitest") of the tests to be equated; that is, the anchor test should be proportionally representative of the two tests in content and statistical characteristics. This paper examines the scientific foundation of this belief, especially…
Descriptors: Test Items, Equated Scores, Correlation, Tests
Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006
It is a widely held belief that anchor tests should be miniature versions (i.e., minitests), with respect to content and statistical characteristics of the tests being equated. This paper examines the foundations for this belief. It examines the requirement of statistical representativeness of anchor tests that are content representative. The…
Descriptors: Test Items, Equated Scores, Evaluation Methods, Difficulty Level
Strategies in Responding to the New TOEFL Reading Tasks. TOEFL Monograph Series. MS-33. ETS RR-06-06
Cohen, Andrew D.; Upton, Thomas A. – ETS Research Report Series, 2006
This study describes the reading and test-taking strategies that test takers used in the Reading section of the LanguEdge courseware (ETS, 2002a). These materials were developed to familiarize prospective respondents with the new TOEFL®. The investigation focused on strategies used to respond to more traditional single selection multiple-choice…
Descriptors: Reading Tests, Test Items, Courseware, Item Analysis