Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 13 |
Since 2006 (last 20 years) | 61 |
Descriptor
Equated Scores | 117 |
Scores | 117 |
Test Items | 30 |
Comparative Analysis | 29 |
Item Response Theory | 28 |
Statistical Analysis | 27 |
Scaling | 21 |
Test Construction | 19 |
College Entrance Examinations | 17 |
Item Analysis | 17 |
Standardized Tests | 17 |
More ▼ |
Source
Author
Dorans, Neil J. | 11 |
Livingston, Samuel A. | 5 |
Moses, Tim | 4 |
von Davier, Alina A. | 4 |
Gallas, Edwin J. | 3 |
Haberman, Shelby J. | 3 |
Kim, Sooyeon | 3 |
Bramley, Tom | 2 |
Eignor, Daniel R. | 2 |
Gilmer, Jerry S. | 2 |
Holland, Paul W. | 2 |
More ▼ |
Publication Type
Education Level
Higher Education | 14 |
Postsecondary Education | 12 |
Elementary Secondary Education | 5 |
Elementary Education | 4 |
Secondary Education | 4 |
Middle Schools | 3 |
Grade 8 | 2 |
High Schools | 2 |
Early Childhood Education | 1 |
Grade 1 | 1 |
Grade 2 | 1 |
More ▼ |
Audience
Researchers | 6 |
Practitioners | 2 |
Students | 1 |
Location
Canada | 2 |
Delaware | 1 |
Florida | 1 |
Luxembourg | 1 |
Malawi | 1 |
New Jersey | 1 |
New York (New York) | 1 |
Oregon | 1 |
Saudi Arabia | 1 |
Texas | 1 |
United Kingdom | 1 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 3 |
No Child Left Behind Act 2001 | 3 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022
The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…
Descriptors: Equated Scores, Test Items, Scores, Probability
Wu, Tong – ProQuest LLC, 2023
This three-article dissertation aims to address three methodological challenges to ensure comparability in educational research, including scale linking, test equating, and propensity score (PS) weighting. The first study intends to improve test scale comparability by evaluating the effect of six missing data handling approaches, including…
Descriptors: Educational Research, Comparative Analysis, Equated Scores, Weighted Scores
Lee, Yi-Hsuan; Haberman, Shelby J.; Dorans, Neil J. – Journal of Educational Measurement, 2019
In many educational tests, both multiple-choice (MC) and constructed-response (CR) sections are used to measure different constructs. In many common cases, security concerns lead to the use of form-specific CR items that cannot be used for equating test scores, along with MC sections that can be linked to previous test forms via common items. In…
Descriptors: Scores, Multiple Choice Tests, Test Items, Responses
Trierweiler, Tammy J.; Lewis, Charles; Smith, Robert L. – Journal of Educational Measurement, 2016
In this study, we describe what factors influence the observed score correlation between an (external) anchor test and a total test. We show that the anchor to full-test observed score correlation is based on two components: the true score correlation between the anchor and total test, and the reliability of the anchor test. Findings using an…
Descriptors: Scores, Correlation, Tests, Test Reliability
Wang, Lu; Steedle, Jeffrey – ACT, Inc., 2020
In recent ACT mode comparability studies, students testing on laptop or desktop computers earned slightly higher scores on average than students who tested on paper, especially on the ACT® reading and English tests (Li et al., 2017). Equating procedures adjust for such "mode effects" to make ACT scores comparable regardless of testing…
Descriptors: Test Format, Reading Tests, Language Tests, English
Wiberg, Marie; González, Jorge – Journal of Educational Measurement, 2016
Equating methods make use of an appropriate transformation function to map the scores of one test form into the scale of another so that scores are comparable and can be used interchangeably. The equating literature shows that the ways of judging the success of an equating (i.e., the score transformation) might differ depending on the adopted…
Descriptors: Statistical Analysis, Equated Scores, Scores, Models
Camara, Wayne J.; Allen, Jeff – ACT, Inc., 2017
Students must choose when to take the ACT for the first time and if and when to retest. States and districts that administer the ACT test to all students must also choose when to administer the test. A key consideration in making these decisions is the impact on scores. Because the ACT is a curriculum-based test of academic achievement, students…
Descriptors: Scores, Time Perspective, Scheduling, Testing
Bramley, Tom – Cambridge Assessment, 2018
The aim of the research reported here was to get some idea of the accuracy of grade boundaries (cut-scores) obtained by applying the 'similar items method' described in Bramley & Wilson (2016). In this method experts identify items on the current version of a test that are sufficiently similar to items on previous versions for them to be…
Descriptors: Accuracy, Cutting Scores, Test Items, Item Analysis
Livingston, Samuel A.; Chen, Haiwen H. – ETS Research Report Series, 2015
Quantitative information about test score reliability can be presented in terms of the distribution of equated scores on an alternate form of the test for test takers with a given score on the form taken. In this paper, we describe a procedure for estimating that distribution, for any specified score on the test form taken, by estimating the joint…
Descriptors: Scores, Statistical Distributions, Research Reports, Equated Scores
Barr, Christopher D.; Reutebuch, Colleen K.; Carlson, Coleen D.; Vaughn, Sharon; Francis, David J. – Journal of Research on Educational Effectiveness, 2019
Beginning in 2002, researchers developed, implemented, and evaluated the efficacy of an English reading intervention for first-grade English learners using multiple randomized control trials (RCTs). As a result of this efficacy work, researchers successfully competed for an IES Goal 4 effectiveness study using the same intervention. Unlike the…
Descriptors: Intervention, English Language Learners, Grade 1, Elementary School Students
Winters, Marcus A. – Manhattan Institute for Policy Research, 2017
Critics of charter schools in New York City, America's largest school district, often allege that charters score better on standardized tests, on average, than traditional public schools because charters "cream-skim" (i.e., attract) the brightest, most motivated, students. Yet this accusation neglects the fact that not all traditional…
Descriptors: Charter Schools, Public Schools, School Effectiveness, Success
Northwest Evaluation Association, 2016
Northwest Evaluation Association™ (NWEA™) is committed to providing partners with useful tools to help make inferences from Measures of Academic Progress® (MAP®) interim assessment scores. One important tool is the concordance table between MAP and state summative assessments. Concordance tables have been used for decades to relate scores on…
Descriptors: Tables (Data), Benchmarking, Scoring Formulas, Scores
Dorans, Neil J. – ETS Research Report Series, 2014
Simulations are widely used. Simulations produce numbers that are deductive demonstrations of what a model says will happen.They produce numerical results that are consistent with the premises of the model used to generate the numbers. These simulated numerical results are not empirical data that address aspects of the world that lies outside the…
Descriptors: Simulation, Equated Scores, Scores, Scientific Methodology
Brennan, Robert L. – Measurement: Interdisciplinary Research and Perspectives, 2015
Koretz, in his article published in this issue, provides compelling arguments that the high stakes currently associated with accountability testing lead to behavioral changes in students, teachers, and other stakeholders that often have negative consequences, such as inflated scores. Koretz goes on to argue that these negative consequences require…
Descriptors: Accountability, High Stakes Tests, Behavior Change, Student Behavior
Ho, Andrew – Measurement: Interdisciplinary Research and Perspectives, 2013
In his thoughtful focus article, Haertel (this issue) pushes testing experts to broaden the scope of their validation efforts and to invite scholars from other disciplines to join them. He credits existing validation frameworks for helping the measurement community to identify incomplete or nonexistent validity arguments. However, he notes his…
Descriptors: Educational Testing, Scores, Test Use, Test Validity