Publication Date
In 2025 | 4 |
Since 2024 | 9 |
Since 2021 (last 5 years) | 58 |
Since 2016 (last 10 years) | 147 |
Since 2006 (last 20 years) | 496 |
Descriptor
Source
Author
Bianchini, John C. | 35 |
von Davier, Alina A. | 34 |
Dorans, Neil J. | 33 |
Kolen, Michael J. | 31 |
Loret, Peter G. | 31 |
Kim, Sooyeon | 26 |
Moses, Tim | 24 |
Livingston, Samuel A. | 22 |
Holland, Paul W. | 20 |
Puhan, Gautam | 20 |
Liu, Jinghua | 19 |
More ▼ |
Publication Type
Education Level
Location
Canada | 9 |
Australia | 8 |
Florida | 8 |
United Kingdom (England) | 8 |
Netherlands | 7 |
New York | 7 |
United States | 7 |
Israel | 6 |
Turkey | 6 |
United Kingdom | 6 |
California | 5 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 12 |
No Child Left Behind Act 2001 | 5 |
Education Consolidation… | 3 |
Hawkins Stafford Act 1988 | 1 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |

Baker, Frank B. – Applied Psychological Measurement, 1992
The procedure of M.L. Stocking and F.M. Lord (1983) for computing equating coefficients for tests having dichotomously scored items is extended to the case of graded response items. A system of equations for obtaining the equating coefficients under the graded response model is derived. (SLD)
Descriptors: Equated Scores, Equations (Mathematics), Item Response Theory, Mathematical Models

Bolt, Daniel M. – Applied Measurement in Education, 1999
Examined whether the item response theory (IRT) true-score equating method is more adversely affected by the presence of multidimensionality than two conventional equating methods, linear and equipercentile equating. Results of two simulation studies suggest that the IRT method performs as well as the conventional methods when the correlation…
Descriptors: Correlation, Equated Scores, Item Response Theory, Simulation

Tsai, Tsung-Hsun; Hanson, Bradley A.; Kolen, Michael J.; Forsyth, Robert A. – Applied Measurement in Education, 2001
Compared bootstrap standard errors of five item response theory (IRT) equating methods for the common-item nonequivalent groups design using test results for 1,493 and 1,793 examinees taking a professional certification test. Results suggest that standard errors of equating less than 0.1 standard deviation units could be obtained with any of the…
Descriptors: Equated Scores, Error of Measurement, Item Response Theory, Licensing Examinations (Professions)
Kim, Dong-In; Brennan, Robert; Kolen, Michael – Journal of Educational Measurement, 2005
Four equating methods (3PL true score equating, 3PL observed score equating, beta 4 true score equating, and beta 4 observed score equating) were compared using four equating criteria: first-order equity (FOE), second-order equity (SOE), conditional-mean-squared-error (CMSE) difference, and the equi-percentile equating property. True score…
Descriptors: True Scores, Psychometrics, Equated Scores, Item Response Theory
Yin, Ping; Brennan, Robert L.; Kolen, Michael J. – Applied Psychological Measurement, 2004
The use of educational tests in practical situations often requires that tests that are built to different content and/or statistical specifications be statistically linked. When equating methods are applied to tests that differ in content, difficulty, and/or reliability, the resulting scores cannot be used interchangeably. This study examined…
Descriptors: Indexes, Equated Scores, Standardized Tests, Achievement Tests
Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006
It is a widely held belief that an anchor test used in equating should be a miniature version (or "minitest") of the tests to be equated; that is, the anchor test should be proportionally representative of the two tests in content and statistical characteristics. This paper examines the scientific foundation of this belief, especially…
Descriptors: Test Items, Equated Scores, Correlation, Tests
Michaelides, Michalis P. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2006
Consistent behavior is a desirable characteristic that common items are expected to have when administered to different groups. Findings from the literature have established that items do not always behave in consistent ways; item indices and IRT item parameter estimates of the same items differ when obtained from different administrations.…
Descriptors: Equated Scores, Test Items, Item Response Theory, Evaluation Methods
Buras, Avery – 1996
The logic and uses of test equating are discussed, including three methods of test equating. The focus is on the conceptual underpinnings of each test equating method, rather than on the mathematics of the procedures. Additional consideration is given to the assumptions of each method and its respective strengths and weaknesses. A commonly…
Descriptors: Equated Scores, Item Response Theory, Models, Raw Scores
Lawrence, Ida M.; Dorans, Neil J. – 1988
This paper addresses the sample invariant properties of four equating methods (Tucker and Levine linear equating, equipercentile equating through an anchor test, and three-parameter item response theory equating). Data from several national administrations of the Scholastic Aptitude Test served as the source of data for the study. Equating results…
Descriptors: Ability, College Entrance Examinations, Comparative Analysis, Equated Scores
Holland, Paul W.; Thayer, Dorothy T. – 1989
A new and unified approach to test equating is described that is based on log-linear models for smoothing score distributions and on the kernel method of nonparametric density estimation. The new method contains both linear and standard equipercentile methods as special cases and can handle several important equating data collection designs. An…
Descriptors: Data Collection, Equated Scores, Estimation (Mathematics), Nonparametric Statistics
Wightman, Linda F.; Wightman, Lawrence E. – 1988
Section Pre-Equating (SPE) is a method used to equate test forms that consist of multiple separately timed sections. SPE does not require examinees to take two complete forms of the test. Instead, all of the old form and one or two sections of the new form are administered to each examinee, and missing data techniques are employed to estimate the…
Descriptors: Comparative Analysis, Correlation, Equated Scores, Estimation (Mathematics)
Schratz, Mary K. – 1984
To explore the appropriateness of the Rasch model for the vertical equating of a multi-level, multi-form achievement test series, both the Rasch model and the traditional Thurstone procedures were applied to the Listening Comprehension subtest scores of the Stanford Achievement Test. Two adjacent levels of these tests were administered in 1981 to…
Descriptors: Achievement Tests, Elementary Secondary Education, Equated Scores, Latent Trait Theory
Reed, Richard – 1981
The Los Angeles Unified School District has been concerned over the excess testing of elementary school pupils, particularly those who are in special programs. When the Board of Education decided to have yearly testing of pupil progress using a locally developed criterion-referenced test, the intention was to replace a previously used…
Descriptors: Elementary Education, Equated Scores, Reading Achievement, School Districts
Kolen, Michael J.; Whitney, Douglas R. – 1981
Procedures used to compare the results from item response theory as well as more traditional equating methods were described and critically analyzed. The implications of the comparison of equipercentile, linear, one-parameter (Rasch), and three-parameter methods for equating twelve forms of each of the five tests of General Educational Development…
Descriptors: Adults, Equated Scores, Equivalency Tests, Factor Analysis

Brennan, Robert L.; And Others – Applied Psychological Measurement, 1988
Seven papers on technical and practical issues in equating are presented. Problems related to the use of conventional and item response theory equating methods, using pre- and post-smoothing to increase equipercentile equating's precision, and linear equating models for common-item nonequivalent-population design are discussed. (SLD)
Descriptors: Equated Scores, Latent Trait Theory, Research Problems, Scaling