Publication Date
| In 2026 | 0 |
| Since 2025 | 7 |
| Since 2022 (last 5 years) | 42 |
| Since 2017 (last 10 years) | 126 |
| Since 2007 (last 20 years) | 479 |
Descriptor
Source
Author
| Bianchini, John C. | 35 |
| von Davier, Alina A. | 34 |
| Dorans, Neil J. | 33 |
| Kolen, Michael J. | 31 |
| Loret, Peter G. | 31 |
| Kim, Sooyeon | 26 |
| Moses, Tim | 24 |
| Livingston, Samuel A. | 22 |
| Holland, Paul W. | 20 |
| Puhan, Gautam | 20 |
| Liu, Jinghua | 19 |
| More ▼ | |
Publication Type
Education Level
Location
| Canada | 9 |
| Australia | 8 |
| Florida | 8 |
| United Kingdom (England) | 8 |
| Netherlands | 7 |
| New York | 7 |
| United States | 7 |
| Israel | 6 |
| Turkey | 6 |
| United Kingdom | 6 |
| California | 5 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 12 |
| No Child Left Behind Act 2001 | 5 |
| Education Consolidation… | 3 |
| Hawkins Stafford Act 1988 | 1 |
| Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Peer reviewedBaker, Frank B. – Applied Psychological Measurement, 1996
Using the characteristic curve method for dichotomously scored test items, the sampling distributions of equating coefficients were examined. Simulations indicate that for the equating conditions studied, the sampling distributions of the equating coefficients appear to have acceptable characteristics, suggesting confidence in the values obtained…
Descriptors: Equated Scores, Item Response Theory, Sampling, Statistical Distributions
Peer reviewedErcikan, Kadriye – Applied Measurement in Education, 1997
Linking scores from the National Assessment of Educational Progress (NAEP) to statewide test results was studied. Results based on an equipercentile procedure suggest that such a link does not provide precise information. Information from a linking study should be limited to rough estimates of students in each NAEP achievement level. (SLD)
Descriptors: Equated Scores, Estimation (Mathematics), National Surveys, State Programs
Peer reviewedHanson, Bradley A. – Applied Psychological Measurement, 1991
Log-linear model bivariate smoothing and a bivariate smoothing model based on the four-parameter beta binomial model were compared for usefulness in frequency estimation common-item equipercentile equating using two datasets. The performance of smoothed equipercentile methods was also compared to that of linear methods of common-item equating.…
Descriptors: Comparative Analysis, Equated Scores, Equations (Mathematics), Estimation (Mathematics)
Peer reviewedHanson, Bradley A. – Journal of Educational Statistics, 1991
The formula developed by R. Levine (1955) for equating unequally reliable tests is described. The formula can be interpreted as a method of moments estimate of an equating function that results in first order equity of the equated test score under a classical congeneric model. (TJH)
Descriptors: Equated Scores, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
Peer reviewedDorans, Neil J.; Lawrence, Ida M. – Applied Measurement in Education, 1990
A procedure for checking the score equivalence of nearly identical editions of a test is described and illustrated with Scholastic Aptitude Test data. The procedure uses the standard error of equating and uses graphical representation of score conversion deviations from the identity function in standard error units. (SLD)
Descriptors: Equated Scores, Grade Equivalent Scores, Scores, Statistical Analysis
Peer reviewedBaker, Frank B. – Applied Psychological Measurement, 1993
A procedure was developed for finding equating coefficients of the linear transformation of the metric of one test to that of another when nominally scored. Empirical results indicate that tests scored under a nominal response model can be placed on a common metric in horizontal and vertical equating. (SLD)
Descriptors: Equated Scores, Equations (Mathematics), Item Response Theory, Scoring
Peer reviewedHambleton, Ronald K. – Applied Psychological Measurement, 2000
Introduces the articles of this theme issue focusing on performance assessment methodology. Papers address: (1) merging item formats; (2) scoring models; (3) equating and linking; (4) generalizability theory; (5) standard setting methods; and (6) validity issues and methods. (SLD)
Descriptors: Equated Scores, Evaluation Methods, Generalizability Theory, Performance Based Assessment
Peer reviewedWolfe, Edward W. – Journal of Applied Measurement, 2000
Describes Rasch measurement procedures for equating multiple test forms or calibrating an item bank. The procedures entail: (1) selecting a data collection design; (2) estimating parameters; (3) transforming the parameters to a common scale; and (4) evaluating the quality of the linkage between the forms. (SLD)
Descriptors: Equated Scores, Estimation (Mathematics), Item Banks, Item Response Theory
Peer reviewedLee, Guemin; Kolen, Michael J.; Frisbie, David A.; Ankenmann, Robert D. – Applied Psychological Measurement, 2001
Compared performance of two polytomous item response theory models to that of the dichotomous three-parameter logistic model in equating tests composed of testlets using data from 6 tests of the Iowa Tests of Basic Skills (samples of 537 to 680 eighth graders). Results of the equating method based on polytomous models produced results that more…
Descriptors: Equated Scores, Item Response Theory, Junior High School Students, Junior High Schools
Petersen, Nancy S. – Applied Psychological Measurement, 2008
This article discusses the five studies included in this issue. Each article addressed the same topic, population invariance of equating. They all used data from major standardized testing programs, and they all used essentially the same statistics to evaluate their results, namely, the root mean square difference and root expected mean square…
Descriptors: Testing Programs, Standardized Tests, Equated Scores, Evaluation Methods
von Davier, Alina A.; Holland, Paul W.; Livingston, Samuel A.; Casabianca, Jodi; Grant, Mary C.; Martin, Kathleen – ETS Research Report Series, 2006
This study examines how closely the kernel equating (KE) method (von Davier, Holland, & Thayer, 2004a) approximates the results of other observed-score equating methods--equipercentile and linear equatings. The study used pseudotests constructed of item responses from a real test to simulate three equating designs: an equivalent groups (EG)…
Descriptors: Equated Scores, Statistical Analysis, Simulation, Tests
Mao, Xia; von Davier, Alina A.; Rupp, Stacie – ETS Research Report Series, 2006
Kernel equating (KE) is a new approach to observed-score equating and is described in detail in von Davier, Holland, and Thayer (2004b). Over the past months, several evaluation studies of KE have been designed and carried out. In this part of the overall evaluation study, we compared the KE method with other equating methods using real data from…
Descriptors: Licensing Examinations (Professions), Teacher Certification, Equated Scores, Statistical Analysis
Henning, Grant – 1992
The psychometric characteristics of the Test of Written English (TWE) rating scale were explored. Rasch model scalar analysis methodology was employed with more than 4,000 scored essays across 2 elicitation prompts to gather information about the rating scale and rating process. Results suggested that the intervals between TWE scale steps were…
Descriptors: English (Second Language), Equated Scores, Essays, Interrater Reliability
Lindman, Erick L. – 1968
A suggested technique for analyzing distributions of test scores compares distributions of scores made by groups of pupils on standard tests with distributions made by other groups of students on the same tests. By identifying the percents of student scores which must be shifted to an adjacent cell (interval) to make the two distributions exactly…
Descriptors: Educational Testing, Equated Scores, Program Evaluation, Research Methodology
Jaeger, Richard M. – 1974
Two relatively new tools for analysis of data compiled in evaluation studies are presented. The National Test-Equating Study in Reading, known as the Anchor Test Study, produced tables of score-correspondence between the eight reading comprehension and vocabulary tests most widely used in the United States. Two types of tables from this report…
Descriptors: Data Analysis, Equated Scores, Evaluation Methods, Raw Scores

Direct link
