Publication Date
| In 2026 | 0 |
| Since 2025 | 7 |
| Since 2022 (last 5 years) | 42 |
| Since 2017 (last 10 years) | 126 |
| Since 2007 (last 20 years) | 479 |
Descriptor
Source
Author
| Bianchini, John C. | 35 |
| von Davier, Alina A. | 34 |
| Dorans, Neil J. | 33 |
| Kolen, Michael J. | 31 |
| Loret, Peter G. | 31 |
| Kim, Sooyeon | 26 |
| Moses, Tim | 24 |
| Livingston, Samuel A. | 22 |
| Holland, Paul W. | 20 |
| Puhan, Gautam | 20 |
| Liu, Jinghua | 19 |
| More ▼ | |
Publication Type
Education Level
Location
| Canada | 9 |
| Australia | 8 |
| Florida | 8 |
| United Kingdom (England) | 8 |
| Netherlands | 7 |
| New York | 7 |
| United States | 7 |
| Israel | 6 |
| Turkey | 6 |
| United Kingdom | 6 |
| California | 5 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 12 |
| No Child Left Behind Act 2001 | 5 |
| Education Consolidation… | 3 |
| Hawkins Stafford Act 1988 | 1 |
| Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Moses, Tim; Yang, Wen-Ling; Wilson, Christine – Journal of Educational Measurement, 2007
This study explored the use of kernel equating for integrating and extending two procedures proposed for assessing item order effects in test forms that have been administered to randomly equivalent groups. When these procedures are used together, they can provide complementary information about the extent to which item order effects impact test…
Descriptors: Advanced Placement, Equated Scores, Test Items, Item Analysis
Black, Beth; Bramley, Tom – Research Papers in Education, 2008
A new judgemental method of equating raw scores on two tests, based on rank-ordering scripts from both tests, has been developed by Bramley. The rank-ordering method has potential application as a judgemental standard-maintaining mechanism, because given a mark on one test (e.g. the A grade boundary mark), the equivalent mark (i.e. at the same…
Descriptors: Foreign Countries, Equated Scores, Test Theory, Evaluative Thinking
Peer reviewedKrus, David J.; Krus, Patricia H. – Educational and Psychological Measurement, 1977
A description of a Fortran program for linear and area transformations of test scores with optional generation of symmetric tables of areas under the standard normal curve is presented. Also included is a historic note on the origin of McCall's T Scale. (Author/JKS)
Descriptors: Computer Programs, Equated Scores, Measurement Techniques
Peer reviewedKaskowitz, Gary S.; De Ayala, R. J. – Applied Psychological Measurement, 2001
Studied the effect of item parameter estimation for computation of linking coefficients for the test response function (TRF) linking/equating method. Simulation results showed that linking was more accurate when there was less error in the parameter estimates, and that 15 or 25 common items provided better results than 5 common items under both…
Descriptors: Equated Scores, Estimation (Mathematics), Responses, Simulation
Peer reviewedBaker, Frank B. – Applied Psychological Measurement, 1997
Examined the sampling distributions of equating coefficients produced by the characteristic curve method for tests using graded and nominal response scoring using simulated data. For both models and across all three equating situations, the sampling distributions were generally bell-shaped and peaked, and occasionally had a small degree of…
Descriptors: Equated Scores, Sampling, Simulation, Statistical Distributions
Kolen, Michael J. – Journal of Educational Measurement, 2004
The concept of invariance in equating and linking is traced from the 1950s to the present. A number of research studies that examined population invariance are reviewed. Theory and research suggest that linkings other than equatings are population dependent. Theory also indicates that equatings are population dependent, although when test forms…
Descriptors: Equated Scores, Evaluation Methods, Statistical Analysis
Ricker, Kathryn L.; von Davier, Alina A. – ETS Research Report Series, 2007
This study explored the effects of external anchor test length on final equating results of several equating methods, including equipercentile (frequency estimation), chained equipercentile, kernel equating (KE) poststratification PSE with optimal bandwidths, and KE PSE linear (large bandwidths) when using the nonequivalent groups anchor test…
Descriptors: Equated Scores, Test Items, Statistical Analysis, Test Length
Moses, Tim; Holland, Paul – ETS Research Report Series, 2007
The purpose of this study was to empirically evaluate the impact of loglinear presmoothing accuracy on equating bias and variability across chained and post-stratification equating methods, kernel and percentile-rank continuization methods, and sample sizes. The results of evaluating presmoothing on equating accuracy generally agreed with those of…
Descriptors: Equated Scores, Statistical Analysis, Accuracy, Sample Size
Nering, Michael L., Ed.; Ostini, Remo, Ed. – Routledge, Taylor & Francis Group, 2010
This comprehensive "Handbook" focuses on the most used polytomous item response theory (IRT) models. These models help us understand the interaction between examinees and test questions where the questions have various response categories. The book reviews all of the major models and includes discussions about how and where the models…
Descriptors: Guides, Item Response Theory, Test Items, Correlation
Girard, Todd A.; Christensen, Bruce K. – Psychological Assessment, 2008
The correlation between a short-form (SF) test and its full-scale (FS) counterpart is a mainstay in the evaluation of SF validity. However, in correcting for overlapping error variance in this measure, investigators have overattenuated the validity coefficient through an intuitive misapplication of P. Levy's (1967) formula. The authors of the…
Descriptors: Error of Measurement, Computation, Psychiatric Services, Correlation
Guilera, Georgina; Gomez, Juana – Advances in Health Sciences Education, 2008
In the context of health sciences education, and education in general, the knowledge or ability of one or several subjects in a specific area is frequently compared using different forms of a test, or by means of different instruments aimed at measuring this knowledge or ability. In such cases, test scores must be equated so that they can be…
Descriptors: Item Response Theory, Health Education, Science Education, Equated Scores
Schnohr, C. W.; Kreiner, S.; Due, E. P.; Currie, C.; Boyce, W.; Diderichsen, F. – Social Indicators Research, 2008
Methodology for making cross-national comparisons is an area of increasing interest in social and public health related research. When studying socio-economic differences in health outcomes cross-nationally, there are several methodological issues of concern, especially when data is derived from self-reported questionnaires. Health Behaviour in…
Descriptors: Test Bias, Public Health, Evaluation Methods, Social Indicators
Elosua, Paula; Lopez-Jauregui, Alicia – Journal of Experimental Education, 2008
The comparison of scores from linguistically different tests is a twofold matter: the adaptation of tests and the comparison of scores. These 2 aspects of measurement invariance intersect at the need to guarantee the psychometric equivalence between the original and adapted versions. In this study, the authors examined comparability in 2 stages.…
Descriptors: Psychometrics, Item Response Theory, Equated Scores, Comparative Analysis
von Davier, Alina A.; Wilson, Christine – Applied Psychological Measurement, 2008
Dorans and Holland (2000) and von Davier, Holland, and Thayer (2003) introduced measures of the degree to which an observed-score equating function is sensitive to the population on which it is computed. This article extends the findings of Dorans and Holland and of von Davier et al. to item response theory (IRT) true-score equating methods that…
Descriptors: Advanced Placement, Advanced Placement Programs, Equated Scores, Calculus
Yang, Wen-Ling; Gao, Rui – Applied Psychological Measurement, 2008
This study investigates whether the functions linking number-correct scores to the College-Level Examination Program (CLEP) scaled scores remain invariant over gender groups, using test data on the 16 testlet-based forms of the CLEP College Algebra exam. To be consistent with the operational practice, linking of various test forms to a common…
Descriptors: Mathematics Tests, Algebra, Item Response Theory, Testing Programs

Direct link
