McGill, Ryan J.; Ward, Thomas J.; Canivez, Gary L. – School Psychology International, 2020
The Wechsler Intelligence Scale for Children (WISC) is the most widely used intelligence test in the world. Now in its fifth edition, the WISC-V has been translated and adapted for use in nearly a dozen countries. Despite its popularity, numerous concerns have been raised about some of the procedures used to develop and validate translated and…
Descriptors: Children, Intelligence Tests, Translation, Test Validity
Kettler, Ryan J. – School Psychology International, 2020
This article is a commentary on McGill et al.'s (2020) article "Use of Translated and Adapted Versions of the WISC-V: Caveat Emptor." McGill et al. use caveat emptor in their title to indicate that the buyer of an assessment must be careful about the product being purchased, presumably because the seller of the assessment is not being…
Descriptors: Children, Intelligence Tests, Translation, Test Reliability
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
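Note: the underestimation described in this abstract can be illustrated with the standard Kish approximation for cluster samples, deff = 1 + (m - 1) * rho. The sketch below is not taken from the article; the function names, the equal-cluster-size assumption, and the example values are all hypothetical.

```python
import math

def design_effect(cluster_size: float, icc: float) -> float:
    """Kish approximation: deff = 1 + (m - 1) * rho, assuming equal-sized clusters."""
    return 1.0 + (cluster_size - 1.0) * icc

def adjusted_standard_error(sd: float, n: int, cluster_size: float, icc: float) -> float:
    """Standard error of a mean, inflated from the simple-random-sampling value."""
    se_srs = sd / math.sqrt(n)  # the SE a naive analysis would report
    return se_srs * math.sqrt(design_effect(cluster_size, icc))

# Hypothetical values: 400 students in classrooms of 25, intraclass correlation 0.2
naive_se = 10.0 / math.sqrt(400)
clustered_se = adjusted_standard_error(sd=10.0, n=400, cluster_size=25, icc=0.2)
print(naive_se, clustered_se)
```

Under these assumed values the clustered standard error is more than twice the naive one, which is the kind of gap that goes unnoticed when the design effect is ignored.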

MacCann, Robert G. – Educational and Psychological Measurement, 1989
Levine's equations for random groups and unequally reliable tests can be used to equate two tests through performance on an anchor test. Levine's assumption of a parallelism requirement is not necessary; it is sufficient to assume only that the tests are congeneric, an assumption implicit in linear test equating. (SLD)
Descriptors: Equated Scores, Equations (Mathematics), Latent Trait Theory, Test Reliability
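Note: for orientation, the simplest form of linear equating maps a Form X score to the Form Y scale by matching means and standard deviations, y = mu_Y + (sigma_Y / sigma_X)(x - mu_X). The sketch below shows only this basic mean-sigma conversion for randomly equivalent groups. It is not Levine's anchor-test method, and the function name and score lists are hypothetical.

```python
from statistics import mean, pstdev

def linear_equate(x: float, form_x_scores: list, form_y_scores: list) -> float:
    """Place a Form X score on the Form Y scale by matching means and SDs."""
    mu_x, mu_y = mean(form_x_scores), mean(form_y_scores)
    sd_x, sd_y = pstdev(form_x_scores), pstdev(form_y_scores)
    return mu_y + (sd_y / sd_x) * (x - mu_x)

# Hypothetical score distributions from two randomly equivalent groups
form_x = [10, 12, 14, 16, 18]
form_y = [20, 24, 28, 32, 36]
print(linear_equate(16, form_x, form_y))
```

Anchor-test designs such as the one in this record replace the directly observed means and standard deviations with estimates derived from performance on the common items.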

Andrulis, Richard S.; And Others – Educational and Psychological Measurement, 1978
The effects of repeaters (testees included in both administrations of two forms of a test) on the test equating process are examined. It is shown that repeaters do affect test equating and tend to lower the cutoff point for passing the test. (JKS)
Descriptors: Cutting Scores, Equated Scores, Item Analysis, Scoring

Weiss, David J., Ed. – Applied Psychological Measurement, 1987
Issues concerning equating test scores are discussed in an introduction, four papers, and two commentaries. Equating methods research, sampling errors, linear equating, population differences, sources of equating errors, and a circular equating paradigm are considered. (SLD)
Descriptors: Equated Scores, Latent Trait Theory, Maximum Likelihood Statistics, Statistical Analysis
Yen, Wendy M. – 1982
Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…
Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods

Budescu, David – Journal of Educational Measurement, 1985
An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)
Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests
Andrulis, Richard S.; And Others – 1974
The purpose of this investigation was to establish the effects of repeaters on test equating. Since consideration was not given to repeaters in test equating, such as in the derivation of equations by Angoff (1971), the hypothetical effect needed to be established. A case study was examined which showed results on a test as expected; overall mean…
Descriptors: Cutting Scores, Equated Scores, Recall (Psychology), Retention (Psychology)

Holmes, Susan E. – Evaluation and the Health Professions, 1986
A specific application of test equating is described, namely that of credentialing examination programs in the health professions. Considered are: (1) the role of test equating in the credentialing process; and (2) the issues that must be considered when implementing test equating in a credentialing examination program. (Author/LMO)
Descriptors: Certification, Credentials, Data Collection, Equated Scores
Lenel, Julia C.; Gilmer, Jerry S. – 1986
In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…
Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)
Kahl, Stuart R. – 1995
Although few question the positive impacts alternative forms of assessment can have on instruction, concerns about the psychometric quality of data obtained from such assessments are taking their toll. Scoring issues are at the heart of many of these concerns. This paper addresses the causes of these concerns: misinformation about psychometric…
Descriptors: Alternative Assessment, Educational Assessment, Equated Scores, Performance Based Assessment
Wegner, Toni Giuliano; Ree, Malcolm James – 1985
In the late 1970s, the Department of Defense requested that the reference population for the Armed Services Vocational Aptitude Battery (ASVAB) be changed and updated to reflect the current youth population. Analyses of new data collected in 1980 indicated that speeded subtest scores of the new sample were atypically low and that the sample might…
Descriptors: Adults, Answer Sheets, Armed Forces, Data Analysis
Modu, Christopher C.; Stern, June – 1977
To assess the stability of the Scholastic Aptitude Test verbal score scale (SAT-V), 1963 and 1973 forms of the SAT-V were administered in counterbalanced order to spaced samples of the same group. The 1973 scores were placed on the reporting scale used for the 1963 form. The experimentally derived scores on the 1963 scale were compared with their…
Descriptors: College Bound Students, College Entrance Examinations, Educational Problems, Educational Trends
Legg, Sue M.; Algina, James – 1986
This paper focuses on the questions which arise as test practitioners monitor score scales derived from latent trait theory. Large scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…
Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores