Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 7 |
Descriptor
Equated Scores | 16 |
Statistical Analysis | 16 |
Testing Programs | 16 |
Test Items | 5 |
Comparative Analysis | 4 |
Difficulty Level | 4 |
Academic Achievement | 3 |
College Entrance Examinations | 3 |
Error of Measurement | 3 |
Latent Trait Theory | 3 |
Sample Size | 3 |
More ▼ |
Source
ETS Research Report Series | 3 |
ACT, Inc. | 1 |
Applied Measurement in… | 1 |
Journal of Educational… | 1 |
Journal of Educational and… | 1 |
Journal of Experimental… | 1 |
Psychometrika | 1 |
US Department of Education | 1 |
Author
Cope, Ronald T. | 2 |
Guo, Hongwen | 2 |
Haberman, Shelby | 2 |
Kim, Sooyeon | 2 |
von Davier, Alina A. | 2 |
Algina, James | 1 |
Angoff, William H. | 1 |
Baker, Jean | 1 |
Chen, Hanwei | 1 |
Cui, Zhongmin | 1 |
Dorans, Neil | 1 |
More ▼ |
Publication Type
Journal Articles | 8 |
Reports - Evaluative | 8 |
Reports - Research | 8 |
Speeches/Meeting Papers | 6 |
Numerical/Quantitative Data | 1 |
Education Level
Higher Education | 2 |
Postsecondary Education | 2 |
Audience
Researchers | 3 |
Location
Laws, Policies, & Programs
Assessments and Surveys
SAT (College Admission Test) | 2 |
Iowa Tests of Basic Skills | 1 |
National Assessment of… | 1 |
What Works Clearinghouse Rating
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015
An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…
Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
Guo, Hongwen; Liu, Jinghua; Dorans, Neil; Feigenbaum, Miriam – ETS Research Report Series, 2011
Maintaining score stability is crucial for an ongoing testing program that administers several tests per year over many years. One way to stall the drift of the score scale is to use an equating design with multiple links. In this study, we use the operational and experimental SAT® data collected from 44 administrations to investigate the effect…
Descriptors: Equated Scores, College Entrance Examinations, Reliability, Testing Programs
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Applied Measurement in Education, 2011
The synthetic function is a weighted average of the identity (the linking function for forms that are known to be completely parallel) and a traditional equating method. The purpose of the present study was to investigate the benefits of the synthetic function on small-sample equating using various real data sets gathered from different…
Descriptors: Testing Programs, Equated Scores, Investigations, Data Analysis
Guo, Hongwen – Psychometrika, 2010
After many equatings have been conducted in a testing program, equating errors can accumulate to a degree that is not negligible compared to the standard error of measurement. In this paper, the author investigates the asymptotic accumulative standard error of equating (ASEE) for linear equating methods, including chained linear, Tucker, and…
Descriptors: Testing Programs, Testing, Error of Measurement, Equated Scores
Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010
The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…
Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level
Haertel, Edward H. – US Department of Education, 2004
Large-scale testing programs often require multiple forms to maintain test security over time or to enable the measurement of change without repeating the identical questions. The comparability of scores across forms is consequential: Students are admitted to colleges based on their test scores, and the meaning of a given scale score one year …
Descriptors: Measurement, Testing Programs, Equated Scores, Test Use
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – ETS Research Report Series, 2006
This study addresses the sample error and linking bias that occur with small and unrepresentative samples in a non-equivalent groups anchor test (NEAT) design. We propose a linking method called the "synthetic function," which is a weighted average of the identity function (the trivial equating function for forms that are known to be…
Descriptors: Equated Scores, Sample Size, Test Items, Statistical Bias
Cope, Ronald T. – 1985
This study considers the use of repeaters when test equating. The subjects consist of five groups of applicants to a professional certification program. Each group comprises first time examinees and repeaters. The procedures include a common item linear equating with nonrandom groups, use of equating chains, and the use of total examinee group…
Descriptors: Certification, Equated Scores, Measurement Techniques, Postsecondary Education
Forster, Fred; Karr, Chad – 1988
A method for equating test scores between two standardized achievement testing programs was developed. The first test was the Survey of Basic Skills (SBS) published by Science Research Associates. The second was the Tests of Individual Performance (TIP) of the Portland Public Schools in Oregon. Scores reported in Rasch units (RIT) from the TIP…
Descriptors: Achievement Tests, Elementary Education, Elementary School Students, Equated Scores
Angoff, William H. – 1991
An attempt was made to evaluate the standard error of equating (at the mean of the scores) in an ongoing testing program. The interest in estimating the empirical standard error of equating is occasioned by some discomfort with the error normally reported for test scores. Data used for this evaluation came from the Admissions Testing Program of…
Descriptors: College Entrance Examinations, Equated Scores, Error of Measurement, High School Students

Waltman, Kristie K. – Journal of Educational Measurement, 1997
A socially moderated link was established between statewide achievement results and the National Assessment of Educational Progress (NAEP) by using the same achievement level descriptions in an Iowa Test of Basic Skills standard-setting and an NAEP standard setting study. A statistically moderated link was established through an equipercentile…
Descriptors: Academic Achievement, Achievement Tests, Equated Scores, National Surveys
Cope, Ronald T. – 1995
This paper deals with the problems that arise in performance assessment from the granularity that results from having a small number of tasks or prompts and raters of responses to these tasks or prompts. Two problems are discussed in detail: (1) achieving a satisfactory degree of reliability; and (2) equating or adjusting for differences of…
Descriptors: Difficulty Level, Educational Assessment, Equated Scores, High Stakes Tests
Baker, Jean; Wongbundhit, Yuwadee – 1984
This study was designed to illustrate the use of the Rasch model procedure to equate the Dade County Compensatory Education Skills Test (DCCEST) to the State Student Assessment Test (SSAT) for both mathematics skills and communications skills. The SSAT was a test developed by the Florida State Department of Education to measure students' level of…
Descriptors: Academic Achievement, Basic Skills, Compensatory Education, Criterion Referenced Tests
Legg, Sue M.; Algina, James – 1986
This paper focuses on the questions which arise as test practitioners monitor score scales derived from latent trait theory. Large scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…
Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores
Previous Page | Next Page »
Pages: 1 | 2