Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 3 |
Descriptor
Difficulty Level | 9 |
Equated Scores | 9 |
Testing Programs | 9 |
Test Items | 6 |
Statistical Analysis | 4 |
Criterion Referenced Tests | 3 |
Elementary Secondary Education | 3 |
Item Analysis | 3 |
Test Construction | 3 |
Test Format | 3 |
Test Reliability | 3 |
More ▼ |
Author
Algina, James | 1 |
Bauer, Ernest A. | 1 |
Chen, Hanwei | 1 |
Cope, Ronald T. | 1 |
Cowell, William R. | 1 |
Cui, Zhongmin | 1 |
Gao, Xiaohong | 1 |
Goodman, Joshua | 1 |
Kubiak, Anna T. | 1 |
Legg, Sue M. | 1 |
Linn, Robert L. | 1 |
More ▼ |
Publication Type
Speeches/Meeting Papers | 6 |
Reports - Research | 5 |
Reports - Evaluative | 3 |
Numerical/Quantitative Data | 2 |
Journal Articles | 1 |
Education Level
Elementary Secondary Education | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Comprehensive Tests of Basic… | 1 |
What Works Clearinghouse Rating
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015
An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…
Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring
Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010
The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…
Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level
Meyers, Jason L.; Murphy, Stephen; Goodman, Joshua; Turhan, Ahmet – Pearson, 2012
Operational testing programs employing item response theory (IRT) applications benefit from of the property of item parameter invariance whereby item parameter estimates obtained from one sample can be applied to other samples (when the underlying assumptions are satisfied). In theory, this feature allows for applications such as computer-adaptive…
Descriptors: Equated Scores, Test Items, Test Format, Item Response Theory

Slinde, Jefferey A.; Linn, Robert L. – Journal of Educational Measurement, 1978
Use of the Rasch model for vertical equating of tests is discussed. Although use of the model is promising, empirical results raise questions about the adequacy of the Rasch model. Latent trait models with more parameters may be necessary. (JKS)
Descriptors: Achievement Tests, Difficulty Level, Equated Scores, Higher Education
Kubiak, Anna T.; Cowell, William R. – 1990
A procedure used to average several Mantel-Haenszel delta difference values for an item is described and evaluated. The differential item functioning (DIF) procedure used by the Educational Testing Service (ETS) is based on the Mantel-Haenszel statistical technique for studying matched groups. It is standard procedure at ETS to analyze test items…
Descriptors: Difficulty Level, Elementary Secondary Education, Equated Scores, Item Bias
Cope, Ronald T. – 1995
This paper deals with the problems that arise in performance assessment from the granularity that results from having a small number of tasks or prompts and raters of responses to these tasks or prompts. Two problems are discussed in detail: (1) achieving a satisfactory degree of reliability; and (2) equating or adjusting for differences of…
Descriptors: Difficulty Level, Educational Assessment, Equated Scores, High Stakes Tests
Legg, Sue M.; Algina, James – 1986
This paper focuses on the questions which arise as test practitioners monitor score scales derived from latent trait theory. Large scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…
Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores
Nassif, Paula M.; And Others – 1979
A procedure which employs a method of item substitution based on item difficulty is recommended for developing parallel criterion referenced test forms. This procedure is currently being used in the Florida functional literacy testing program and the Georgia teacher certification testing program. Reasons for developing parallel test forms involve…
Descriptors: Criterion Referenced Tests, Difficulty Level, Equated Scores, Functional Literacy

Bauer, Ernest A.; And Others – 1979
The reading portion of the Michigan Educational Assessment Program (MEAP) was equated to the reading comprehension subtest of the Comprehensive Tests of Basic Skills (CTBS) using the Rasch Model. Both tests were administered to 366 low achieving fourth grade students. MEAP was treated as both a 95-item test and a 19-item (number of objectives…
Descriptors: Academic Standards, Criterion Referenced Tests, Difficulty Level, Educational Objectives