Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 1 |
Descriptor
Author
Allen, Nancy L. | 1 |
Drahozal, Edward C. | 1 |
Hanson, Bradley A. | 1 |
Miller, Timothy R. | 1 |
Spray, Judith A. | 1 |
van der Linden, Wim J. | 1 |
van der Ven, A. H. G. S. | 1 |
Publication Type
Reports - Evaluative | 6 |
Journal Articles | 3 |
Speeches/Meeting Papers | 3 |
Reports - Research | 1 |
Education Level
Audience
Location
Netherlands | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Iowa Tests of Basic Skills | 1 |
Stanford Achievement Tests | 1 |
What Works Clearinghouse Rating
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2019
Lord's (1980) equity theorem claims observed-score equating to be possible only when two test forms are perfectly reliable or strictly parallel. An analysis of its proof reveals use of an incorrect statistical assumption. The assumption does not invalidate the theorem itself though, which can be shown to follow directly from the discrete nature of…
Descriptors: Equated Scores, Testing Problems, Item Response Theory, Evaluation Methods

Hanson, Bradley A. – Applied Measurement in Education, 1996
Determining whether score distributions differ on two or more test forms administered to samples of examinees from a single population is explored using three statistical tests using loglinear models. Examples are presented of applying tests of distribution differences to decide if equating is needed for alternative forms of a test. (SLD)
Descriptors: Equated Scores, Scoring, Statistical Distributions, Test Format
Allen, Nancy L.; And Others – 1992
Many testing programs include a section of optional questions in addition to mandatory parts of a test. These optional parts of a test are not often truly parallel to one another, and groups of examinees selecting each optional test section are not equivalent to one another. This paper provides a general method based on missing-data methods for…
Descriptors: Comparative Testing, Estimation (Mathematics), Graphs, Scaling

van der Ven, A. H. G. S.; And Others – Applied Psychological Measurement, 1989
A new model is presented that explains reaction time fluctuations in prolonged work tasks. The model extends the so-called Poisson-Erlang model and accounts for long-term trend effects in the reaction time curve. The model is consistent with Spearman's hypothesis that inhibition increases during work and decreases during rest. (TJH)
Descriptors: Elementary Secondary Education, Equations (Mathematics), Foreign Countries, Goodness of Fit
Spray, Judith A.; Miller, Timothy R. – 1992
A popular method of analyzing test items for differential item functioning (DIF) is to compute a statistic that conditions samples of examinees from different populations on an estimate of ability. This conditioning or matching by ability is intended to produce an appropriate statistic that is sensitive to true differences in item functioning,…
Descriptors: Blacks, College Entrance Examinations, Comparative Testing, Computer Simulation
Drahozal, Edward C. – 1986
This paper argues that the standard "When it is expected that a test will be used to make norm-referenced assessments of groups rather than individuals, normative data based on appropriate group statistics should be provided," which was considered secondary in the 1985 "Standards for Educational and Psychological Testing" and…
Descriptors: Achievement Tests, Elementary Education, Elementary Secondary Education, Grade Point Average