Descriptor
Statistical Distributions | 6 |
Test Construction | 3 |
Equated Scores | 2 |
Mathematical Models | 2 |
Reliability | 2 |
Scaling | 2 |
Scoring | 2 |
State Programs | 2 |
Test Items | 2 |
Test Reliability | 2 |
Test Results | 2 |
More ▼ |
Source
Applied Measurement in… | 6 |
Author
Bandalos, Deborah L. | 1 |
Cummings, Cynthia B. | 1 |
Edwards, Don | 1 |
Enders, Craig K. | 1 |
Feldt, Leonard S. | 1 |
Hanson, Bradley A. | 1 |
Kiplinger, Vonda L. | 1 |
Linn, Robert L. | 1 |
Schiel, Jeffrey L. | 1 |
Shaw, Dale G. | 1 |
Publication Type
Journal Articles | 6 |
Reports - Evaluative | 6 |
Information Analyses | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 1 |
SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating

Hanson, Bradley A. – Applied Measurement in Education, 1996
Determining whether score distributions differ on two or more test forms administered to samples of examinees from a single population is explored using three statistical tests using loglinear models. Examples are presented of applying tests of distribution differences to decide if equating is needed for alternative forms of a test. (SLD)
Descriptors: Equated Scores, Scoring, Statistical Distributions, Test Format

Enders, Craig K.; Bandalos, Deborah L. – Applied Measurement in Education, 1999
Examined the degree to which coefficient alpha is affected by including items with different distribution shapes within a unidimensional scale. Computer simulation results indicate that reliability does not increase dramatically as a result of using differentially shaped items within a scale. Discusses implications for test construction. (SLD)
Descriptors: Computer Simulation, Reliability, Scaling, Statistical Distributions

Edwards, Don; Cummings, Cynthia B. – Applied Measurement in Education, 1990
An evolved form of the Edwards and Beckworth (1989) model for probability selection for Scholastic Achievement Test takers using truncated normal distributions is presented. It is shown that the arguments of Holland and Wainer are not sufficient to reject this model. (SLD)
Descriptors: College Entrance Examinations, Models, Participation, Probability

Feldt, Leonard S. – Applied Measurement in Education, 1993
The recommendation that the reliability of multiple-choice tests will be enhanced if the distribution of item difficulties is concentrated at approximately 0.50 is reinforced and extended in this article by viewing the 0/1 item scoring as a dichotomization of an underlying normally distributed ability score. (SLD)
Descriptors: Ability, Difficulty Level, Guessing (Tests), Mathematical Models

Linn, Robert L.; Kiplinger, Vonda L. – Applied Measurement in Education, 1995
The adequacy of linking statewide standardized test results to the National Assessment of Educational Progress by using equipercentile equating procedures was investigated using statewide mathematics data from four states. Results suggest that the linkings are not sufficiently trustworthy to make comparisons based on the tails of the distribution.…
Descriptors: Comparative Analysis, Educational Assessment, Equated Scores, Mathematics Tests

Schiel, Jeffrey L.; Shaw, Dale G. – Applied Measurement in Education, 1992
Changes in information retention resulting from changes in reliability and number of intervals in scale construction were studied to provide quantitative information to help in decisions about choosing intervals. Information retention reached a maximum when the number of intervals was about 8 or more and reliability was near 1.0. (SLD)
Descriptors: Decision Making, Knowledge Level, Mathematical Models, Monte Carlo Methods