Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 1 |
Descriptor
Scaling | 9 |
Achievement Tests | 5 |
Academic Achievement | 4 |
Mathematical Models | 4 |
Educational Assessment | 3 |
Elementary Secondary Education | 3 |
Latent Trait Theory | 3 |
Scores | 3 |
Scoring | 3 |
State Programs | 3 |
Testing Programs | 3 |
More ▼ |
Source
Journal of Educational… | 4 |
ETS Research Report Series | 1 |
Educational Measurement:… | 1 |
Educational and Psychological… | 1 |
Psychometrika | 1 |
Author
Yen, Wendy M. | 9 |
Burket, George R. | 1 |
Ferrara, Steven | 1 |
Lall, Venessa F. | 1 |
Monfils, Lora | 1 |
Sykes, Robert C. | 1 |
Publication Type
Journal Articles | 8 |
Reports - Evaluative | 5 |
Reports - Research | 4 |
Opinion Papers | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 6 | 1 |
Grade 8 | 1 |
Intermediate Grades | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Researchers | 1 |
Location
Qatar | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Comprehensive Tests of Basic… | 1 |
What Works Clearinghouse Rating
Yen, Wendy M.; Lall, Venessa F.; Monfils, Lora – ETS Research Report Series, 2012
Alternatives to vertical scales are compared for measuring longitudinal academic growth and for producing school-level growth measures. The alternatives examined were empirical cross-grade regression, ordinary least squares and logistic regression, and multilevel models. The student data used for the comparisons were Arabic Grades 4 to 10 in…
Descriptors: Foreign Countries, Scaling, Item Response Theory, Test Interpretation

Burket, George R.; Yen, Wendy M. – Journal of Educational Measurement, 1997
Using simulated data modeled after real tests, a Thurstone method (L. Thurstone, 1925 and later) and three-parameter item response theory were compared for vertical scaling. Neither procedure produced artificial scale shrinkage, and both produced modest scale expansion for one simulated condition. (SLD)
Descriptors: Comparative Analysis, Item Response Theory, Scaling, Simulation

Sykes, Robert C.; Yen, Wendy M. – Journal of Educational Measurement, 2000
Investigated how well the generalized and Rasch models described item and test performance across a broad range of mixed-item-format test configurations (six tests from two state proficiency testing programs). Evaluating the impact of model assumptions on the predictions of item and test information permitted a delineation of the implications of…
Descriptors: Achievement Tests, Elementary Secondary Education, Prediction, Scaling

Yen, Wendy M. – Psychometrika, 1985
An approximate relationship is devised between the unidimensional model used in data analysis and a multidimensional model hypothesized to be generating the item responses. Scale shrinkage is successfully predicted for several sets of simulated data. (Author/LMO)
Descriptors: Difficulty Level, Hypothesis Testing, Item Analysis, Latent Trait Theory

Yen, Wendy M. – Journal of Educational Measurement, 1986
Two methods of constucting equal-interval scales for educational achievement are discussed: Thurstone's absolute scaling method and Item Response Theory. Alternative criteria for choosing a scale are contrasted. It is argued that clearer criteria are needed for judging the appropriateness and usefulness of alternative scaling procedures.…
Descriptors: Achievement Tests, Latent Trait Theory, Mathematical Models, Scaling
Yen, Wendy M. – 1984
Two of the most popular methods for obtaining equal-interval scales for educational measurement are discussed: Thurstone's method and Item Response Theory (IRT). Between-grade growth on these scales is compared; while unstandardized differences show different trends for the two scales, standardized differences that take standard deviations into…
Descriptors: Academic Achievement, Achievement Tests, Educational Research, Latent Trait Theory

Yen, Wendy M. – Educational Measurement: Issues and Practice, 1988
In their discussion of problems related to identifying learning disabilities with individually administered achievement tests, Phillips and Clarizio incorrectly conclude that developmental standard score scales produce unrealistic expectations for the growth of low-achieving students. By definition, these scores state how much the norm group's…
Descriptors: Academic Achievement, Achievement Gains, Achievement Tests, Educational Assessment

Yen, Wendy M. – Journal of Educational Measurement, 1993
Results from the Maryland School Performance Assessment Program for 5,392 elementary school students and from the Comprehensive Tests of Basic Skills (multiple choice) for a national sample are used to explore local item independence (LID) of test items. Some strategies are suggested for measuring LID in performance assessments. (SLD)
Descriptors: Educational Assessment, Elementary Education, Elementary School Students, Equations (Mathematics)

Yen, Wendy M.; Ferrara, Steven – Educational and Psychological Measurement, 1997
The program design and psychometric characteristics of the Maryland School Performance Assessment Program (MSPAP) are described, focusing on scaling, equating, standard setting, score accuracy, and validity. The MSPAP is an innovative performance-based testing program administered annually to students in grades three, five, and eight. (SLD)
Descriptors: Academic Achievement, Achievement Tests, Elementary Education, Grade 3