NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 9 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yen, Wendy M.; Lall, Venessa F.; Monfils, Lora – ETS Research Report Series, 2012
Alternatives to vertical scales are compared for measuring longitudinal academic growth and for producing school-level growth measures. The alternatives examined were empirical cross-grade regression, ordinary least squares and logistic regression, and multilevel models. The student data used for the comparisons were Arabic Grades 4 to 10 in…
Descriptors: Foreign Countries, Scaling, Item Response Theory, Test Interpretation
Peer reviewed Peer reviewed
Burket, George R.; Yen, Wendy M. – Journal of Educational Measurement, 1997
Using simulated data modeled after real tests, a Thurstone method (L. Thurstone, 1925 and later) and three-parameter item response theory were compared for vertical scaling. Neither procedure produced artificial scale shrinkage, and both produced modest scale expansion for one simulated condition. (SLD)
Descriptors: Comparative Analysis, Item Response Theory, Scaling, Simulation
Peer reviewed Peer reviewed
Sykes, Robert C.; Yen, Wendy M. – Journal of Educational Measurement, 2000
Investigated how well the generalized and Rasch models described item and test performance across a broad range of mixed-item-format test configurations (six tests from two state proficiency testing programs). Evaluating the impact of model assumptions on the predictions of item and test information permitted a delineation of the implications of…
Descriptors: Achievement Tests, Elementary Secondary Education, Prediction, Scaling
Peer reviewed Peer reviewed
Yen, Wendy M. – Psychometrika, 1985
An approximate relationship is devised between the unidimensional model used in data analysis and a multidimensional model hypothesized to be generating the item responses. Scale shrinkage is successfully predicted for several sets of simulated data. (Author/LMO)
Descriptors: Difficulty Level, Hypothesis Testing, Item Analysis, Latent Trait Theory
Peer reviewed Peer reviewed
Yen, Wendy M. – Journal of Educational Measurement, 1986
Two methods of constucting equal-interval scales for educational achievement are discussed: Thurstone's absolute scaling method and Item Response Theory. Alternative criteria for choosing a scale are contrasted. It is argued that clearer criteria are needed for judging the appropriateness and usefulness of alternative scaling procedures.…
Descriptors: Achievement Tests, Latent Trait Theory, Mathematical Models, Scaling
Yen, Wendy M. – 1984
Two of the most popular methods for obtaining equal-interval scales for educational measurement are discussed: Thurstone's method and Item Response Theory (IRT). Between-grade growth on these scales is compared; while unstandardized differences show different trends for the two scales, standardized differences that take standard deviations into…
Descriptors: Academic Achievement, Achievement Tests, Educational Research, Latent Trait Theory
Peer reviewed Peer reviewed
Yen, Wendy M. – Educational Measurement: Issues and Practice, 1988
In their discussion of problems related to identifying learning disabilities with individually administered achievement tests, Phillips and Clarizio incorrectly conclude that developmental standard score scales produce unrealistic expectations for the growth of low-achieving students. By definition, these scores state how much the norm group's…
Descriptors: Academic Achievement, Achievement Gains, Achievement Tests, Educational Assessment
Peer reviewed Peer reviewed
Yen, Wendy M. – Journal of Educational Measurement, 1993
Results from the Maryland School Performance Assessment Program for 5,392 elementary school students and from the Comprehensive Tests of Basic Skills (multiple choice) for a national sample are used to explore local item independence (LID) of test items. Some strategies are suggested for measuring LID in performance assessments. (SLD)
Descriptors: Educational Assessment, Elementary Education, Elementary School Students, Equations (Mathematics)
Peer reviewed Peer reviewed
Yen, Wendy M.; Ferrara, Steven – Educational and Psychological Measurement, 1997
The program design and psychometric characteristics of the Maryland School Performance Assessment Program (MSPAP) are described, focusing on scaling, equating, standard setting, score accuracy, and validity. The MSPAP is an innovative performance-based testing program administered annually to students in grades three, five, and eight. (SLD)
Descriptors: Academic Achievement, Achievement Tests, Elementary Education, Grade 3