NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 20 results Save | Export
Peer reviewed Peer reviewed
Baglin, Roger F. – Journal of Educational Measurement, 1988
G. Burket's criticisms regarding calculation and interpretation of group scores on norm-referenced tests are discussed. Burket and Baglin seem to agree on the existence of a problem in the calculation and interpretation of group scores on norm-referenced tests but disagree on the issue of that problem's causes and solutions. (TJH)
Descriptors: Group Testing, Norm Referenced Tests, Scores, Testing Problems
Peer reviewed Peer reviewed
Burket, George R. – Journal of Educational Measurement, 1987
This response to the Baglin paper (1986) points out the fallacy in inferring that inappropriate scaling procedures cause apparent discrepancies between medians and means and between means calculated using different units. (LMO)
Descriptors: Norm Referenced Tests, Scaling, Scoring, Statistical Distributions
Peer reviewed Peer reviewed
Livingston, Samuel A. – Journal of Educational Measurement, 1973
Article commented on a study by Harris, who presented formulas for the variance of errors of estimation (of a true score from an observed score) and the variance of errors of prediction (of an observed score from an observed score on a parallel test). (Author/RK)
Descriptors: Criterion Referenced Tests, Measurement, Norm Referenced Tests, Test Reliability
Peer reviewed Peer reviewed
Baglin, Roger F. – Journal of Educational Measurement, 1981
While major test publishers randomly select school districts for their national norming studies, a survey of "accepting" and "declining" districts supports the hypothesis that self-selection bias results in overrepresentation of districts which already use a specific publisher's tests or instructional materials. (Author/BW)
Descriptors: National Norms, Norm Referenced Tests, Sampling, Standardized Tests
Peer reviewed Peer reviewed
Livingston, Samuel A. – Journal of Educational Measurement, 1972
Author replies to article TM 500 559. (MB)
Descriptors: Criterion Referenced Tests, Measurement Techniques, Norm Referenced Tests, Scoring
Peer reviewed Peer reviewed
Wardrop, James L.; And Others – Journal of Educational Measurement, 1982
A structure for describing different approaches to testing is generated by identifying five dimensions along which tests differ: test uses, item generation, item revision, assessment of precision, and validation. These dimensions are used to profile tests of reading comprehension. Only norm-referenced achievement tests had an inference system…
Descriptors: Achievement Tests, Comparative Analysis, Educational Testing, Models
Peer reviewed Peer reviewed
Tallmadge, G. Kasten – Journal of Educational Measurement, 1985
Support for the validity of the equipercentile assumption is presented in contrast with the conclusion of Powers, Slaughter, and Helmick (EJ 289 091). Observed "gains" from pre- to posttests are better attributed to stakeholder bias, posttests that match curriculum content too closely, or a combination of these factors. (Author/DWH)
Descriptors: Data Interpretation, Evaluation Methods, Norm Referenced Tests, Predictive Measurement
Peer reviewed Peer reviewed
Baglin, Roger F. – Journal of Educational Measurement, 1986
Norm-referenced standardized achievement tests are designed for obtaining group scores which can vary widely, depending on not only the measure of central tendency but also the type of derived score employed. This situation is hypothesized to be the result of using inappropriate statistical procedures to develop publishers' scaled scores.…
Descriptors: Achievement Tests, Elementary Secondary Education, Latent Trait Theory, Norm Referenced Tests
Peer reviewed Peer reviewed
Skakun, Ernest N.; Kling, Samuel – Journal of Educational Measurement, 1980
The Nedelsky procedure and two modified versions of the Ebel procedure were used by judges to set pass-fail levels on a medical certification examination in general surgery. Results indicated that the approaches produced different passing scores. The Ebel procedures displayed higher reliability than the Nedelsky approach. (Author/RD)
Descriptors: Certification, Cutting Scores, Measurement Techniques, Medical Students
Peer reviewed Peer reviewed
Airasian, Peter W. – Journal of Educational Measurement, 1985
The Stanford Achievement Test Forms E and F were judged to be one of the best achievement batteries for assessing basic skills taught in grades one through nine. The test publisher provides several booklets in addition to the administration manual. These include the Norms Booklet, Handbook of Instructional Strategies, and Guide to Classroom…
Descriptors: Academic Achievement, Achievement Rating, Achievement Tests, Elementary Secondary Education
Peer reviewed Peer reviewed
Hambleton, Ronald K.; Novick, Melvin R. – Journal of Educational Measurement, 1973
In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. (Editor)
Descriptors: Bayesian Statistics, Criterion Referenced Tests, Decision Making, Definitions
Peer reviewed Peer reviewed
Livingston, Samuel A. – Journal of Educational Measurement, 1972
This article is a reply to a previous paper (see TM 500 488) interpreting Livingston's original article (see TM 500 487). (CK)
Descriptors: Criterion Referenced Tests, Error of Measurement, Norm Referenced Tests, Test Construction
Peer reviewed Peer reviewed
Hall, Alfred E. – Journal of Educational Measurement, 1985
The 12 subtests of the Ball Aptitude Battery (BAB) listed in the administration manual were described. The reviewer believes this aptitude battery, designed for use with high school students and adults in job selection and placement, needs major improvements. It is suggested that the BAB be used solely for research purposes. (DWH)
Descriptors: Adults, Aptitude Tests, High Schools, Norm Referenced Tests
Peer reviewed Peer reviewed
Tallmadge, G. Kasten – Journal of Educational Measurement, 1982
In assessing the validity of a norm-referenced model used in evaluating large-scale federal educational programs for disadvantaged children, gain estimates were shown as approximately equal with randomized control group model estimates compared by retrospective analyses of two databases. (Author/CM)
Descriptors: Comparative Analysis, Educational Assessment, Elementary Secondary Education, Federal Programs
Peer reviewed Peer reviewed
Pack, Elbert C. – Journal of Educational Measurement, 1972
Significantly more positive attitude toward subject matter of instruction was associated with the use of the criterion-referenced measure than with the norm-referenced measure; differences in attitude toward mode of instruction were not significant. (Author)
Descriptors: Attitude Measures, Comparative Analysis, Course Content, Criterion Referenced Tests
Previous Page | Next Page ยป
Pages: 1  |  2