ERIC - Search Results

Descriptor

Norm Referenced Tests	20
Test Reliability	8
Criterion Referenced Tests	7
Test Interpretation	6
Test Validity	5
Achievement Tests	4
Test Results	4
Testing Problems	4
True Scores	4
Comparative Analysis	3
Elementary Secondary Education	3
Error of Measurement	3
Measurement Techniques	3
Scoring	3
Statistical Bias	3
Test Construction	3
Educational Testing	2
High Schools	2
Mathematical Applications	2
Models	2
National Norms	2
Norms	2
Program Evaluation	2
Raw Scores	2
Scaling	2

Source

Journal of Educational…

Author

Livingston, Samuel A.	4
Baglin, Roger F.	3
Tallmadge, G. Kasten	2
Airasian, Peter W.	1
Burket, George R.	1
Conklin, Jonathan E.	1
Feifs, Helmuts	1
Hall, Alfred E.	1
Hambleton, Ronald K.	1
Harris, Chester W.	1
Hoover, H. D.	1
Kling, Samuel	1
Novick, Melvin R.	1
Pack, Elbert C.	1
Page, Ellis B.	1
Plake, Barbara S.	1
Skakun, Ernest N.	1
Wardrop, James L.	1

Publication Type

Journal Articles	13
Reports - Research	8
Opinion Papers	5
Information Analyses	2
Reports - Evaluative	2

Audience

Practitioners

Location

United States

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Iowa Tests of Basic Skills	3
Metropolitan Achievement Tests	2
California Achievement Tests	1
SAT (College Admission Test)	1
Stanford Achievement Tests	1

Showing 1 to 15 of 20 results

Group Scores: A Rejoinder to Burket.

Peer reviewed

Baglin, Roger F. – Journal of Educational Measurement, 1988

G. Burket's criticisms regarding calculation and interpretation of group scores on norm-referenced tests are discussed. Burket and Baglin seem to agree on the existence of a problem in the calculation and interpretation of group scores on norm-referenced tests but disagree on the issue of that problem's causes and solutions. (TJH)

Descriptors: Group Testing, Norm Referenced Tests, Scores, Testing Problems

Group Scores: A Response to Baglin.

Peer reviewed

Burket, George R. – Journal of Educational Measurement, 1987

This response to the Baglin paper (1986) points out the fallacy in inferring that inappropriate scaling procedures cause apparent discrepancies between medians and means and between means calculated using different units. (LMO)

Descriptors: Norm Referenced Tests, Scaling, Scoring, Statistical Distributions

A Note on the Interpretation of the Criterion-Referenced Reliability Coefficient

Peer reviewed

Livingston, Samuel A. – Journal of Educational Measurement, 1973

This article comments on a study by Harris, who presented formulas for the variance of errors of estimation (of a true score from an observed score) and the variance of errors of prediction (of an observed score from an observed score on a parallel test). (Author/RK)

Descriptors: Criterion Referenced Tests, Measurement, Norm Referenced Tests, Test Reliability

Does "Nationally" Normed Really Mean Nationally?

Peer reviewed

Baglin, Roger F. – Journal of Educational Measurement, 1981

While major test publishers randomly select school districts for their national norming studies, a survey of "accepting" and "declining" districts supports the hypothesis that self-selection bias results in overrepresentation of districts which already use a specific publisher's tests or instructional materials. (Author/BW)

Descriptors: National Norms, Norm Referenced Tests, Sampling, Standardized Tests

Reply to Shavelson, Block, and Ravitch's "Criterion-Referenced Testing: Comments on Reliability"

Peer reviewed

Livingston, Samuel A. – Journal of Educational Measurement, 1972

Author replies to article TM 500 559. (MB)

Descriptors: Criterion Referenced Tests, Measurement Techniques, Norm Referenced Tests, Scoring

A Framework for Analyzing the Inference Structure of Educational Achievement Tests.

Peer reviewed

Wardrop, James L.; And Others – Journal of Educational Measurement, 1982

A structure for describing different approaches to testing is generated by identifying five dimensions along which tests differ: test uses, item generation, item revision, assessment of precision, and validation. These dimensions are used to profile tests of reading comprehension. Only norm-referenced achievement tests had an inference system…

Descriptors: Achievement Tests, Comparative Analysis, Educational Testing, Models

Rumors Regarding the Death of the Equipercentile Assumption May Have Been Greatly Exaggerated.

Peer reviewed

Tallmadge, G. Kasten – Journal of Educational Measurement, 1985

Support for the validity of the equipercentile assumption is presented in contrast with the conclusion of Powers, Slaughter, and Helmick (EJ 289 091). Observed "gains" from pre- to posttests are better attributed to stakeholder bias, posttests that match curriculum content too closely, or a combination of these factors. (Author/DWH)

Descriptors: Data Interpretation, Evaluation Methods, Norm Referenced Tests, Predictive Measurement

A Problem in Calculating Group Scores on Norm-Referenced Tests.

Peer reviewed

Baglin, Roger F. – Journal of Educational Measurement, 1986

Norm-referenced standardized achievement tests are designed for obtaining group scores which can vary widely, depending on not only the measure of central tendency but also the type of derived score employed. This situation is hypothesized to be the result of using inappropriate statistical procedures to develop publishers' scaled scores.…

Descriptors: Achievement Tests, Elementary Secondary Education, Latent Trait Theory, Norm Referenced Tests

Comparability of Methods for Setting Standards.

Peer reviewed

Skakun, Ernest N.; Kling, Samuel – Journal of Educational Measurement, 1980

The Nedelsky procedure and two modified versions of the Ebel procedure were used by judges to set pass-fail levels on a medical certification examination in general surgery. Results indicated that the approaches produced different passing scores. The Ebel procedures displayed higher reliability than the Nedelsky approach. (Author/RD)

Descriptors: Certification, Cutting Scores, Measurement Techniques, Medical Students

Stanford Achievement Test Forms E and F (Test Review).

Peer reviewed

Airasian, Peter W. – Journal of Educational Measurement, 1985

The Stanford Achievement Test Forms E and F were judged to be one of the best achievement batteries for assessing basic skills taught in grades one through nine. The test publisher provides several booklets in addition to the administration manual. These include the Norms Booklet, Handbook of Instructional Strategies, and Guide to Classroom…

Descriptors: Academic Achievement, Achievement Rating, Achievement Tests, Elementary Secondary Education

Toward an Integration of Theory and Method for Criterion-Referenced Tests

Peer reviewed

Hambleton, Ronald K.; Novick, Melvin R. – Journal of Educational Measurement, 1973

In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. (Editor)

Descriptors: Bayesian Statistics, Criterion Referenced Tests, Decision Making, Definitions

A Reply to Harris's "An Interpretation of Livingston's Reliability Coefficient for Criterion-Referenced Tests"

Peer reviewed

Livingston, Samuel A. – Journal of Educational Measurement, 1972

This article is a reply to a previous paper (see TM 500 488) interpreting Livingston's original article (see TM 500 487). (CK)

Descriptors: Criterion Referenced Tests, Error of Measurement, Norm Referenced Tests, Test Construction

The Ball Aptitude Battery (Test Review).

Peer reviewed

Hall, Alfred E. – Journal of Educational Measurement, 1985

The 12 subtests of the Ball Aptitude Battery (BAB) listed in the administration manual were described. The reviewer believes this aptitude battery, designed for use with high school students and adults in job selection and placement, needs major improvements. It is suggested that the BAB be used solely for research purposes. (DWH)

Descriptors: Adults, Aptitude Tests, High Schools, Norm Referenced Tests

An Empirical Assessment of Norm-Referenced Evaluation Methodology.

Peer reviewed

Tallmadge, G. Kasten – Journal of Educational Measurement, 1982

To assess the validity of a norm-referenced model used in evaluating large-scale federal educational programs for disadvantaged children, retrospective analyses of two databases compared its gain estimates with those from a randomized control group model; the two sets of estimates were approximately equal. (Author/CM)

Descriptors: Comparative Analysis, Educational Assessment, Elementary Secondary Education, Federal Programs

The Effects of Testing Upon Attitude Towards the Method and Content of Instruction

Peer reviewed

Pack, Elbert C. – Journal of Educational Measurement, 1972

A significantly more positive attitude toward the subject matter of instruction was associated with use of the criterion-referenced measure than with the norm-referenced measure; differences in attitude toward the mode of instruction were not significant. (Author)

Descriptors: Attitude Measures, Comparative Analysis, Course Content, Criterion Referenced Tests
