Descriptor
Source
Applied Measurement in… | 14 |
Author
Publication Type
Journal Articles | 14 |
Reports - Evaluative | 8 |
Reports - Research | 4 |
Reports - Descriptive | 2 |
Guides - Non-Classroom | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Audience
Location
Connecticut | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Texas Assessment of Academic… | 2 |
Metropolitan Achievement Tests | 1 |
Stanford Achievement Tests | 1 |
What Works Clearinghouse Rating

Feldt, Leonard S. – Applied Measurement in Education, 1997
It has often been asserted that the reliability of a measure places an upper limit on its validity. This article demonstrates in theory that validity can rise when reliability declines, even when validity evidence is a correlation with an acceptable criterion. Whether empirical examples can actually be found is an open question. (SLD)
Descriptors: Correlation, Criteria, Reliability, Test Construction

Mehrens, William A. – Applied Measurement in Education, 2000
Presents conclusions of an independent measurement expert that the Texas Assessment of Academic Skills (TAAS) was constructed according to acceptable professional standards and tests curricular material considered by the Texas Board of Education important for graduates to have mastered. Also supports the validity and reliability of the TAAS and…
Descriptors: Curriculum, Psychometrics, Reliability, Standards

Schafer, William D. – Applied Measurement in Education, 2000
Draws seven conclusions for professionals who administer state assessment programs from the "GI Forum V. Texas Education Agency" ruling. These conclusions are grouped into observations about test development and observations about test use. Discusses some implications for test use in other states. (SLD)
Descriptors: Court Litigation, High School Students, High Schools, State Programs

Mills, Craig N.; Stocking, Martha L. – Applied Measurement in Education, 1996
Issues that must be addressed in the large-scale application of computerized adaptive testing are explored, including considerations of test design, scoring, test administration, item and item bank development, and other aspects of test construction. Possible solutions and areas in which additional work is needed are identified. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Elementary Secondary Education, Higher Education

Quellmalz, Edys S. – Applied Measurement in Education, 1991
It is proposed that criteria for evaluating the quality of performance should be defined, at least tentatively, during the initial design of a performance assessment. Six characteristics of sound criteria are (1) significance; (2) fidelity; (3) generalizability; (4) developmental appropriateness; (5) accessibility; and (6) utility. (SLD)
Descriptors: Child Development, Cognitive Tests, Educational Assessment, Evaluation Criteria

Behuniak, Peter; Tucker, Charlene – Applied Measurement in Education, 1992
Psychometrically linking a state criterion-referenced test (CRT) and a norm-referenced test (NRT) to yield NRT information through the CRT was studied with samples of 1,500 to 3,000 elementary school students per subject and grade level in Connecticut. A CRT/NRT link can create a focused and coherent assessment system. (SLD)
Descriptors: Content Analysis, Criterion Referenced Tests, Educational Assessment, Elementary Education

Linn, Robert L.; Hambleton, Ronald K. – Applied Measurement in Education, 1991
Four main approaches to customized testing are described, and their resulting scores' valid uses and interpretations are discussed. Customized testing can yield valid normative and curriculum-specific information, although cautious application is needed to avoid misleading inferences about student achievement. (SLD)
Descriptors: Academic Achievement, Accountability, Criterion Referenced Tests, Curriculum

Dunbar, Stephen B.; And Others – Applied Measurement in Education, 1991
Issues pertaining to the quality of performance assessments, including reliability and validity, are discussed. The relatively limited generalizability of performance across tasks is indicative of the care needed to evaluate performance assessments. Quality control is an empirical matter when measurement is intended to inform public policy. (SLD)
Descriptors: Educational Assessment, Generalization, Interrater Reliability, Measurement Techniques

Hall, Bruce W.; And Others – Applied Measurement in Education, 1988
Responses of 310 teachers in Florida to a survey about use of teacher-made tests, nationally standardized tests, and state minimum competency tests were studied. Results show that all three test types were used to some extent in eight decision categories, but none of the tests were clearly dominant. (SLD)
Descriptors: Classroom Techniques, Decision Making, Elementary Secondary Education, Minimum Competency Testing

Forsyth, Robert A.; And Others – Applied Measurement in Education, 1992
Eighth grade teachers in three local school districts helped customize two standardized norm-referenced tests for ninth graders to investigate effects of deleting some items and adding locally constructed items. Results indicate that percentile ranks for the customized tests could be very different from those for the complete test. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Elementary Secondary Education, Grade 9

Aschbacher, Pamela R. – Applied Measurement in Education, 1991
The University of California's (Los Angeles) Center for Research on Evaluation, Standards, and Student Testing survey of state assessment directors reveals that about 25 states currently study or develop performance assessments. Obstacles to statewide use of performance assessments were expressed. The new Student Assessment Exchange should…
Descriptors: Accountability, Cost Effectiveness, Educational Assessment, Educational Improvement

Baron, Joan Boykoff – Applied Measurement in Education, 1991
A series of 19 questions illuminates the characteristics of effective performance assessments in 3 sections: (1) the nature of assessment; (2) properties of effective tasks; and (3) making tasks meaningful and engaging. A fourth section offers practical suggestions for the construction of performance assessments and for teacher involvement. (SLD)
Descriptors: Decision Making, Educational Assessment, Elementary Secondary Education, Evaluation Methods

Frisbie, David A.; And Others – Applied Measurement in Education, 1993
The nature and quality of chapter-end tests accompanying social studies and science textbooks used in elementary school and middle school grades were studied through reviews by 3 judges of 91 tests. Identified shortcomings lead to the recommendation that these tests not be used intact in classroom assessment. (SLD)
Descriptors: Classroom Techniques, Content Analysis, Educational Assessment, Educational Objectives

Mehrens, William A.; Popham, W. James – Applied Measurement in Education, 1992
This paper discusses how to determine whether a test was developed in a legally defensible manner, reviewing general issues, specific cases bearing on different types of test use, some evaluative dimensions, and evidence of test quality. Tests constructed and used according to existing standards will generally stand legal scrutiny. (SLD)
Descriptors: College Entrance Examinations, Compliance (Legal), Constitutional Law, Court Litigation