ERIC - Search Results

Descriptor

Test Construction	14
Test Use	14
Educational Assessment	6
Elementary Secondary Education	4
Evaluation Methods	4
Performance Based Assessment	4
State Programs	4
Decision Making	3
Norm Referenced Tests	3
Scores	3
Student Evaluation	3
Test Validity	3
Validity	3
Accountability	2
Adaptive Testing	2
Classroom Techniques	2
Content Analysis	2
Court Litigation	2
Criterion Referenced Tests	2
Curriculum	2
Elementary Education	2
High School Students	2
Outcomes of Education	2
Performance Tests	2
Psychometrics	2
More ▼

Source

Applied Measurement in…

Author

Mehrens, William A.	2
Aschbacher, Pamela R.	1
Baron, Joan Boykoff	1
Behuniak, Peter	1
Dunbar, Stephen B.	1
Feldt, Leonard S.	1
Forsyth, Robert A.	1
Frisbie, David A.	1
Hall, Bruce W.	1
Hambleton, Ronald K.	1
Linn, Robert L.	1
Mills, Craig N.	1
Popham, W. James	1
Quellmalz, Edys S.	1
Schafer, William D.	1
Stocking, Martha L.	1
Tucker, Charlene	1
More ▼

Publication Type

Journal Articles	14
Reports - Evaluative	8
Reports - Research	4
Reports - Descriptive	2
Guides - Non-Classroom	1
Speeches/Meeting Papers	1

Education Level

Audience

Location

Connecticut

Laws, Policies, & Programs

Assessments and Surveys

Texas Assessment of Academic…	2
Metropolitan Achievement Tests	1
Stanford Achievement Tests	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Can Validity Rise When Reliability Declines?

Peer reviewed

Feldt, Leonard S. – Applied Measurement in Education, 1997

It has often been asserted that the reliability of a measure places an upper limit on its validity. This article demonstrates in theory that validity can rise when reliability declines, even when validity evidence is a correlation with an acceptable criterion. Whether empirical examples can actually be found is an open question. (SLD)

Descriptors: Correlation, Criteria, Reliability, Test Construction

Defending a State Graduation Test: "GI Forum v. Texas Education Agency." Measurement Perspectives from an External Evaluator.

Peer reviewed

Mehrens, William A. – Applied Measurement in Education, 2000

Presents conclusions of an independent measurement expert that the Texas Assessment of Academic Skills (TAAS) was constructed according to acceptable professional standards and tests curricular material considered by the Texas Board of Education important for graduates to have mastered. Also supports the validity and reliability of the TAAS and…

Descriptors: Curriculum, Psychometrics, Reliability, Standards

"GI Forum v. Texas Education Agency": Observations for States.

Peer reviewed

Schafer, William D. – Applied Measurement in Education, 2000

Draws seven conclusions for professionals who administer state assessment programs from the "GI Forum V. Texas Education Agency" ruling. These conclusions are grouped into observations about test development and observations about test use. Discusses some implications for test use in other states. (SLD)

Descriptors: Court Litigation, High School Students, High Schools, State Programs

Practical Issues in Large-Scale Computerized Adaptive Testing.

Peer reviewed

Mills, Craig N.; Stocking, Martha L. – Applied Measurement in Education, 1996

Issues that must be addressed in the large-scale application of computerized adaptive testing are explored, including considerations of test design, scoring, test administration, item and item bank development, and other aspects of test construction. Possible solutions and areas in which additional work is needed are identified. (SLD)

Descriptors: Adaptive Testing, Computer Assisted Testing, Elementary Secondary Education, Higher Education

Developing Criteria for Performance Assessments: The Missing Link.

Peer reviewed

Quellmalz, Edys S. – Applied Measurement in Education, 1991

It is proposed that criteria for evaluating the quality of performance should be defined, at least tentatively, during the initial design of a performance assessment. Six characteristics of sound criteria are (1) significance; (2) fidelity; (3) generalizability; (4) developmental appropriateness; (5) accessibility; and (6) utility. (SLD)

Descriptors: Child Development, Cognitive Tests, Educational Assessment, Evaluation Criteria

The Potential of Criterion-Referenced Tests with Projected Norms.

Peer reviewed

Behuniak, Peter; Tucker, Charlene – Applied Measurement in Education, 1992

Psychometrically linking a state criterion-referenced test (CRT) and a norm-referenced test (NRT) to yield NRT information through the CRT was studied with samples of 1,500 to 3,000 elementary school students per subject and grade level in Connecticut. A CRT/NRT link can create a focused and coherent assessment system. (SLD)

Descriptors: Content Analysis, Criterion Referenced Tests, Educational Assessment, Elementary Education

Customized Tests and Customized Norms.

Peer reviewed

Linn, Robert L.; Hambleton, Ronald K. – Applied Measurement in Education, 1991

Four main approaches to customized testing are described, and their resulting scores' valid uses and interpretations are discussed. Customized testing can yield valid normative and curriculum-specific information, although cautious application is needed to avoid misleading inferences about student achievement. (SLD)

Descriptors: Academic Achievement, Accountability, Criterion Referenced Tests, Curriculum

Quality Control in the Development and Use of Performance Assessments.

Peer reviewed

Dunbar, Stephen B.; And Others – Applied Measurement in Education, 1991

Issues pertaining to the quality of performance assessments, including reliability and validity, are discussed. The relatively limited generalizability of performance across tasks is indicative of the care needed to evaluate performance assessments. Quality control is an empirical matter when measurement is intended to inform public policy. (SLD)

Descriptors: Educational Assessment, Generalization, Interrater Reliability, Measurement Techniques

Test Use among Classroom Teachers and Its Relationship to Teaching Level and Teaching Practices.

Peer reviewed

Hall, Bruce W.; And Others – Applied Measurement in Education, 1988

Responses of 310 teachers in Florida to a survey about use of teacher-made tests, nationally standardized tests, and state minimum competency tests were studied. Results show that all three test types were used to some extent in eight decision categories, but none of the tests were clearly dominant. (SLD)

Descriptors: Classroom Techniques, Decision Making, Elementary Secondary Education, Minimum Competency Testing

Three Applications of Customized Testing in Local School Districts.

Peer reviewed

Forsyth, Robert A.; And Others – Applied Measurement in Education, 1992

Eighth grade teachers in three local school districts helped customize two standardized norm-referenced tests for ninth graders to investigate effects of deleting some items and adding locally constructed items. Results indicate that percentile ranks for the customized tests could be very different from those for the complete test. (SLD)

Descriptors: Adaptive Testing, Comparative Testing, Elementary Secondary Education, Grade 9

Performance Assessment: State Activity, Interest, and Concerns.

Peer reviewed

Aschbacher, Pamela R. – Applied Measurement in Education, 1991

The University of California's (Los Angeles) Center for Research on Evaluation, Standards, and Student Testing survey of state assessment directors reveals that about 25 states currently study or develop performance assessments. Obstacles to statewide use of performance assessments were expressed. The new Student Assessment Exchange should…

Descriptors: Accountability, Cost Effectiveness, Educational Assessment, Educational Improvement

Strategies for the Development of Effective Performance Exercises.

Peer reviewed

Baron, Joan Boykoff – Applied Measurement in Education, 1991

A series of 19 questions illuminates the characteristics of effective performance assessments in 3 sections: (1) the nature of assessment; (2) properties of effective tasks; and (3) making tasks meaningful and engaging. A fourth section offers practical suggestions for the construction of performance assessments and for teacher involvement. (SLD)

Descriptors: Decision Making, Educational Assessment, Elementary Secondary Education, Evaluation Methods

An Evaluation of Elementary Textbook Tests as Classroom Assessment Tools.

Peer reviewed

Frisbie, David A.; And Others – Applied Measurement in Education, 1993

The nature and quality of chapter-end tests accompanying social studies and science textbooks used in elementary school and middle school grades were studied through reviews by 3 judges of 91 tests. Identified shortcomings lead to the recommendation that these tests not be used intact in classroom assessment. (SLD)

Descriptors: Classroom Techniques, Content Analysis, Educational Assessment, Educational Objectives

How to Evaluate the Legal Defensibility of High-Stakes Tests.

Peer reviewed

Mehrens, William A.; Popham, W. James – Applied Measurement in Education, 1992

This paper discusses how to determine whether a test was developed in a legally defensible manner, reviewing general issues, specific cases bearing on different types of test use, some evaluative dimensions, and evidence of test quality. Tests constructed and used according to existing standards will generally stand legal scrutiny. (SLD)

Descriptors: College Entrance Examinations, Compliance (Legal), Constitutional Law, Court Litigation