ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	1

Source

Applied Measurement in…

Publication Type

Journal Articles	18
Reports - Evaluative	18
Guides - Non-Classroom	1
Speeches/Meeting Papers	1

Education Level

Audience

Location

Vermont

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

Validity and Racial Justice in Educational Assessment

Peer reviewed

Direct link

Lederman, Josh – Applied Measurement in Education, 2023

Given its centrality to assessment, until the concept of validity includes concern for racial justice, such matters will be seen as residing outside the "real" work of validation, rendering them powerless to count against the apparent scientific merit of the test. As the definition of validity has evolved, however, it holds great…

Descriptors: Educational Assessment, Validity, Social Justice, Race

Can Validity Rise When Reliability Declines?

Peer reviewed

Feldt, Leonard S. – Applied Measurement in Education, 1997

It has often been asserted that the reliability of a measure places an upper limit on its validity. This article demonstrates in theory that validity can rise when reliability declines, even when validity evidence is a correlation with an acceptable criterion. Whether empirical examples can actually be found is an open question. (SLD)

Descriptors: Correlation, Criteria, Reliability, Test Construction

The Use of a Person-Fit Statistic with One High-Quality Achievement Test.

Peer reviewed

Rudner, Lawrence M.; And Others – Applied Measurement in Education, 1996

An analysis of data from the 1990 National Assessment of Educational Progress Trial State Assessment suggests that person-fit statistics may not provide additional information about results of psychometrically strong achievement tests. More research is needed before person-fit statistics can be used routinely in analysis of item response data.…

Descriptors: Achievement Tests, Individual Differences, Item Response Theory, Psychometrics

Assessing Person-Fit on Measures of Typical Performance.

Peer reviewed

Reise, Steven P.; Flannery, Wm. Peter – Applied Measurement in Education, 1996

Statistical and theoretical issues that arise from assessing person-fit on measures of typical performance are discussed, including the frequent attenuation of detection of person-misfit, the need for methods of identifying sources of response aberrancy, and person-fit measures as moderators of trait-criterion relations. (SLD)

Descriptors: Item Response Theory, Measurement Techniques, Performance, Responses

The Precision of Measurements.

Peer reviewed

Kane, Michael – Applied Measurement in Education, 1996

This overview of the role of error and tolerance for error in measurement asserts that the generic precision associated with a measurement procedure is defined as the root mean square error, or standard error, in some relevant population. This view of precision is explored in several applications of measurement. (SLD)

Descriptors: Error of Measurement, Error Patterns, Generalizability Theory, Measurement Techniques

Use of Generalized Person-Fit Indexes, Zetas for Statistical Pattern Classification.

Peer reviewed

Tatsuoka, Kikumi – Applied Measurement in Education, 1996

Application of person-fit statistics to cognitive diagnosis requires special efforts to detect normal and usual response patterns resulting from sources of misconception that are frequently observed among students. This study shows a solution for the problem by introducing an extension of a person-fit statistic developed by K. Tatsuoka (1985).…

Descriptors: Classification, Cognitive Tests, Diagnostic Tests, Item Response Theory

Using Multidimensional Item Response Theory to Understand What Items and Tests Are Measuring.

Peer reviewed

Ackerman, Terry A. – Applied Measurement in Education, 1994

When item response data do not satisfy the unidimensionality assumption, multidimensional item response theory (MIRT) should be used to model the item-examinee interaction. This article presents and discusses MIRT analyses designed to give better insight into what individual items are measuring. (SLD)

Descriptors: Evaluation Methods, Item Response Theory, Measurement Techniques, Models

Linking Results of Distinct Assessments.

Peer reviewed

Linn, Robert L. – Applied Measurement in Education, 1993

The following ways of linking results from distinct assessments to use them for multiple purposes are reviewed: (1) equating; (2) calibration; (3) statistical moderation; (4) prediction; and (5) social moderation. The characteristics of these methods, their requirements, and the comparative inferences they support are described. (SLD)

Descriptors: Comparative Analysis, Educational Assessment, Elementary Secondary Education, Equated Scores

Analysis of Profiles of Students Applying for Entrance to Universities.

Peer reviewed

Tognolini, Jim; Andrich, David – Applied Measurement in Education, 1996

Applying the principles of latent trait analysis makes it possible to rank order profiles of students seeking college admission in terms of the adequacy of a single score. An illustration using 577 profiles shows that it is possible that only a subset of profiles may require qualitative analysis. (SLD)

Descriptors: Admission (School), College Bound Students, Higher Education, Item Response Theory

Practical Issues in Large-Scale Computerized Adaptive Testing.

Peer reviewed

Mills, Craig N.; Stocking, Martha L. – Applied Measurement in Education, 1996

Issues that must be addressed in the large-scale application of computerized adaptive testing are explored, including considerations of test design, scoring, test administration, item and item bank development, and other aspects of test construction. Possible solutions and areas in which additional work is needed are identified. (SLD)

Descriptors: Adaptive Testing, Computer Assisted Testing, Elementary Secondary Education, Higher Education

The Reliability of Mathematics Portfolio Scores: Lessons from the Vermont Experience.

Peer reviewed

Klein, Stephen P.; And Others – Applied Measurement in Education, 1995

Portfolios are the centerpiece of Vermont's statewide assessment program in mathematics. Portfolio scores in the first two years were not reliable enough to permit the reporting of student-level results, but increasing the number of readers or the number of portfolio pieces is not operationally feasible. (SLD)

Descriptors: Educational Assessment, Elementary Secondary Education, Mathematics Tests, Performance Based Assessment

Developing Criteria for Performance Assessments: The Missing Link.

Peer reviewed

Quellmalz, Edys S. – Applied Measurement in Education, 1991

It is proposed that criteria for evaluating the quality of performance should be defined, at least tentatively, during the initial design of a performance assessment. Six characteristics of sound criteria are (1) significance; (2) fidelity; (3) generalizability; (4) developmental appropriateness; (5) accessibility; and (6) utility. (SLD)

Descriptors: Child Development, Cognitive Tests, Educational Assessment, Evaluation Criteria

Admissions Testing: Recommended Uses, Validity, Differential Prediction, and Coaching.

Peer reviewed

Linn, Robert L. – Applied Measurement in Education, 1990

The nature of admissions tests and their intended uses are reviewed. Evidence regarding the validity of tests, their contributions to admissions decisions, the effects of coaching, and possible bias in the predictive meaning of test scores for women and minorities are discussed. (Author/SLD)

Descriptors: Admission (School), Admission Criteria, College Entrance Examinations, Decision Making

Customized Tests and Customized Norms.

Peer reviewed

Linn, Robert L.; Hambleton, Ronald K. – Applied Measurement in Education, 1991

Four main approaches to customized testing are described, and their resulting scores' valid uses and interpretations are discussed. Customized testing can yield valid normative and curriculum-specific information, although cautious application is needed to avoid misleading inferences about student achievement. (SLD)

Descriptors: Academic Achievement, Accountability, Criterion Referenced Tests, Curriculum

Quality Control in the Development and Use of Performance Assessments.

Peer reviewed

Dunbar, Stephen B.; And Others – Applied Measurement in Education, 1991

Issues pertaining to the quality of performance assessments, including reliability and validity, are discussed. The relatively limited generalizability of performance across tasks is indicative of the care needed to evaluate performance assessments. Quality control is an empirical matter when measurement is intended to inform public policy. (SLD)

Descriptors: Educational Assessment, Generalization, Interrater Reliability, Measurement Techniques

Previous Page | Next Page »

Pages: 1 | 2

Test Use	18
Test Construction	8
Educational Assessment	7
Scores	6
Item Response Theory	5
Test Items	5
Elementary Secondary Education	4
Evaluation Methods	4
Higher Education	4
Measurement Techniques	4
Performance Based Assessment	4
Test Interpretation	4
Test Validity	4
Validity	4
Decision Making	3
Responses	3
Student Evaluation	3
Test Reliability	3
Test Results	3
Admission (School)	2
Cognitive Tests	2
College Entrance Examinations	2
Equated Scores	2
Generalizability Theory	2
Minority Groups	2
More ▼

Linn, Robert L.	3
Ackerman, Terry A.	1
Andrich, David	1
Baron, Joan Boykoff	1
Dunbar, Stephen B.	1
Feldt, Leonard S.	1
Flannery, Wm. Peter	1
Frisbie, David A.	1
Hambleton, Ronald K.	1
Kane, Michael	1
Klein, Stephen P.	1
Lederman, Josh	1
Mehrens, William A.	1
Mills, Craig N.	1
Popham, W. James	1
Quellmalz, Edys S.	1
Reise, Steven P.	1
Rudner, Lawrence M.	1
Stocking, Martha L.	1
Tatsuoka, Kikumi	1
Tognolini, Jim	1
More ▼