Lederman, Josh – Applied Measurement in Education, 2023
Given its centrality to assessment, until the concept of validity includes concern for racial justice, such matters will be seen as residing outside the "real" work of validation, rendering them powerless to count against the apparent scientific merit of the test. As the definition of validity has evolved, however, it holds great…
Descriptors: Educational Assessment, Validity, Social Justice, Race

Ketterlin-Geller, Leanne R.; Perry, Lindsey; Adams, Elizabeth – Applied Measurement in Education, 2019
Despite the call for an argument-based approach to validity over 25 years ago, few examples exist in the published literature. One possible explanation for this outcome is that the complexity of the argument-based approach makes implementation difficult. To counter this claim, we propose that the Assessment Triangle can serve as the overarching…
Descriptors: Validity, Educational Assessment, Models, Screening Tests

Feldt, Leonard S. – Applied Measurement in Education, 1997
It has often been asserted that the reliability of a measure places an upper limit on its validity. This article demonstrates in theory that validity can rise when reliability declines, even when validity evidence is a correlation with an acceptable criterion. Whether empirical examples can actually be found is an open question. (SLD)
Descriptors: Correlation, Criteria, Reliability, Test Construction

Rudner, Lawrence M.; And Others – Applied Measurement in Education, 1996
An analysis of data from the 1990 National Assessment of Educational Progress Trial State Assessment suggests that person-fit statistics may not provide additional information about results of psychometrically strong achievement tests. More research is needed before person-fit statistics can be used routinely in analysis of item response data.…
Descriptors: Achievement Tests, Individual Differences, Item Response Theory, Psychometrics

Mehrens, William A. – Applied Measurement in Education, 2000
Presents conclusions of an independent measurement expert that the Texas Assessment of Academic Skills (TAAS) was constructed according to acceptable professional standards and tests curricular material considered by the Texas Board of Education important for graduates to have mastered. Also supports the validity and reliability of the TAAS and…
Descriptors: Curriculum, Psychometrics, Reliability, Standards

Reise, Steven P.; Flannery, Wm. Peter – Applied Measurement in Education, 1996
Statistical and theoretical issues that arise from assessing person-fit on measures of typical performance are discussed, including the frequent attenuation of detection of person-misfit, the need for methods of identifying sources of response aberrancy, and person-fit measures as moderators of trait-criterion relations. (SLD)
Descriptors: Item Response Theory, Measurement Techniques, Performance, Responses

Goodman, Dean P.; Hambleton, Ronald K. – Applied Measurement in Education, 2004
A critical, but often neglected, component of any large-scale assessment program is the reporting of test results. In the past decade, a body of evidence has been compiled that raises concerns over the ways in which these results are reported to and understood by their intended audiences. In this study, current approaches for reporting…
Descriptors: Test Results, Student Evaluation, Scores, Testing Programs

Kane, Michael – Applied Measurement in Education, 1996
This overview of the role of error and tolerance for error in measurement asserts that the generic precision associated with a measurement procedure is defined as the root mean square error, or standard error, in some relevant population. This view of precision is explored in several applications of measurement. (SLD)
Descriptors: Error of Measurement, Error Patterns, Generalizability Theory, Measurement Techniques

Tatsuoka, Kikumi – Applied Measurement in Education, 1996
Application of person-fit statistics to cognitive diagnosis requires special efforts to detect normal and usual response patterns resulting from sources of misconception that are frequently observed among students. This study shows a solution for the problem by introducing an extension of a person-fit statistic developed by K. Tatsuoka (1985).…
Descriptors: Classification, Cognitive Tests, Diagnostic Tests, Item Response Theory

Phillips, S. E. – Applied Measurement in Education, 2000
Discusses the plaintiffs' arguments, the state's responses, and the specific findings in the context of major themes of "GI Forum v. Texas Education Agency," which found that the Texas graduation test was valid and reliable and met applicable professional standards, including notice and opportunity to learn. (SLD)
Descriptors: Court Litigation, Graduation Requirements, High School Students, High Schools

Schafer, William D. – Applied Measurement in Education, 2000
Draws seven conclusions for professionals who administer state assessment programs from the "GI Forum v. Texas Education Agency" ruling. These conclusions are grouped into observations about test development and observations about test use. Discusses some implications for test use in other states. (SLD)
Descriptors: Court Litigation, High School Students, High Schools, State Programs

Ward, Cynthia A. – Applied Measurement in Education, 2000
Discusses the implications of the "GI Forum v. Texas Education Agency" (2000) decision supporting the use of the Texas Assessment of Academic Skills for other state assessment programs. Notes that the legal success of the state test in Texas was not due merely to good fortune, but to the test's adherence to legally defensible principles.…
Descriptors: Achievement Tests, Court Litigation, Elementary Secondary Education, Legal Problems

Ackerman, Terry A. – Applied Measurement in Education, 1994
When item response data do not satisfy the unidimensionality assumption, multidimensional item response theory (MIRT) should be used to model the item-examinee interaction. This article presents and discusses MIRT analyses designed to give better insight into what individual items are measuring. (SLD)
Descriptors: Evaluation Methods, Item Response Theory, Measurement Techniques, Models

Crone, Linda J.; And Others – Applied Measurement in Education, 1994
Scores from 324 Louisiana schools on the Louisiana Graduation Exit Examination and a within-school split sample of 255 schools indicate that a single subject or grade provides a less consistent and more narrow perspective on school effectiveness than a subcomposite made up of 2 subject areas. (SLD)
Descriptors: Classification, Effective Schools Research, Elementary Secondary Education, Exit Examinations

Brookhart, Susan M.; Durkin, Daniel T. – Applied Measurement in Education, 2003
Studied classroom assessment events in high school social studies classes using data from 12 assessment events in classes of a teacher researcher. Results for the 96 students involved support the conclusion that student perceptions of the task and self-efficacy, reported mental effort invested, goal orientation, and learning strategy use differed…
Descriptors: Academic Achievement, High School Students, High Schools, Learning Strategies