Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 7 |
Descriptor
Source
Applied Measurement in… | 12 |
Author
Publication Type
Journal Articles | 12 |
Reports - Research | 8 |
Reports - Evaluative | 4 |
Education Level
Secondary Education | 2 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 11 | 1 |
Grade 8 | 1 |
High Schools | 1 |
Higher Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Audience
Location
New Hampshire | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Perez, Alexandra Lane; Evans, Carla – Applied Measurement in Education, 2023
New Hampshire's Performance Assessment of Competency Education (PACE) innovative assessment system uses student scores from classroom performance assessments as well as other classroom tests for school accountability purposes. One concern is that not having annual state testing may incentivize schools and teachers away from teaching the breadth of…
Descriptors: Grade 8, Competency Based Education, Evaluation Methods, Educational Innovation
Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data
Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024
Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…
Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Applied Measurement in Education, 2011
The synthetic function is a weighted average of the identity (the linking function for forms that are known to be completely parallel) and a traditional equating method. The purpose of the present study was to investigate the benefits of the synthetic function on small-sample equating using various real data sets gathered from different…
Descriptors: Testing Programs, Equated Scores, Investigations, Data Analysis
Swerdzewski, Peter J.; Harmes, J. Christine; Finney, Sara J. – Applied Measurement in Education, 2011
Many universities rely on data gathered from tests that are low stakes for examinees but high stakes for the various programs being assessed. Given the lack of consequences associated with many collegiate assessments, the construct-irrelevant variance introduced by unmotivated students is potentially a serious threat to the validity of the…
Descriptors: Computer Assisted Testing, Student Motivation, Inferences, Universities
Puhan, Gautam – Applied Measurement in Education, 2009
The purpose of this study is to determine the extent of scale drift on a test that employs cut scores. It was essential to examine scale drift for this testing program because new forms in this testing program are often put on scale through a series of intermediate equatings (known as equating chains). This process may cause equating error to…
Descriptors: Testing Programs, Testing, Measurement Techniques, Item Response Theory
Shavelson, Richard J.; Young, Donald B.; Ayala, Carlos C.; Brandon, Paul R.; Furtak, Erin Marie; Ruiz-Primo, Maria Araceli; Tomita, Miki K.; Yin, Yue – Applied Measurement in Education, 2008
Assessment of and for learning has occupied center stage in education reform, especially with the advent of the No Child Left Behind Federal legislation. This study examined the formative function of assessment--assessment for learning--recognizing that such assessment needs to be aligned, at least in part, with the summative function of…
Descriptors: Federal Legislation, Formative Evaluation, Program Effectiveness, Educational Change

Hardy, Roy A. – Applied Measurement in Education, 1995
Cost factors associated with the development, administration, and scoring of performance assessment tasks are examined in the context of a statewide or other large-scale assessment program. Resources of money, time, and expertise are discussed. (SLD)
Descriptors: Cost Estimates, Costs, Educational Assessment, Estimation (Mathematics)

Aschbacher, Pamela R. – Applied Measurement in Education, 1991
The University of California's (Los Angeles) Center for Research on Evaluation, Standards, and Student Testing survey of state assessment directors reveals that about 25 states currently study or develop performance assessments. Obstacles to statewide use of performance assessments were expressed. The new Student Assessment Exchange should…
Descriptors: Accountability, Cost Effectiveness, Educational Assessment, Educational Improvement
Goodman, Dean P.; Hambleton, Ronald K. – Applied Measurement in Education, 2004
A critical, but often neglected, component of any large-scale assessment program is the reporting of test results. In the past decade, a body of evidence has been compiled that raises concerns over the ways in which these results are reported to and understood by their intended audiences. In this study, current approaches for reporting…
Descriptors: Test Results, Student Evaluation, Scores, Testing Programs
Zhang, Zhicheng; Burry-Stock, Judith A. – Applied Measurement in Education, 2003
This study investigates teachers' assessment practices across teaching levels and content areas, as well as teachers' self-perceived assessment skills as a function of teaching experience and measurement training. Data from 297 teachers on the Assessment Practices Inventory were analyzed in a MANOVA design. As grade level increases, teachers rely…
Descriptors: Teaching Experience, Standardized Tests, Objective Tests, Instructional Improvement

Martinez, Michael E. – Applied Measurement in Education, 1993
Figural response (FR) items in architecture were compared with multiple-choice (MC) counterparts for their ability to predict architectural problem-solving proficiency of 33 practicing architects, 34 architecture interns, and 53 architecture students. Although both FR and MC predicted verbal design problem solving, only FR scores predicted…
Descriptors: Architects, Architectural Drafting, College Students, Comparative Analysis