ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	14

Source

Measurement:…

Author

Alonzo, Alicia C.	1
Bejar, Isaac I.	1
Brandt, Steffen	1
Briggs, Derek C.	1
Cui, Ying	1
Gierl, Mark J.	1
Graf, E. Aurora	1
Hill, Heather C.	1
Huff, Kristen	1
Koretz, Daniel	1
Mislevy, Robert J.	1
Plake, Barbara S.	1
Schilling, Stephen	1
Shepard, Lorrie A.	1
Thissen, David	1
Walker, Michael E.	1
von Davier, Alina A.	1
More ▼

Publication Type

Journal Articles	14
Opinion Papers	11
Reports - Evaluative	2
Reports - Descriptive	1
Reports - Research	1

Education Level

Elementary Secondary Education

Audience

Location

United States	2
United Kingdom	1
United Kingdom (England)	1
United Kingdom (Wales)	1

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	1
National Assessment of…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Failing Tests: Commentary on "Adapting Educational Measurement to the Demands of Test-Based Accountability"

Peer reviewed

Direct link

Thissen, David – Measurement: Interdisciplinary Research and Perspectives, 2015

In "Adapting Educational Measurement to the Demands of Test-Based Accountability" Koretz takes the time-honored engineering approach to educational measurement, identifying specific problems with current practice and proposing minimal modifications of the system to alleviate those problems. In response to that article, David Thissen…

Descriptors: Educational Testing, Accountability, Testing Problems, Test Construction

Adapting Educational Measurement to the Demands of Test-Based Accountability

Peer reviewed

Direct link

Koretz, Daniel – Measurement: Interdisciplinary Research and Perspectives, 2015

Accountability has become a primary function of large-scale testing in the United States. The pressure on educators to raise scores is vastly greater than it was several decades ago. Research has shown that high-stakes testing can generate behavioral responses that inflate scores, often severely. I argue that because of these responses, using…

Descriptors: Accountability, Educational Testing, Test Construction, Test Validity

Why Lessons Learned from the Past Require Haertel's Expanded Scope for Test Validation

Peer reviewed

Direct link

Shepard, Lorrie A. – Measurement: Interdisciplinary Research and Perspectives, 2013

In his article, Haertel (this issue) asks a fundamental question about how use of a test is expected to cause improvements in the educational system and in learning. He also considers how test validity should be investigated and argues for a more expansive view of validity that does not stop with scoring or generalization (the more technical and…

Descriptors: Educational Testing, Test Validity, Test Results, Test Construction

Teacher Evaluation as Trojan Horse: The Case for Teacher-Developed Assessments

Peer reviewed

Direct link

Briggs, Derek C. – Measurement: Interdisciplinary Research and Perspectives, 2013

In his focus article "How Is Testing Supposed to Improve Schooling?" Ed Haertel distinguishes between seven uses of educational tests as a function of the intended action and what or who will be influenced by the intended action. He then applies Mike Kane's interpretive argument approach (Kane, 2006) as a basis for speculating about the validity…

Descriptors: Educational Testing, Accountability, Educational Improvement, Teacher Evaluation

Updating the Duplex Design for Test-Based Accountability in the Twenty-First Century

Peer reviewed

Direct link

Bejar, Isaac I.; Graf, E. Aurora – Measurement: Interdisciplinary Research and Perspectives, 2010

The duplex design by Bock and Mislevy for school-based testing is revisited and evaluated as a potential platform in test-based accountability assessments today. We conclude that the model could be useful in meeting the many competing demands of today's test-based accountability assessments, although many research questions will need to be…

Descriptors: Accountability, Educational Assessment, Educational Testing, Test Construction

Design under Constraints: The Case of Large-Scale Assessment Systems

Peer reviewed

Direct link

Mislevy, Robert J. – Measurement: Interdisciplinary Research and Perspectives, 2010

In "Updating the Duplex Design for Test-Based Accountability in the Twenty-First Century," Bejar and Graf (2010) propose extensions to the duplex design for large-scale assessment presented in Bock and Mislevy (1988). Examining the range of people who use assessment results--from students, teachers, administrators, curriculum designers,…

Descriptors: Measurement, Test Construction, Educational Testing, Data Collection

A Commentary on "Updating the Duplex Design for Test-Based Accountability in the Twenty-First Century"

Peer reviewed

Direct link

Brandt, Steffen – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's commentary on "Updating the Duplex Design for Test-Based Accountability in the Twenty-First Century," in which Isaac I. Bejar and E. Aurora Graf propose the application of a test design--the duplex design (which was proposed in 1988 by Bock and Mislevy) for application in current accountability assessments.…

Descriptors: Accountability, Educational Testing, Test Construction, Computer Assisted Testing

Considerations in Using Learning Progressions to Inform Achievement Level Descriptions

Peer reviewed

Direct link

Alonzo, Alicia C. – Measurement: Interdisciplinary Research and Perspectives, 2010

In their article "Innovations in Setting Performance Standards for K-12 Test-Based Accountability," Kristen Huff and Barbara S. Plake (2010) lay out three preconditions for continued investment in standard-setting methodology and practice, all focused on the sound development and use of achievement level descriptors (ALDs). Among these…

Descriptors: Standard Setting (Scoring), Achievement, Elementary Secondary Education, Accountability

Innovations in Setting Performance Standards for K-12 Test-Based Accountability

Peer reviewed

Direct link

Huff, Kristen; Plake, Barbara S. – Measurement: Interdisciplinary Research and Perspectives, 2010

Standard setting is a systematic process that uses a combination of judgmental and empirical procedures to make recommendations about where on the score continuum "cut scores" should be placed. Cut scores divide the score scale into categories consistent with the descriptions of student performance associated with multiple levels of achievement.…

Descriptors: Accountability, Educational Testing, Elementary Secondary Education, Standard Setting (Scoring)

Linking through Improved Design, Not Redefinition: Commentary on Newton

Peer reviewed

Direct link

Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010

"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…

Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques

Defining Characteristics of Diagnostic Classification Models and the Problem of Retrofitting in Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Gierl, Mark J.; Cui, Ying – Measurement: Interdisciplinary Research and Perspectives, 2008

One promising application of diagnostic classification models (DCM) is in the area of cognitive diagnostic assessment in education. However, the successful application of DCM in educational testing will likely come with a price--and this price may be in the form of new test development procedures and practices required to yield data that satisfy…

Descriptors: Educational Testing, Classification, Psychometrics, Test Construction

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

Direct link

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Validating the MKT Measures: Some Responses to the Commentaries

Peer reviewed

Direct link

Hill, Heather C. – Measurement: Interdisciplinary Research and Perspectives, 2007

The author offers some thoughts on commentator's reactions to the substance of the measures, particularly those about measuring teacher learning and change, based on the major uses of the measures, and because this is a significant challenge facing test development as an enterprise. If teacher learning results in more integrated knowledge or…

Descriptors: Educational Testing, Tests, Measurement, Faculty Development

Generalizability and Specificity of Interpretive Arguments: Observations Inspired by the Commentaries

Peer reviewed

Direct link

Schilling, Stephen – Measurement: Interdisciplinary Research and Perspectives, 2007

In this article, the author echoes his co-author's and colleague's pleasure (Hill, this issue) at the thoughtfulness and far-ranging nature of the comments to their initial attempts at test validation for the mathematical knowledge for teaching (MKT) measures using the validity argument approach. Because of the large number of commentaries they…

Descriptors: Generalizability Theory, Persuasive Discourse, Educational Testing, Measurement

Educational Testing	14
Test Construction	14
Accountability	7
Test Validity	6
Educational Assessment	5
Evaluation Methods	5
Psychometrics	5
Testing Problems	5
Measurement Techniques	4
Classification	3
Equated Scores	3
Measurement	3
Teacher Evaluation	3
Comparative Analysis	2
Content Validity	2
Definitions	2
Elementary Secondary Education	2
Evaluation Problems	2
Evaluation Research	2
Foreign Countries	2
High Stakes Tests	2
Knowledge Base for Teaching	2
Mathematics Education	2
Mathematics Instruction	2
Methods	2
More ▼