ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	18

Descriptor

Educational Testing	30
Evaluation Methods	30
Evaluation Problems	30
Educational Assessment	19
Student Evaluation	17
Measurement	11
Psychometrics	11
Testing Problems	10
Elementary Secondary Education	9
Measurement Techniques	9
Test Validity	8
Diagnostic Tests	7
Educational Policy	7
Evaluation Criteria	7
Criterion Referenced Tests	6
Evaluation Research	6
Academic Standards	5
Accountability	5
Evidence	5
Item Response Theory	5
Models	5
Program Effectiveness	5
Teacher Evaluation	5
Academic Achievement	4
Classification	4
More ▼

Source

Measurement:…	6
Journal of Educational…	3
NHSA Dialog	2
American Educational Research…	1
Community College Research…	1
Journal of Curriculum Studies	1
National Center for Analysis…	1
National Center on…	1
Nelson A. Rockefeller…	1
Policy Futures in Education	1
Scholar-Practitioner Quarterly	1
Tennessee Department of…	1
Theory and Research in…	1
More ▼

Publication Type

Journal Articles	16
Opinion Papers	14
Reports - Descriptive	8
Reports - Evaluative	6
Reports - Research	5
Speeches/Meeting Papers	4
ERIC Digests in Full Text	2
ERIC Publications	2

Education Level

Elementary Secondary Education	9
Early Childhood Education	2
Elementary Education	2
Preschool Education	2
Secondary Education	2
Grade 3	1
Grade 4	1
Grade 5	1
Higher Education	1
Postsecondary Education	1
Two Year Colleges	1
More ▼

Audience

Policymakers	2
Practitioners	1
Researchers	1
Teachers	1

Location

Florida	2
Germany	1
Pennsylvania	1
Tennessee	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Individuals with Disabilities…	1
Race to the Top	1

Assessments and Surveys

Stanford Achievement Tests	3
Advanced Placement…	1
Florida Comprehensive…	1
National Assessment of…	1
Pediatric Evaluation of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 30 results Save | Export

Different Tests, Different Answers: The Stability of Teacher Value-Added Estimates across Outcome Measures

Peer reviewed

Direct link

Papay, John P. – American Educational Research Journal, 2011

Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…

Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests

Assessing Developmental Assessment in Community Colleges: A Review of the Literature. CCRC Working Paper No. 19

Download full text

Hughes, Katherine L.; Scott-Clayton, Judith – Community College Research Center, Columbia University, 2010

Placement exams are high-stakes assessments that determine many students' college trajectories. More than half of entering students at community colleges are placed into developmental education in at least one subject, based primarily on scores from these assessments, yet recent research fails to find evidence that placement into remediation…

Descriptors: Community Colleges, Remedial Instruction, Literature Reviews, High Stakes Tests

Monitoring Rater Performance over Time: A Framework for Detecting Differential Accuracy and Differential Scale Category Use

Peer reviewed

Direct link

Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009

In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…

Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)

Judges' Use of Examinee Performance Data in an Angoff Standard-Setting Exercise for a Medical Licensing Examination: An Experimental Study

Peer reviewed

Direct link

Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009

Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…

Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

Diagnostic Models as Partially Ordered Sets

Peer reviewed

Direct link

Tatsuoka, Curtis – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the author addresses what is referred to as the deterministic input, noisy "and" gate (DINA) model. The author mentions concerns with how this model has been formulated and presented. In particular, the author points out that there is a lack of recognition of the confounding of profiles that generally arises and then discusses…

Descriptors: Test Items, Classification, Psychometrics, Item Response Theory

Equivalent Diagnostic Classification Models

Peer reviewed

Direct link

Maris, Gunter; Bechger, Timo – Measurement: Interdisciplinary Research and Perspectives, 2009

Rupp and Templin (2008) do a good job at describing the ever expanding landscape of Diagnostic Classification Models (DCM). In many ways, their review article clearly points to some of the questions that need to be answered before DCMs can become part of the psychometric practitioners toolkit. Apart from the issues mentioned in this article that…

Descriptors: Factor Analysis, Classification, Psychometrics, Item Response Theory

Teacher Evaluation in Tennessee: A Report on Year 1 Implementation

Download full text

Tennessee Department of Education, 2012

In the summer of 2011, the Tennessee Department of Education contracted with the National Institute for Excellence in Teaching (NIET) to provide a four-day training for all evaluators across the state. NIET trained more than 5,000 evaluators intensively in the state model (districts using alternative instruments delivered their own training).…

Descriptors: Video Technology, Feedback (Response), Evaluators, Interrater Reliability

How Much Can We Reliably Know about What Examinees Know?

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.

Descriptors: Scoring, Reliability, Validity, Classification

Diagnostic Classification Models and Multidimensional Adaptive Testing: A Commentary on Rupp and Templin

Peer reviewed

Direct link

Frey, Andreas; Carstensen, Claus H. – Measurement: Interdisciplinary Research and Perspectives, 2009

On a general level, the objective of diagnostic classifications models (DCMs) lies in a classification of individuals regarding multiple latent skills. In this article, the authors show that this objective can be achieved by multidimensional adaptive testing (MAT) as well. The authors discuss whether or not the restricted applicability of DCMs can…

Descriptors: Adaptive Testing, Test Items, Classification, Psychometrics

Authentic Assessment in Action: A "R-E-A-L" Solution

Peer reviewed

Direct link

Bagnato, Stephen J.; Macy, Marisa – NHSA Dialog, 2010

Authentic assessment is a growing alternative to conventional testing. This research-to-practice article describes a framework for implementing authentic assessment. The R-E-A-L framework shows how roles, equipment, assessment tools, and location can be incorporated into early childhood practices.

Descriptors: Early Childhood Education, Performance Based Assessment, Program Implementation, Guidelines

Keeping It "R-E-A-L" with Authentic Assessment

Peer reviewed

Direct link

Macy, Marisa; Bagnato, Stephen J. – NHSA Dialog, 2010

The inclusion of young children with disabilities has remained a function of the Head Start program since its inception in the 1960s when the United States Congress mandated that children with disabilities comprise 10% of the Head Start enrollment (Zigler & Styfco, 2000). Standardized, norm-referenced tests used to identify children with…

Descriptors: Performance Based Assessment, Disadvantaged Youth, Norm Referenced Tests, Disabilities

Holding Accountability to Account. Research Brief

Direct link

National Center on Performance Incentives, 2008

In "Holding Accountability to Account: How Scholarship and Experience in Other Fields Inform Exploration of Performance Incentives in Education"--a paper presented at the National Center on Performance Incentives research to policy conference in February--Richard Rothstein, a research associate at the Economic Policy Institute, argues educational…

Descriptors: Private Sector, Incentives, Rewards, Accountability

What Makes for a Good Teacher and Who Can Tell? Working Paper 30

Download full text

Harris, Douglas N.; Sass, Tim R. – National Center for Analysis of Longitudinal Data in Education Research, 2009

Mounting pressure in the policy arena to improve teacher productivity either by improving signals that predict teacher performance or through creating incentive contracts based on performance--has spurred two related questions: Are there important determinants of teacher productivity that are not captured by teacher credentials but that can be…

Descriptors: Credentials, Teacher Effectiveness, Teaching Skills, Principals

The Difficulty of the Educational Task.

Download full text

Cooley, William W. – 1993

Comparison of student test scores between states, school districts, and even schools continues to be a popular measure of student achievement. However, these comparisons reveal little about the quality or effectiveness of educational programs, only the varying difficulty of educating different populations of students. This report uses U.S. Census…

Descriptors: Academic Achievement, Census Figures, Educational Diagnosis, Educational Testing

Previous Page | Next Page »

Pages: 1 | 2

Bagnato, Stephen J.	2
Bielinski, John	2
Macy, Marisa	2
Minnema, Jane	2
Thurlow, Martha	2
Armour-Garb, Allison, Ed.	1
Ascher, Carol	1
Baker, Eva L.	1
Baldwin, Su G.	1
Bechger, Timo	1
Carstensen, Claus H.	1
Clauser, Brian E.	1
Cooley, William W.	1
Cui, Ying	1
Dillon, Gerard F.	1
Frey, Andreas	1
Garrison, Mark J.	1
Haberman, Shelby J.	1
Harris, Douglas N.	1
Hill, Heather C.	1
Hughes, Katherine L.	1
Leighton, Jacqueline P.	1
Margolis, Melissa J.	1
Maris, Gunter	1
Mee, Janet	1
More ▼