ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	33

Descriptor

Educational Testing	51
Evaluation Problems	51
Evaluation Methods	30
Student Evaluation	27
Educational Assessment	26
Educational Policy	16
Testing Problems	16
Measurement	15
Academic Achievement	13
Elementary Secondary Education	13
Measurement Techniques	12
Program Effectiveness	12
Psychometrics	12
Test Validity	11
Correlation	8
Evaluation Criteria	8
Evaluation Research	8
Evidence	8
Models	8
Accountability	7
Criterion Referenced Tests	7
Diagnostic Tests	7
Foreign Countries	7
Longitudinal Studies	7
Teacher Evaluation	7
More ▼

Publication Type

Journal Articles	27
Opinion Papers	16
Reports - Evaluative	16
Reports - Research	12
Reports - Descriptive	11
Speeches/Meeting Papers	8
ERIC Digests in Full Text	2
ERIC Publications	2
Numerical/Quantitative Data	2

Education Level

Elementary Secondary Education	21
Secondary Education	6
Elementary Education	4
Higher Education	4
Postsecondary Education	3
Early Childhood Education	2
Grade 3	2
Grade 4	2
Grade 5	2
Preschool Education	2
Adult Education	1
High Schools	1
Junior High Schools	1
Two Year Colleges	1
More ▼

Audience

Policymakers	2
Practitioners	2
Teachers	2
Administrators	1
Researchers	1

Location

Florida	5
California	2
Illinois	2
New Jersey	2
New York	2
North Carolina	2
Tennessee	2
Texas	2
Arizona	1
Australia	1
Canada	1
Colorado	1
Delaware	1
Denmark	1
Germany	1
Ghana	1
Idaho	1
Indiana	1
Japan	1
Kansas	1
Maine	1
Maryland	1
Massachusetts	1
Michigan	1
Minnesota	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Individuals with Disabilities…	1
Race to the Top	1

Assessments and Surveys

Stanford Achievement Tests	3
Florida Comprehensive…	2
National Assessment of…	2
ACT Assessment	1
Advanced Placement…	1
Graduate Record Examinations	1
Iowa Tests of Educational…	1
National Teacher Examinations	1
North Carolina End of Course…	1
Pediatric Evaluation of…	1

What Works Clearinghouse Rating

Does not meet standards

Showing 1 to 15 of 51 results Save | Export

Continuing a Culture of Evidence: Assessment for Improvement. Research Report. ETS RR-17-08

Peer reviewed
PDF on ERIC

Download full text

Russell, Javarro; Markle, Ross – ETS Research Report Series, 2017

From 2006 to 2008, Educational Testing Service (ETS) produced a series of reports titled "A Culture of Evidence," designed to capture a changing climate in higher education assessment. A decade later, colleges and universities already face new and different challenges resulting from societal, technological, and scientific influences.…

Descriptors: Evidence Based Practice, Evidence, Educational Testing, Educational Improvement

Social Epistemology and the Pragmatics of Assessment

Peer reviewed

Direct link

Gergen, Kenneth J.; Dixon-Román, Ezekiel J. – Teachers College Record, 2014

In the present offering we challenge the presumption that the educational testing of students provides objective information about such students. This presumption largely rests on an empiricist account of science. In light of mounting criticism, however, empiricist foundationalism has given way to a social epistemology. From this standpoint,…

Descriptors: Epistemology, Educational Testing, Test Validity, Evaluation Utilization

The Leading Group Effect: Illusionary Declines in Scholastic Standard Scores of Mid-Range Japanese Junior High School Pupils

Peer reviewed

Direct link

Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012

Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…

Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems

Test Development with Performance Standards and Achievement Growth in Mind

Peer reviewed

Direct link

Ferrara, Steve; Svetina, Dubravka; Skucha, Sylvia; Davidson, Anne H. – Educational Measurement: Issues and Practice, 2011

Items on test score scales located at and below the Proficient cut score define the content area knowledge and skills required to achieve proficiency. Alternately, examinees who perform at the Proficient level on a test can be expected to be able to demonstrate that they have mastered most of the knowledge and skills represented by the items at…

Descriptors: Knowledge Level, Mathematics Tests, Program Effectiveness, Inferences

Different Tests, Different Answers: The Stability of Teacher Value-Added Estimates across Outcome Measures

Peer reviewed

Direct link

Papay, John P. – American Educational Research Journal, 2011

Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…

Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests

Assessing Developmental Assessment in Community Colleges: A Review of the Literature. CCRC Working Paper No. 19

Download full text

Hughes, Katherine L.; Scott-Clayton, Judith – Community College Research Center, Columbia University, 2010

Placement exams are high-stakes assessments that determine many students' college trajectories. More than half of entering students at community colleges are placed into developmental education in at least one subject, based primarily on scores from these assessments, yet recent research fails to find evidence that placement into remediation…

Descriptors: Community Colleges, Remedial Instruction, Literature Reviews, High Stakes Tests

Monitoring Rater Performance over Time: A Framework for Detecting Differential Accuracy and Differential Scale Category Use

Peer reviewed

Direct link

Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009

In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…

Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)

Judges' Use of Examinee Performance Data in an Angoff Standard-Setting Exercise for a Medical Licensing Examination: An Experimental Study

Peer reviewed

Direct link

Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009

Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…

Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

Diagnostic Models as Partially Ordered Sets

Peer reviewed

Direct link

Tatsuoka, Curtis – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the author addresses what is referred to as the deterministic input, noisy "and" gate (DINA) model. The author mentions concerns with how this model has been formulated and presented. In particular, the author points out that there is a lack of recognition of the confounding of profiles that generally arises and then discusses…

Descriptors: Test Items, Classification, Psychometrics, Item Response Theory

Equivalent Diagnostic Classification Models

Peer reviewed

Direct link

Maris, Gunter; Bechger, Timo – Measurement: Interdisciplinary Research and Perspectives, 2009

Rupp and Templin (2008) do a good job at describing the ever expanding landscape of Diagnostic Classification Models (DCM). In many ways, their review article clearly points to some of the questions that need to be answered before DCMs can become part of the psychometric practitioners toolkit. Apart from the issues mentioned in this article that…

Descriptors: Factor Analysis, Classification, Psychometrics, Item Response Theory

Teacher Evaluation in Tennessee: A Report on Year 1 Implementation

Download full text

Tennessee Department of Education, 2012

In the summer of 2011, the Tennessee Department of Education contracted with the National Institute for Excellence in Teaching (NIET) to provide a four-day training for all evaluators across the state. NIET trained more than 5,000 evaluators intensively in the state model (districts using alternative instruments delivered their own training).…

Descriptors: Video Technology, Feedback (Response), Evaluators, Interrater Reliability

How Much Can We Reliably Know about What Examinees Know?

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.

Descriptors: Scoring, Reliability, Validity, Classification

New Estimates of Design Parameters for Clustered Randomization Studies: Findings from North Carolina and Florida. Working Paper 43

Download full text

Xu, Zeyu; Nichols, Austin – National Center for Analysis of Longitudinal Data in Education Research, 2010

The gold standard in making causal inference on program effects is a randomized trial. Most randomization designs in education randomize classrooms or schools rather than individual students. Such "clustered randomization" designs have one principal drawback: They tend to have limited statistical power or precision. This study aims to…

Descriptors: Test Format, Reading Tests, Norm Referenced Tests, Research Design

Using Value-Added Measures of Teacher Quality. Brief 9

Download full text

Hanushek, Eric A.; Rivkin, Steven G. – National Center for Analysis of Longitudinal Data in Education Research, 2010

Extensive education research on the contribution of teachers to student achievement produces two generally accepted results. First, teacher quality varies substantially as measured by the value added to student achievement or future academic attainment or earnings. Second, variables often used to determine entry into the profession and…

Descriptors: Credentials, Teacher Effectiveness, Models, Teacher Qualifications

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Measurement:…	6
National Center for Analysis…	4
Journal of Educational…	3
NHSA Dialog	2
Alberta Journal of…	1
American Educational Research…	1
British Educational Research…	1
Community College Research…	1
ETS Research Report Series	1
Education and the Public…	1
Educational Measurement:…	1
European Physical Education…	1
Journal of Curriculum Studies	1
National Center for…	1
National Center on…	1
National Research Center on…	1
Nelson A. Rockefeller…	1
Online Submission	1
Policy Futures in Education	1
Research in Education	1
Review of Research in…	1
Scholar-Practitioner Quarterly	1
Studies in Educational…	1
TESL Canada Journal	1
Teachers College Record	1
More ▼

Bagnato, Stephen J.	2
Bielinski, John	2
Macy, Marisa	2
Minnema, Jane	2
Thurlow, Martha	2
Adkins, Deborah	1
Armour-Garb, Allison, Ed.	1
Ascher, Carol	1
Baker, Eva L.	1
Baldwin, Su G.	1
Bechger, Timo	1
Beresford, Lauren	1
Boyd, Donald	1
Bridget Terry Long	1
Carstensen, Claus H.	1
Chavez, Oscar	1
Cheng, Liying	1
Clauser, Brian E.	1
Collins, Dave	1
Cooley, William W.	1
Cronin, John	1
Cui, Ying	1
Dahlin, Michael	1
Davidson, Anne H.	1
Dillon, Gerard F.	1
More ▼