ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	12

Source

National Center for Research…	7
Educational Assessment	5
Educational Leadership	1
Educational Technology	1
Educational and Psychological…	1
Journal for the Education of…	1
Journal of Educational…	1
Review of Research in…	1
Teachers College Record	1
Yearbook of the National…	1

Publication Type

Reports - Evaluative	14
Journal Articles	12
Reports - Research	8
Information Analyses	6
Reports - Descriptive	4
Speeches/Meeting Papers	4
Guides - Non-Classroom	3
Opinion Papers	3
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	4
Grade 6	1
Grade 7	1
Grade 8	1
High Schools	1
Middle Schools	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Baker, Eva L. X

Showing 1 to 15 of 32 results Save | Export

The Chimera of Validity

Peer reviewed

Direct link

Baker, Eva L. – Teachers College Record, 2013

Background/Context: Education policy over the past 40 years has focused on the importance of accountability in school improvement. Although much of the scholarly discourse around testing and assessment is technical and statistical, understanding of validity by a non-specialist audience is essential as long as test results drive our educational…

Descriptors: Validity, Educational Assessment, Accountability, Educational Improvement

Assessment Gaze, Refraction, and Blur: The Course of Achievement Testing in the Past 100 Years

Peer reviewed

Direct link

Baker, Eva L.; Chung, Gregory K. W. K.; Cai, Li – Review of Research in Education, 2016

This chapter addresses assessment (testing) with an emphasis on the 100-year period since the American Education Research Association was formed. The authors start with definitions and explanations of contemporary tests. They then look backward into the 19th century to significant work by Horace Mann and Herbert Spencer, who engendered two…

Descriptors: Achievement Tests, Educational History, Testing, Educational Assessment

Validating Measures of Algebra Teacher Subject Matter Knowledge and Pedagogical Content Knowledge. CRESST Report 820

Download full text

Buschang, Rebecca E.; Chung, Gregory K. W. K.; Delacruz, Girlie C.; Baker, Eva L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2012

The purpose of this study was to validate inferences about scores of one task designed to measure subject matter knowledge and three tasks designed to measure aspects of pedagogical content knowledge. Evidence for the validity of inferences was based on two expectations. First, if tasks were sensitive to expertise, we would find group differences.…

Descriptors: Validity, Measures (Individuals), Test Interpretation, Algebra

Validating Measures of Algebra Teacher Subject Matter Knowledge and Pedagogical Content Knowledge

Peer reviewed

Direct link

Buschang, Rebecca E.; Chung, Gregory K. W. K.; Delacruz, Girlie C.; Baker, Eva L. – Educational Assessment, 2012

Descriptors: Algebra, Mathematics Teachers, Teacher Characteristics, Knowledge Base for Teaching

Validity Evidence for Games as Assessment Environments. CRESST Report 773

Download full text

Delacruz, Girlie C.; Chung, Gregory K. W. K.; Baker, Eva L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2010

This study provides empirical evidence of a highly specific use of games in education--the assessment of the learner. Linear regressions were used to examine the predictive and convergent validity of a math game as assessment of mathematical understanding. Results indicate that prior knowledge significantly predicts game performance. Results also…

Descriptors: Educational Games, Validity, Prior Learning, Scores

What Probably Works in Alternative Assessment. CRESST Report 772

Download full text

Baker, Eva L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2010

This report provides an overview of what was known about alternative assessment at the time that the article was written in 1991. Topics include beliefs about assessment reform, overview of alternative assessment including research knowledge, evidence of assessment impact, and critical features of alternative assessment. The author notes that in…

Descriptors: Alternative Assessment, Evaluation Methods, Evaluation Research, Performance Based Assessment

Review of Rifle Marksmanship Training Research. CRESST Report 783

Download full text

Chung, Gregory K. W. K.; Nagashima, Sam O.; Delacruz, Girlie C.; Lee, John J.; Wainess, Richard; Baker, Eva L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2011

The UCLA National Center for Research on Evaluation, Standards, and Student Testing (CRESST) is under contract from the Naval Postgraduate School (NPS) to conduct research on assessment models and tools designed to support Marine Corps rifle marksmanship. In this deliverable, we first review the literature on known-distance rifle marksmanship…

Descriptors: Weapons, Psychomotor Skills, Computer Software, Military Personnel

Assessment of Rifle Marksmanship Skill Using Sensor-Based Measures. CRESST Report 755

Download full text

Nagashima, Sam O.; Chung, Gregory K. W. K.; Espinosa, Paul D.; Berka, Chris; Baker, Eva L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2009

The goal of this report was to test the use of sensor-based skill measures in evaluating performance differences in rifle marksmanship. Ten shots were collected from 30 novices and 9 experts. Three measures for breath control and one for trigger control were used to predict skill classification. The data were fitted with a logistic regression…

Descriptors: Weapons, Classification, Lasers, Models

Moving to the Next Generation System Design: Integrating Cognition, Assessment, and Learning. CSE Report 706

Download full text

Baker, Eva L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2007

This paper will describe the relationships between research on learning and its application in assessment models and operational systems. These have been topics of research at the National Center for Research on Evaluation, Standards, and Student Testing (CRESST) for more than 20 years and form a significant part of the intellectual foundation of…

Descriptors: Educational Testing, Inferences, Hypothesis Testing, Predictive Validity

Model-Based Assessments to Support Learning and Accountability: The Evolution of CRESST's Research on Multiple-Purpose Measures

Peer reviewed

Direct link

Baker, Eva L. – Educational Assessment, 2007

This article describes the history, evidence warrants, and evolution of the Center for Research on Evaluation, Standards, and Student Testing's (CRESST) model-based assessments. It considers alternative interpretations of scientific or practical models and illustrates how model-based assessment addresses both definitions. The components of the…

Descriptors: Educational Testing, Computer Assisted Testing, Validity, Test Construction

Policy and Validity Prospects for Performance-Based Assessment.

Peer reviewed

Baker, Eva L.; And Others – Journal for the Education of the Gifted, 1994

This article describes performance-based assessment as expounded by its proponents, comments on these conceptions, reviews evidence regarding the technical quality of performance-based assessment, and considers its validity under various policy options. (JDD)

Descriptors: Educational Change, Educational Policy, Elementary Secondary Education, Evaluation Methods

Relationships among Measures as Empirical Evidence of Validity: Incorporating Multiple Indicators of Achievement and School Context

Peer reviewed

Direct link

Goldschmidt, Pete; Martinez, Jose Felipe; Niemi, David; Baker, Eva L. – Educational Assessment, 2007

In this article we examine empirical evidence on the criterion, predictive, transfer, and fairness aspects of validity of a large-scale language arts performance assessment, referred to as the Performance Assignment (PA). We use multilevel models to avoid biased inferences that might result from the naturally nested data. Specifically, we examine…

Descriptors: Language Arts, Performance Based Assessment, Academic Achievement, Performance Tests

Validity Issues for Accountability Systems. CSE Technical Report.

Download full text

Baker, Eva L.; Linn Robert L. – 2002

This report analyzes the validity issues that arise in the context of educational accountability systems. The report addresses validity from three interlocking perspectives. The first explores the theory of action underlying accountability provisions. It considers problems emerging from the distance between aspirations for accountability in…

Descriptors: Accountability, Educational Assessment, Educational Change, Educational Testing

Beyond Objectives: Domain-Referenced Tests for Evaluation and Instructional Improvement

Baker, Eva L. – Educational Technology, 1974

Descriptors: Behavioral Objectives, Evaluation Criteria, Evaluation Methods, Instructional Improvement

Qualitative Analysis of Test Item Attributes for Domain Referenced Content Validity Judgments. Studies in Measurement and Methodology, Work Unit 1: Design and Use of Tests.

Download full text

Polin, Linda; Baker, Eva L. – 1979

This paper presents the interim results of a set of studies undertaken to develop a much needed methodology for establishing content validity in domain-referenced achievement tests. The study results are presented in the context of the larger issue of the improvement of test design. School teachers, administrators and graduate students were…

Descriptors: Achievement Tests, Criterion Referenced Tests, Elementary Secondary Education, Evaluation Methods

Previous Page | Next Page »

Pages: 1 | 2 | 3

Test Validity	20
Elementary Secondary Education	15
Evaluation Methods	11
Test Construction	11
Validity	11
Educational Assessment	10
Test Use	9
Performance Based Assessment	7
Achievement Tests	6
Educational Policy	6
Accountability	5
Criterion Referenced Tests	5
Models	5
Student Evaluation	5
Test Reliability	5
Educational Technology	4
Educational Testing	4
Evaluation Criteria	4
High School Students	4
Scoring	4
Test Bias	4
Testing	4
High Schools	3
Higher Education	3
Inferences	3
More ▼

Baker, Eva L.	32
Chung, Gregory K. W. K.	7
Delacruz, Girlie C.	4
Buschang, Rebecca E.	2
Nagashima, Sam O.	2
Abedi, Jamal	1
Berka, Chris	1
Bewley, William L.	1
Brill, David G.	1
Burstein, Leigh	1
Cai, Li	1
Espinosa, Paul D.	1
Goldschmidt, Pete	1
Harris, Elizabeth Lewis	1
Koretz, Daniel	1
Lee, John J.	1
Linn Robert L.	1
Linn, Robert	1
Linn, Robert L.	1
Martinez, Jose Felipe	1
Niemi, David	1
Novak, John	1
O'Neil, Harold F.	1
O'Neil, Harold F., Jr.	1
More ▼