ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	15

Descriptor

Educational Testing	30
Reliability	30
Validity	25
Evaluation Methods	11
Student Evaluation	10
Scores	9
Test Construction	8
Educational Assessment	7
Elementary Secondary Education	6
Measurement	6
Academic Achievement	5
Accountability	5
High Stakes Tests	5
Models	5
Standardized Tests	5
Test Use	5
Achievement Tests	4
Criterion Referenced Tests	4
Educational Policy	4
Foreign Countries	4
Psychometrics	4
Test Interpretation	4
Test Validity	4
Testing Problems	4
Educational History	3
More ▼

Source

Educational Research	3
Educational Measurement:…	2
Assessment Update	1
ETS Research Report Series	1
Educational Testing Service	1
Educational and Psychological…	1
Journal of Applied Research…	1
Journal of Deaf Studies and…	1
Journal of Faculty Development	1
Journal of Technology,…	1
Measurement:…	1
Multivariate Behavioral…	1
NASSP Bulletin	1
Online Submission	1
ProQuest LLC	1
Theory Into Practice	1
More ▼

Publication Type

Journal Articles	16
Reports - Descriptive	8
Reports - Evaluative	6
Reports - Research	6
Speeches/Meeting Papers	4
Books	3
Opinion Papers	2
Dissertations/Theses -…	1
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
Legal/Legislative/Regulatory…	1
Reports - General	1
More ▼

Education Level

Elementary Secondary Education	5
Elementary Education	2
Higher Education	2
Adult Education	1
Grade 1	1
Grade 4	1
Grade 8	1
High Schools	1
Kindergarten	1
Postsecondary Education	1
Secondary Education	1
More ▼

Audience

Practitioners	3
Teachers	2
Administrators	1

Location

United Kingdom	3
New York	2
United States	2
United Kingdom (Great Britain)	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Race to the Top	1

Assessments and Surveys

Myers Briggs Type Indicator	1
Stanford Achievement Tests	1

What Works Clearinghouse Rating

Showing 1 to 15 of 30 results Save | Export

Digital Module 09: Sociocognitive Assessment for Diverse Populations

Peer reviewed

Direct link

Mislevy, Robert J.; Oliveri, Maria Elena – Educational Measurement: Issues and Practice, 2019

In this digital ITEMS module, Dr. Robert [Bob] Mislevy and Dr. Maria Elena Oliveri introduce and illustrate a sociocognitive perspective on educational measurement, which focuses on a variety of design and implementation considerations for creating fair and valid assessments for learners from diverse populations with diverse sociocultural…

Descriptors: Educational Testing, Reliability, Test Validity, Test Reliability

Do Adjusted Subscores Lack Validity? Don't Blame the Messenger

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J.; Wainer, Howard – Educational and Psychological Measurement, 2011

There are several techniques that increase the precision of subscores by borrowing information from other parts of the test. These techniques have been criticized on validity grounds in several of the recent publications. In this note, the authors question the argument used in these publications and suggest both inherent limits to the validity…

Descriptors: Scores, Methods, Validity, Reliability

Value of Value-Added Models Based on Student Outcomes to Evaluate Teaching

Peer reviewed

Direct link

Berk, Ronald A. – Journal of Faculty Development, 2016

Recently, student outcomes have bubbled to the top of debates about how to evaluate teaching in community and liberal arts colleges, universities, and professional schools, but even more international attention has been riveted on how outcomes are being used to evaluate teachers and administrators K-12 (Harris, 2012; Rowen & Raudenbush, 2016;…

Descriptors: Value Added Models, Academic Achievement, Outcomes of Education, Teacher Evaluation

Large-Scale Academic Achievement Testing of Deaf and Hard-of-Hearing Students: Past, Present, and Future

Peer reviewed

Direct link

Qi, Sen; Mitchell, Ross E. – Journal of Deaf Studies and Deaf Education, 2012

The first large-scale, nationwide academic achievement testing program using Stanford Achievement Test (Stanford) for deaf and hard-of-hearing children in the United States started in 1969. Over the past three decades, the Stanford has served as a benchmark in the field of deaf education for assessing student academic achievement. However, the…

Descriptors: Testing Programs, Educational Testing, Deafness, Academic Achievement

Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

Peer reviewed

Direct link

Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Multivariate Behavioral Research, 2010

Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…

Descriptors: Educational Testing, Scores, Reports, Psychometrics

Measuring Teaching Using Value-Added Modeling: The Imperfect Panacea

Peer reviewed

Direct link

Scherrer, Jimmy – NASSP Bulletin, 2011

The use of value-added modeling (VAM) in school accountability is expanding. However, trying to decide how to embrace VAM can be rather nettlesome. Some experts claim it is "too unreliable," causes "more harm than good," and has "a big margin for error," while other experts assert VAM is "imperfect, but…

Descriptors: Teacher Effectiveness, Accountability, Inferences, Validity

Score Comparability for Language Minority Students on the Content Assessments Used by Two States. Research Report. ETS RR-11-27

Download full text

Young, John W.; Holtzman, Steven; Steinberg, Jonathan – Educational Testing Service, 2011

In this research investigation of score comparability for language minority students (English language learners [ELLs] and former English language learners), we examined 3 indicators of score comparability (reliability, internal test structure, and differential item functioning) for 4th and 8th grade students who took the NCLB-mandated content…

Descriptors: Language Minorities, Second Language Learning, Grade 8, Minority Group Students

Reliability and Validity of Information about Student Achievement: Comparing Large-Scale and Classroom Testing Contexts

Peer reviewed

Direct link

Cizek, Gregory J. – Theory Into Practice, 2009

Reliability and validity are two characteristics that must be considered whenever information about student achievement is collected. However, those characteristics--and the methods for evaluating them--differ in large-scale testing and classroom testing contexts. This article presents the distinctions between reliability and validity in the two…

Descriptors: Academic Achievement, Validity, Measures (Individuals), Reliability

How Much Can We Reliably Know about What Examinees Know?

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.

Descriptors: Scoring, Reliability, Validity, Classification

Determining Validity in National Curriculum Assessments

Peer reviewed

Direct link

Stobart, Gordon – Educational Research, 2009

Background: Validity is a central concern in any assessment, though this has often not been made explicit in the UK assessment context. This article applies current validity theorising, largely derived from American formulations, to national curriculum assessments in England. Purpose: The aim is to consider validity arguments in relation to the…

Descriptors: National Curriculum, Foreign Countries, Elementary Secondary Education, Educational Policy

The Predictive Utility of Kindergarten Screening for Math Difficulty: How, when, and with Respect to What Outcome Should It Occur?

Direct link

Seethaler, Pamela M. – ProQuest LLC, 2008

The purpose of this study was to examine the reliability, validity, and predictive utility of 3 measures for screening kindergarten students for risk for math difficulty (MD). The screening measures assessed number sense and computational fluency, constructs central to typical early mathematical development. Conceptual and operational outcomes…

Descriptors: Prediction, Kindergarten, Grade 1, Predictive Validity

Subscores and Validity. Research Report. ETS RR-08-64

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2008

In educational testing, subscores may be provided based on a portion of the items from a larger test. One consideration in evaluation of such subscores is their ability to predict a criterion score. Two limitations on prediction exist. The first, which is well known, is that the coefficient of determination for linear prediction of the criterion…

Descriptors: Scores, Validity, Educational Testing, Correlation

English National Curriculum Assessment: A Commentary from the USA--Or Exhibiting Kindness to the Colonies

Peer reviewed

Direct link

Popham, W. James – Educational Research, 2009

Against a shifting set of assessment preferences in the US regarding whether educational assessment should continue to be a states rights game or become a federally dominated undertaking, the publication of five first-rate analyses about England's national curriculum assessment (NCA) is particularly propitious. Taken together, these five papers…

Descriptors: National Curriculum, States Powers, Educational Assessment, Foreign Countries

Controversies regarding the Nature of Score Validity: Still Crazy after All These Years.

Download full text

Gray, B. Thomas – 1997

Validity is a critically important issue with far-reaching implications for testing. The history of conceptualizations of validity over the past 50 years is reviewed, and 3 important areas of controversy are examined. First, the question of whether the three traditionally recognized types of validity should be integrated as a unitary entity of…

Descriptors: Educational Testing, Evaluation Methods, Reliability, Scores

A Technical Review of the Myers-Briggs Type Indicator(tm).

Download full text

Denham, Thomas J. – 2002

This paper describes the Myers-Briggs Type Indicator (MBTI), developed by I. Myers and K. Briggs (1940s) to assess personality type. Based on Jungian theory, the MBTI has become a tool for identifying the 16 different patterns of action into which every person fits. The 16 personality types are based on patterns of: (1) extraversion-introversion;…

Descriptors: Educational Testing, Personality Assessment, Personality Measures, Personality Traits

Previous Page | Next Page »

Pages: 1 | 2

Haberman, Shelby J.	4
Sinharay, Sandip	3
Attali, Yigal	1
Berk, Ronald A.	1
Brennan, Robert L.	1
Burstein, Jill	1
Cizek, Gregory J.	1
Dahl, Theodore	1
Denham, Thomas J.	1
Ebel, Robert L.	1
Erickson, Richard C.	1
Faraday, Sally	1
Gray, B. Thomas	1
Green, Sylvia	1
Harris, Richard	1
Holtzman, Steven	1
Johanningmeier, Erwin V.	1
Kadamus, James A.	1
Leitzel, Thomas C.	1
McCowan, Richard J.	1
McCowan, Sheila C.	1
Mislevy, Robert J.	1
Mitchell, Ross E.	1
Oates, Tim	1
Oliveri, Maria Elena	1
More ▼