ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	11

Descriptor

Educational Testing	15
Psychological Testing	15
Scores	15
Validity	5
Psychology	4
Test Use	4
Definitions	3
Evidence	3
Inferences	3
Standards	3
Test Interpretation	3
Test Validity	3
Item Response Theory	2
Mathematical Models	2
Measurement	2
Measurement Techniques	2
Personality Measures	2
Probability	2
Psychometrics	2
Statistical Analysis	2
Test Bias	2
Test Items	2
Test Results	2
Tests	2
Value Added Models	2
More ▼

Source

Measurement:…	4
Educational and Psychological…	2
ACT, Inc.	1
Assessment in Education:…	1
Educational Psychologist	1
Journal of Faculty Development	1
Practical Assessment,…	1
ProQuest LLC	1
School Guidance Worker	1

Author

Anderson, Kent E.	1
Berk, Ronald A.	1
Chen, Troy T.	1
Cramer, Angelique O. J.	1
Gorney, Kylie	1
Hathcoat, John D.	1
Kang, Taehoon	1
King, Wesley C., Jr.	1
Kylie Gorney	1
Miles, Edward W.	1
Mislevy, Robert J.	1
Murphy, Kevin R.	1
Newton, Paul E.	1
Plake, Barbara Sterrett	1
Sandip Sinharay	1
Shavelson, Richard J.	1
Sireci, Stephen G.	1
Wang, Jianjun	1
More ▼

Publication Type

Journal Articles	11
Reports - Descriptive	6
Opinion Papers	4
Reports - Research	4
Speeches/Meeting Papers	2
Dissertations/Theses -…	1
Information Analyses	1
Reports - Evaluative	1
Reports - General	1

Education Level

Audience

Location

Canada

Laws, Policies, & Programs

Race to the Top

Assessments and Surveys

Wechsler Intelligence Scale…

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Added Value of Subscores for Tests with Polytomous Items

Peer reviewed

Direct link

Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025

Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…

Descriptors: Scores, Test Theory, Test Items, Testing

Using Item Scores and Distractors to Detect Aberrant Behavior

Direct link

Gorney, Kylie – ProQuest LLC, 2023

Aberrant behavior refers to any type of unusual behavior that would not be expected under normal circumstances. In educational and psychological testing, such behaviors have the potential to severely bias the aberrant examinee's test score while also jeopardizing the test scores of countless others. It is therefore crucial that aberrant examinees…

Descriptors: Behavior Problems, Educational Testing, Psychological Testing, Test Bias

On the Validity of Useless Tests

Peer reviewed

Direct link

Sireci, Stephen G. – Assessment in Education: Principles, Policy & Practice, 2016

A misconception exists that validity may refer only to the "interpretation" of test scores and not to the "uses" of those scores. The development and evolution of validity theory illustrate test score interpretation was a primary focus in the earliest days of modern testing, and that validating interpretations derived from test…

Descriptors: Test Validity, Misconceptions, Evaluation Utilization, Data Interpretation

Validity Semantics in Educational and Psychological Assessment

Peer reviewed
PDF on ERIC

Download full text

Hathcoat, John D. – Practical Assessment, Research & Evaluation, 2013

The semantics, or meaning, of validity is a fluid concept in educational and psychological testing. Contemporary controversies surrounding this concept appear to stem from the proper location of validity. Under one view, validity is a property of score-based inferences and entailed uses of test scores. This view is challenged by the…

Descriptors: Test Validity, Educational Testing, Psychological Testing, Scores

On an Approach to Testing and Modeling Competence

Peer reviewed

Direct link

Shavelson, Richard J. – Educational Psychologist, 2013

E. L. Thorndike contributed significantly to the field of educational and psychological testing as well as more broadly to psychological studies in education. This article follows in his testing legacy. I address the escalating demand, across societal sectors, to measure individual and group competencies. In formulating an approach to measuring…

Descriptors: Competence, Psychology, Psychological Testing, Psychological Studies

Why the Item "23 + 1" Is Not in a Depression Questionnaire: Validity from a Network Perspective

Peer reviewed

Direct link

Cramer, Angelique O. J. – Measurement: Interdisciplinary Research and Perspectives, 2012

What is validity? A simple question but apparently one with many answers, as Paul Newton highlights in his review of the history of validity. The current definition of validity, as entertained in the 1999 "Standards for Educational and Psychological Testing" is indeed a consensus, one between the classical notion of attributes, and measures…

Descriptors: Validity, Educational Testing, Depression (Psychology), Psychology

Validity for What? The Peril of Overclarifying

Peer reviewed

Direct link

Murphy, Kevin R. – Measurement: Interdisciplinary Research and Perspectives, 2012

As Paul Newton so ably demonstrates, the concept of validity is both important and problematic. Over the last several decades, a consensus definition of validity has emerged; the current edition of "Standards for Educational and Psychological Testing" notes, "Validity refers to the degree to which evidence and theory support the interpretations of…

Descriptors: Evidence, Validity, Educational Testing, Psychological Testing

Value of Value-Added Models Based on Student Outcomes to Evaluate Teaching

Peer reviewed

Direct link

Berk, Ronald A. – Journal of Faculty Development, 2016

Recently, student outcomes have bubbled to the top of debates about how to evaluate teaching in community and liberal arts colleges, universities, and professional schools, but even more international attention has been riveted on how outcomes are being used to evaluate teachers and administrators K-12 (Harris, 2012; Rowen & Raudenbush, 2016;…

Descriptors: Value Added Models, Academic Achievement, Outcomes of Education, Teacher Evaluation

Clarifying the Consensus Definition of Validity

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2012

The 1999 "Standards for Educational and Psychological Testing" defines validity as the degree to which evidence and theory support the interpretations of test scores entailed by proposed uses of tests. Although quite explicit, there are ways in which this definition lacks precision, consistency, and clarity. The history of validity has taught us…

Descriptors: Evidence, Validity, Educational Testing, Risk

The Case for Informal Argument

Peer reviewed

Direct link

Mislevy, Robert J. – Measurement: Interdisciplinary Research and Perspectives, 2012

Paul E. Newton's "Clarifying the Consensus Definition of Validity" addresses the single most important, yet stubbornly protean, value in educational and psychological assessment. "Standards for Educational and Psychological Testing" (American Educational Research Association, American Psychological Association, & National Council on Measurement in…

Descriptors: Evidence, Validity, Educational Testing, Psychological Evaluation

An Investigation of the Performance of the Generalized S-X[superscript 2] Item-Fit Index for Polytomous IRT Models. ACT Research Report Series, 2007-1

Download full text

Kang, Taehoon; Chen, Troy T. – ACT, Inc., 2007

Orlando and Thissen (2000, 2003) proposed an item-fit index, S-X[superscript 2], for dichotomous item response theory (IRT) models, which has performed better than traditional item-fit statistics such as Yen's (1981) Q[subscript 1] and McKinley and Mill's (1985) G[superscript 2]. This study extends the utility of S-X[superscript 2] to polytomous…

Descriptors: Item Response Theory, Models, Computer Software, Statistical Analysis

Stochastic Models of Quality Control on Test Misgrading.

Download full text

Wang, Jianjun – 2002

Stochastic models are developed in this article to examine the rate of test misgrading in educational and psychological measurement. The estimation of inadvertent grading errors can serve as a basis for quality control in measurement. Limitations of traditional Poisson models have been reviewed to highlight the need to introduce new models using…

Descriptors: Educational Testing, Grading, Mathematical Models, Measurement Techniques

Comparing Profile Variability Across Independent Groups.

Plake, Barbara Sterrett; And Others – 1980

The difficulties in comparing profile variability (a measure of test scatter) are briefly discussed and the limitations of current techniques pointed out. Test scatter is defined as individual variation in test scores between or within various psychological and educational tests. Currently, no statistical technique for the comparison of profile…

Descriptors: Educational Diagnosis, Educational Testing, Individual Testing, Mathematical Models

Gender and Administration Mode Effects when Pencil-and-Paper Personality Tests Are Computerized.

Peer reviewed

Miles, Edward W.; King, Wesley C., Jr. – Educational and Psychological Measurement, 1998

Whether gender and administration mode (computer versus pencil and paper) influenced mean scores on four noncognitive psychological instruments was studied with 874 undergraduates. Results show no statistically significant interaction between gender and administration mode, although statistically significant main effects were found for both gender…

Descriptors: Computer Assisted Testing, Educational Testing, Higher Education, Personality Assessment

Testing and the Counsellor.

Anderson, Kent E.; And Others – School Guidance Worker, 1984

Presents nine articles dealing with aspects of testing and the role of the school counselor. Topics include interpreting test scores, testing problems and solutions, developing tests to predict dropping out, sex bias in vocational testing, detecting bias in standardized tests, assessments of reentry adults, and interest tests for adults. (JAC)

Descriptors: Adults, Children, Counselor Role, Dropout Prevention