ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	16

Descriptor

Test Interpretation	112
Test Theory	112
Test Validity	34
Test Construction	33
Test Reliability	32
Scores	26
Testing Problems	24
Criterion Referenced Tests	21
Measurement Techniques	17
Psychometrics	17
Standardized Tests	17
Test Use	17
Elementary Secondary Education	16
Higher Education	16
Educational Testing	15
Foreign Countries	15
Item Analysis	15
Norm Referenced Tests	14
Test Results	14
Testing	14
Mathematical Models	13
Statistical Analysis	13
Test Items	13
Educational Assessment	12
Error of Measurement	12

Publication Type

Journal Articles	55
Reports - Research	45
Reports - Evaluative	21
Speeches/Meeting Papers	21
Opinion Papers	17
Reports - Descriptive	14
Guides - Non-Classroom	11
Information Analyses	11
Books	4
Numerical/Quantitative Data	3
Tests/Questionnaires	3
Collected Works - Serials	2
Collected Works - Proceedings	1
Guides - Classroom - Learner	1
Reference Materials -…	1
Reference Materials - General	1
Reports - General	1

Education Level

Elementary Secondary Education	6
Higher Education	2
Postsecondary Education	1

Audience

Practitioners	12
Teachers	7
Researchers	6
Administrators	1
Policymakers	1
Students	1

Location

United Kingdom	4
Australia	3
Canada	3
United Kingdom (Great Britain)	3
United States	3
United Kingdom (England)	2
New York	1
New York (New York)	1
Taiwan	1
USSR	1
United Kingdom (Wales)	1
West Germany	1

Laws, Policies, & Programs

Elementary and Secondary…	1
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Showing 1 to 15 of 112 results

A General Method for Adjusting Test Score Distributions to Account for Rescoring and Retesting

Peer reviewed

Litschwartz, Sophie – Society for Research on Educational Effectiveness, 2021

Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who do analysis on these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regent test score…

Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas

Comments on Implementing Validity Theory

Peer reviewed

Gafni, Naomi – Assessment in Education: Principles, Policy & Practice, 2016

Naomi Gafni, director of Research and Development, National Institute for Testing and Evaluation, Jerusalem, Israel, has devoted a substantial part of her career to the development of admissions tests and other educational tests and to the investigation of their validity. As such she is keenly aware of the complexities involved in this process.…

Descriptors: Test Validity, Test Interpretation, Test Use, Test Construction

A Note on Assessing the Added Value of Subscores

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2014

Brennan (Brennan, R. L., 2012) noted that users of test scores often want (indeed, demand) that subscores be reported, along with total test scores, for diagnostic purposes. Haberman (Haberman, S. J., 2008) suggested a method based on classical test theory (CTT) to determine if subscores have added value over the total score. According to this…

Descriptors: Scores, Test Theory, Test Interpretation

An Alternative Approach to Test Analysis and Interpretation

Powell, J. C. – International Association for Development of the Information Society, 2013

This reflection paper challenges current test scoring practices on the grounds that most wrong-answer selections are thoughtful not random, presenting research supporting this proposition. An alternative test scoring system is presented, described and its outcomes discussed. This new scoring system increases the number of variables considered,…

Descriptors: Test Theory, Test Interpretation, Scoring, Multiple Choice Tests

Generalizability Theory as a Unifying Framework of Measurement Reliability in Adolescent Research

Peer reviewed

Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014

In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…

Descriptors: Generalizability Theory, Measurement, Reliability, Correlation

The Contestant Perspective on Taking Tests: Emanations from the Statue within

Peer reviewed

Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012

Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…

Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability

Hit by a Perfect Storm? Art & Design in the National Student Survey

Peer reviewed

Yorke, Mantz; Orr, Susan; Blair, Bernadette – Studies in Higher Education, 2014

There has long been the suspicion amongst staff in Art & Design that the ratings given to their subject disciplines in the UK's National Student Survey are adversely affected by a combination of circumstances--a "perfect storm". The "perfect storm" proposition is tested by comparing ratings for Art & Design with those…

Descriptors: Student Surveys, National Surveys, Art Education, Design

Validity and the Consequences of Test Interpretation and Use

Peer reviewed

Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011

The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…

Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Tests in Europe: Where We Are and Where We Should Go

Peer reviewed

Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012

Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…

Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries

Conceptualizing Comparability

Peer reviewed

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Cross-Cultural Validity of the TIMSS-1999 Mathematics Test: Verification of a Cognitive Model

Peer reviewed

Chen, Yi-Hsin; Gorin, Joanna S.; Thompson, Marilyn S.; Tatsuoka, Kikumi K. – International Journal of Testing, 2008

As with any test administered across linguistically and culturally diverse groups, evidence suggesting the equivalence of score meaning across countries is needed for valid comparisons. The current study examines the cross-cultural equivalence of score interpretations from the Trends in International Mathematics and Science Study (TIMSS)-1999 from…

Descriptors: Construct Validity, Mathematics Tests, Foreign Countries, Equated Scores

What Constitutes Legitimate Causal Linking?

Peer reviewed

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

Revised Thinking about the Nature of Score Validity.

Woolley, Kristin K. – 1996

The theory of score validity has undergone several revisions within the measurement community. The current consensus among professionals is a rejection of the trinitarian doctrine (J. P. Guion, 1980) of score validity and the recognition of a unified view that includes social consequences of test interpretation and use. While some aspects of the…

Descriptors: Models, Scores, Standards, Test Interpretation

Source

Educational Measurement:…	4
Measurement:…	4
Psychometrika	4
Educational and Psychological…	3
Journal of Educational…	3
American Psychologist	2
Applied Psychological…	2
Clearing House	2
International Journal of…	2
Journal of Educational…	2
Journal of School Psychology	2
Alberta Journal of…	1
American Educator: The…	1
Annual Review of Applied…	1
Applied Measurement in…	1
Assessment in Education:…	1
Canadian Journal for…	1
Contemporary Education	1
Educational Researcher	1
Educational Studies in…	1
Executive Review	1
Illinois Schools Journal	1
Intelligence	1
International Association for…	1
International Journal of…	1

Author

Powell, J. C.	3
Tatsuoka, Kikumi K.	3
Cliff, Norman	2
Crocker, Linda	2
Haladyna, Tom	2
Hubley, Anita M.	2
Lord, Frederic M.	2
Lyon, Mark A.	2
Mislevy, Robert J.	2
Price, Gary G.	2
Santee, Phillip	2
Smith, Douglas K.	2
Sullivan, Francis J.	2
Tatsuoka, Maurice M.	2
Whitehead, Bruce	2
Zumbo, Bruno D.	2
Algina, James	1
Andrich, David	1
Angoff, William H.	1
Arnold, Margery E.	1
Austin, James T.	1
Baird, Jo-Anne	1
Bao, Lei	1
Barnard, Jane	1

Assessments and Surveys

Comprehensive Tests of Basic…	3
Kaufman Assessment Battery…	3
SAT (College Admission Test)	3
California Achievement Tests	2
Stanford Achievement Tests	2
ACT Assessment	1
Advanced Placement…	1
Armed Services Vocational…	1
Developmental Indicators for…	1
Eysenck Personality Inventory	1
General Aptitude Test Battery	1
General Educational…	1
Graduate Record Examinations	1
Minnesota Multiphasic…	1
National Teacher Examinations	1
Peabody Picture Vocabulary…	1
Preliminary Scholastic…	1
Test of Adult Basic Education	1
Test of English as a Foreign…	1
Wechsler Intelligence Scale…	1
Woodcock Johnson Tests of…	1