ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	14

Descriptor

Test Interpretation	55
Test Theory	55
Test Validity	20
Psychometrics	15
Test Construction	15
Testing Problems	14
Test Use	13
Scores	12
Foreign Countries	11
Measurement Techniques	11
Test Reliability	11
Educational Assessment	10
Educational Testing	8
Testing	8
Comparative Analysis	7
Criterion Referenced Tests	7
Equated Scores	7
Evaluation Methods	7
Latent Trait Theory	7
Test Items	7
Test Results	7
Error of Measurement	6
Standardized Tests	6
Student Evaluation	6
Test Format	6
More ▼

Publication Type

Journal Articles	55
Reports - Research	18
Opinion Papers	13
Reports - Evaluative	11
Information Analyses	9
Reports - Descriptive	8
Guides - Non-Classroom	2
Speeches/Meeting Papers	2
Numerical/Quantitative Data	1
Reference Materials -…	1
Reference Materials - General	1
Reports - General	1
Tests/Questionnaires	1
More ▼

Education Level

Elementary Secondary Education	6
Higher Education	2
Postsecondary Education	1

Audience

Practitioners	4
Researchers	4
Teachers	1

Location

United Kingdom	4
United States	3
Australia	2
United Kingdom (England)	2
United Kingdom (Great Britain)	2
Canada	1
Taiwan	1
United Kingdom (Wales)	1
West Germany	1

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

SAT (College Admission Test)	3
Kaufman Assessment Battery…	2
ACT Assessment	1
Advanced Placement…	1
Armed Services Vocational…	1
Comprehensive Tests of Basic…	1
Developmental Indicators for…	1
Eysenck Personality Inventory	1
General Aptitude Test Battery	1
Graduate Record Examinations	1
Minnesota Multiphasic…	1
Peabody Picture Vocabulary…	1
Preliminary Scholastic…	1
Stanford Achievement Tests	1
Test of English as a Foreign…	1
Woodcock Johnson Tests of…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 55 results Save | Export

Comments on Implementing Validity Theory

Peer reviewed

Direct link

Gafni, Naomi – Assessment in Education: Principles, Policy & Practice, 2016

Naomi Gafni, director of Research and Development, National Institute for Testing and Evaluation, Jerusalem, Israel, has devoted a substantial part of her career to the development of admissions tests and other educational tests and to the investigation of their validity. As such she is keenly aware of the complexities involved in this process.…

Descriptors: Test Validity, Test Interpretation, Test Use, Test Construction

A Note on Assessing the Added Value of Subscores

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2014

Brennan (Brennan, R. L., 2012) noted that users of test scores often want (indeed, demand) that subscores be reported, along with total test scores, for diagnostic purposes. Haberman (Haberman, S. J., 2008) suggested a method based on classical test theory (CTT) to determine if subscores have added value over the total score. According to this…

Descriptors: Scores, Test Theory, Test Interpretation

Generalizability Theory as a Unifying Framework of Measurement Reliability in Adolescent Research

Peer reviewed

Direct link

Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014

In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…

Descriptors: Generalizability Theory, Measurement, Reliability, Correlation

The Contestant Perspective on Taking Tests: Emanations from the Statue within

Peer reviewed

Direct link

Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012

Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…

Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability

Hit by a Perfect Storm? Art & Design in the National Student Survey

Peer reviewed

Direct link

Yorke, Mantz; Orr, Susan; Blair, Bernadette – Studies in Higher Education, 2014

There has long been the suspicion amongst staff in Art & Design that the ratings given to their subject disciplines in the UK's National Student Survey are adversely affected by a combination of circumstances--a "perfect storm". The "perfect storm" proposition is tested by comparing ratings for Art & Design with those…

Descriptors: Student Surveys, National Surveys, Art Education, Design

Validity and the Consequences of Test Interpretation and Use

Peer reviewed

Direct link

Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011

The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…

Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Direct link

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Tests in Europe: Where We Are and Where We Should Go

Peer reviewed

Direct link

Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012

Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…

Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries

Conceptualizing Comparability

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Cross-Cultural Validity of the TIMSS-1999 Mathematics Test: Verification of a Cognitive Model

Peer reviewed

Direct link

Chen, Yi-Hsin; Gorin, Joanna S.; Thompson, Marilyn S.; Tatsuoka, Kikumi K. – International Journal of Testing, 2008

As with any test administered across linguistically and culturally diverse groups, evidence suggesting the equivalence of score meaning across countries is needed for valid comparisons. The current study examines the cross-cultural equivalence of score interpretations from the Trends in International Mathematics and Science Study (TIMSS)-1999 from…

Descriptors: Construct Validity, Mathematics Tests, Foreign Countries, Equated Scores

What Constitutes Legitimate Causal Linking?

Peer reviewed

Direct link

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

Direct link

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

A Critical Analysis of University Examinations in Mathematics. Part II: A Problem of Evaluation.

Peer reviewed

Griffiths, H. B.; McLone, R. R. – Educational Studies in Mathematics, 1984

Results obtained when a procedure for assessing the questions on uniersity mathematics examinations to see what skills were needed for their solution are given for a sample of 1400 questions set during 1976 in 10 British universities. The method is a way of focusing rational argument. (MNS)

Descriptors: College Mathematics, Higher Education, Mathematics Instruction, Test Construction

Tolerance Intervals for True Scores.

Peer reviewed

Jarjoura, David – Journal of Educational Statistics, 1985

Issues regarding tolerance and confidence intervals are discussed within the context of educational measurement, and conceptual distinctions are drawn between these two types of intervals. Points are raised about the advantages of tolerance intervals when the focus is on a particular observed score rather than a particular examinee. (Author/BW)

Descriptors: Comparative Analysis, Error of Measurement, Mathematical Models, Test Interpretation

A Latent Class Model for Rating Data.

Peer reviewed

Rost, Jurgen – Psychometrika, 1985

A latent class model for rating data is presented which provides an alternative to the latent trait approach of analyzing test data. It is the analog of Andrich's binomial Rasch model for Lazarsfeld's latent class analysis (LCA). Response probabilities for rating categories follow a binomial distribution and depend on class-specific item…

Descriptors: Item Analysis, Latent Trait Theory, Mathematical Models, Rating Scales

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Educational Measurement:…	4
Measurement:…	4
Educational and Psychological…	3
Journal of Educational…	3
Psychometrika	3
American Psychologist	2
Clearing House	2
International Journal of…	2
Journal of Educational…	2
Journal of School Psychology	2
Alberta Journal of…	1
American Educator: The…	1
Annual Review of Applied…	1
Applied Measurement in…	1
Applied Psychological…	1
Assessment in Education:…	1
Canadian Journal for…	1
Contemporary Education	1
Educational Researcher	1
Educational Studies in…	1
Illinois Schools Journal	1
Intelligence	1
International Journal of…	1
Journal of Early Adolescence	1
Journal of Economic Education	1
More ▼

Hubley, Anita M.	2
Santee, Phillip	2
Whitehead, Bruce	2
Zumbo, Bruno D.	2
Austin, James T.	1
Baird, Jo-Anne	1
Beal, Judy	1
Blair, Bernadette	1
Blixt, Sonya L.	1
Bracken, Bruce A.	1
Briere, John	1
Chen, Yi-Hsin	1
Cizek, Gregory J.	1
Cliff, Norman	1
Cresswell, Mike	1
Crocker, Linda	1
Dorans, Neil J.	1
Douglas, Dan	1
Drasgow, Fritz	1
Elosua, Paula	1
Fan, Xitao	1
Fricke, Reiner	1
Frisbie, David A.	1
Gafni, Naomi	1
More ▼