ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	5
Since 2007 (last 20 years)	15

Descriptor

Evaluation Methods	99
Test Interpretation	99
Test Validity	82
Test Reliability	35
Student Evaluation	26
Test Construction	26
Test Use	24
Measurement Techniques	23
Elementary Secondary Education	21
Educational Assessment	20
Evaluation Criteria	19
Test Results	19
Scores	17
Educational Testing	16
Testing Problems	16
Achievement Tests	14
Validity	14
Comparative Analysis	13
Testing	12
Tests	12
Performance Based Assessment	11
Standardized Tests	11
Foreign Countries	10
Psychometrics	10
Statistical Analysis	10

Education Level

Elementary Secondary Education	8
Higher Education	3
Junior High Schools	2
Middle Schools	2
Postsecondary Education	2
Secondary Education	2
Elementary Education	1
Grade 7	1

Audience

Practitioners	13
Teachers	7
Administrators	3
Policymakers	1
Researchers	1
Students	1

Location

United Kingdom	5
Australia	3
United Kingdom (England)	2
United States	2
China	1
Connecticut	1
Greece	1
Kentucky (Louisville)	1
Michigan	1
United Kingdom (Wales)	1

Laws, Policies, & Programs

Elementary and Secondary…	3
Elementary and Secondary…	1

Assessments and Surveys

National Assessment of…	4
Self Directed Search	2
Strong Campbell Interest…	2
Aberrant Behavior Checklist	1
Adjective Check List	1
Advanced Placement…	1
Child Abuse Potential…	1
Group Embedded Figures Test	1
Iowa Tests of Educational…	1
Learning Style Inventory	1
Minnesota Multiphasic…	1
Myers Briggs Type Indicator	1
Pennsylvania Educational…	1
Productivity Environmental…	1
Rokeach Value Survey	1
SAT (College Admission Test)	1
Self Directed Learning…	1
Sequential Tests of…	1

Showing 1 to 15 of 99 results

A Note on the Use of Categorical Subscores

Peer reviewed

Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025

Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…

Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment

Raters' Scoring Process in Assessment of Interpreting: An Empirical Study Based on Eye Tracking and Retrospective Verbalisation

Peer reviewed

Chao Han; Binghan Zheng; Mingqing Xie; Shirong Chen – Interpreter and Translator Trainer, 2024

Human raters' assessment of interpreting is a complex process. Previous researchers have mainly relied on verbal reports to examine this process. To advance our understanding, we conducted an empirical study, collecting raters' eye-movement and retrospection data in a computerised interpreting assessment in which three groups of raters (n = 35)…

Descriptors: Foreign Countries, College Students, College Graduates, Interrater Reliability

Re-Examining Measurement Invariance of School Climate Surveys across Race/Ethnicity

Peer reviewed

Stephen M. Leach; Jason C. Immekus; Jeffrey C. Valentine; Prathiba Batley; Dena Dossett; Tamara Lewis; Thomas Reece – Assessment for Effective Intervention, 2025

Educators commonly use school climate survey scores to inform and evaluate interventions for equitably improving learning and reducing educational disparities. Unfortunately, validity evidence to support these (and other) score uses often falls short. In response, Whitehouse et al. proposed a collaborative, two-part validity testing framework for…

Descriptors: School Surveys, Measurement, Hierarchical Linear Modeling, Educational Environment

Calibrating Items Using an Unfolding Model of Item Response Theory: The Case of the Trait Personality Questionnaire 5 (TPQue5)

Peer reviewed

Eirini M. Mitropoulou; Leonidas A. Zampetakis; Ioannis Tsaousis – Evaluation Review, 2024

Unfolding item response theory (IRT) models are important alternatives to dominance IRT models in describing the response processes on self-report tests. Their usage is common in personality measures, since they indicate potential differentiations in test score interpretation. This paper aims to gain a better insight into the structure of trait…

Descriptors: Foreign Countries, Adults, Item Response Theory, Personality Traits

Disrupted Data: Using Longitudinal Assessment Systems to Monitor Test Score Quality

Peer reviewed

An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022

Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…

Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies

Is What You See What You Really Get? Comparison of Scoring Techniques in the Assessment of Real-World Divergent Thinking

Peer reviewed

Plucker, Jonathan A.; Qian, Meihua; Schmalensee, Stephanie L. – Creativity Research Journal, 2014

In recent years, the social sciences have seen a resurgence in the study of divergent thinking (DT) measures. However, many of these recent advances have focused on abstract, decontextualized DT tasks (e.g., list as many things as you can think of that have wheels). This study provides a new perspective by exploring the reliability and validity…

Descriptors: Creative Thinking, Creativity Tests, Scoring Formulas, Evaluation Methods

Worldwide Test Reviewing at the Beginning of the Twenty-First Century

Peer reviewed

Geisinger, Kurt F. – International Journal of Testing, 2012

This article sets the stage for the description of a variety of approaches to test reviewing worldwide. It describes the importance of test reviewing as a protection of the public and of society and also the benefits of this activity for test users, who must choose measures to use in particular situations with particular clients at a particular…

Descriptors: Test Reviews, Evaluation Methods, Evaluation Criteria, Global Approach

What Constitutes Legitimate Causal Linking?

Peer reviewed

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

Assessment 101: Assessment Made Easy for First-Year Teachers

Bailey, Jennifer; Little, Chelsea; Rigney, Rex; Thaler, Anna; Weiderman, Ken; Yorkovich, Ben – Online Submission, 2010

This handbook is designed as a quick reference for first-year teachers who find themselves in an assessment driven environment with little experience to help make sense of the language, underlying philosophy, or organizational structure of the assessment system. The handbook begins with advice on developing and evaluating effective learning…

Descriptors: Student Evaluation, Portfolio Assessment, Elementary Secondary Education, Performance Based Assessment

Assessment of Prior Learning in Higher Education: A Review from a Validity Perspective

Peer reviewed

Stenlund, Tova – Assessment & Evaluation in Higher Education, 2010

The process of giving official acknowledgment to formal, informal and non-formal prior learning is commonly labelled as assessment, accreditation or recognition of prior learning (APL), representing a practice that is expanding in higher education in many countries. This paper focuses specifically on the assessment part of APL, which undoubtedly…

Descriptors: Higher Education, Validity, Prior Learning, Program Effectiveness

Understanding Comparability of Examination Standards

Peer reviewed

Coe, Robert – Research Papers in Education, 2010

Much of the argument about comparability of examination standards is at cross-purposes; contradictory positions are in fact often both defensible, but they are using the same words to mean different things. To clarify this, two broad conceptualisations of standards can be identified. One sees the standard in the observed phenomena of performance…

Descriptors: Foreign Countries, Tests, Evaluation Methods, Standards

Evaluating the Rank-Ordering Method for Standard Maintaining

Peer reviewed

Bramley, Tom; Gill, Tim – Research Papers in Education, 2010

The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…

Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries

Linking through Improved Design, Not Redefinition: Commentary on Newton

Peer reviewed

Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010

"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…

Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques

Contrasting Conceptions of Comparability

Peer reviewed

Newton, Paul E. – Research Papers in Education, 2010

Robert Coe has claimed that three broad conceptions of comparability can be identified from the literature: performance, statistical and conventional. Each of these he rejected, in favour of a single, integrated conception which relies upon the notion of a "linking construct" and which he termed "construct comparability".…

Descriptors: Psychometrics, Measurement Techniques, Foreign Countries, Tests

What is the Problem of Construct Validity?

Popp, Jerome A. – 1975

In this paper it is argued that the problem of construct validation in the construction of instruments and indicators is an important problem for educational researchers and practitioners; moreover, it is claimed that the popular notion of operational definition is a misleading idea which has obscured the problem of construct validity in…

Descriptors: Evaluation Methods, Statistical Analysis, Statistical Significance, Test Construction

Source

Educational Measurement:…	5
Measurement:…	3
Research Papers in Education	3
Assessment for Effective…	2
Educational Researcher	2
Journal of Educational…	2
Alberta Journal of…	1
American Journal of Education	1
American Journal on Mental…	1
Annual Review of Applied…	1
Applied Measurement in…	1
Assessment & Evaluation in…	1
Career Development Quarterly	1
Counseling Psychologist	1
Creativity Research Journal	1
Early Child Development and…	1
Education Economics	1
Evaluation Review	1
Focus on Exceptional Children	1
High School Magazine	1
Innovative Higher Education	1
Instructional Science	1
International Journal of…	1
Interpreter and Translator…	1
J Educ Meas	1

Author

Linn, Robert L.	5
Fleming, Dan B.	2
Ackerman, Terry A.	1
Allen, R. R.	1
An, Lily Shiao	1
Anderson, Colette	1
Archer, Robert P.	1
Arreola, Raoul A.	1
Arter, Judith A.	1
Athelstan, Gary T.	1
Bailey, Jennifer	1
Baird, Jo-Anne	1
Baker, Eva L.	1
Banta, Trudy W.	1
Benavidez, Charlotte	1
Bihm, Elson M.	1
Binghan Zheng	1
Bracey, Gerald W.	1
Bramley, Tom	1
Bridges, Claude F.	1
Campbell, Vicki L.	1
Cancelli, Anthony A.	1
Chao Han	1
Coe, Robert	1

Publication Type

Journal Articles	40
Reports - Research	24
Opinion Papers	17
Reports - Evaluative	16
Speeches/Meeting Papers	13
Guides - Non-Classroom	12
Information Analyses	10
Reports - Descriptive	9
Books	6
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - Classroom - Teacher	2
Reports - General	2
Tests/Questionnaires	2
Book/Product Reviews	1
Collected Works - Proceedings	1
Guides - Classroom - Learner	1
Guides - General	1
Reference Materials -…	1
Reference Materials -…	1