Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 11 |
Descriptor
Models | 32 |
Test Interpretation | 32 |
Test Validity | 23 |
Test Use | 9 |
Scores | 8 |
Test Reliability | 8 |
Test Construction | 7 |
Validity | 7 |
Test Items | 6 |
Evaluation Methods | 5 |
Language Tests | 5 |
More ▼ |
Source
Author
Kane, Michael T. | 2 |
Ackerman, Terry A. | 1 |
Adams, Elizabeth | 1 |
Arreola, Raoul A. | 1 |
Bardon, Jack I., Ed. | 1 |
Bernal, Ernest M. | 1 |
Charvat, Jeff | 1 |
Chen, Yi-Hsin | 1 |
DeBello, Thomas C. | 1 |
Douglas, Dan | 1 |
Dunleavy, Peter | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 2 |
Postsecondary Education | 2 |
Adult Education | 1 |
Higher Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Practitioners | 2 |
Researchers | 1 |
Location
Australia | 1 |
Indiana | 1 |
Taiwan | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
ACTFL Oral Proficiency… | 1 |
National Assessment of… | 1 |
SAT (College Admission Test) | 1 |
Stages of Concern… | 1 |
System of Multicultural… | 1 |
Test of Adult Basic Education | 1 |
What Works Clearinghouse Rating
Kent Anderson Seidel – School Leadership Review, 2025
This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since developed in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…
Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention
Ketterlin-Geller, Leanne R.; Perry, Lindsey; Adams, Elizabeth – Applied Measurement in Education, 2019
Despite the call for an argument-based approach to validity over 25 years ago, few examples exist in the published literature. One possible explanation for this outcome is that the complexity of the argument-based approach makes implementation difficult. To counter this claim, we propose that the Assessment Triangle can serve as the overarching…
Descriptors: Validity, Educational Assessment, Models, Screening Tests
Stone, Elizabeth; Wylie, E. Caroline – ETS Research Report Series, 2019
We describe the summative assessment component within a K-12 assessment program and our development of a validity argument to support its claims with respect to intended uses and interpretations. First, we describe the "Winsight"® assessment program theory of action, a logic model elucidating mechanisms for how use of the assessment…
Descriptors: Summative Evaluation, Educational Assessment, Test Validity, Test Use
Kane, Michael T. – Assessment in Education: Principles, Policy & Practice, 2016
How we choose to use a term depends on what we want to do with it. If "validity" is to be used to support a score interpretation, validation would require an analysis of the plausibility of that interpretation. If validity is to be used to support score uses, validation would require an analysis of the appropriateness of the proposed…
Descriptors: Test Validity, Test Interpretation, Test Use, Scores
Eklund, Katie; Rossen, Eric; Charvat, Jeff; Meyer, Lauren; Tanner, Nick – Journal of Applied School Psychology, 2016
The National Association of School Psychologists' Model for Comprehensive and Integrated School Psychological Services (2010a), often referred to as the National Association of School Psychologists' Practice Model, describes the comprehensive range of professional skills and competencies available from school psychologists across 10 domains. The…
Descriptors: School Psychologists, Self Evaluation (Individuals), Factor Structure, Professional Associations
Tuccitto, Daniel E.; Giacobbi, Peter R., Jr.; Leite, Walter L. – Educational and Psychological Measurement, 2010
This study tested five confirmatory factor analytic (CFA) models of the Positive Affect Negative Affect Schedule (PANAS) to provide validity evidence based on its internal structure. A sample of 223 club sport athletes indicated their emotions during the past week. Results revealed that an orthogonal two-factor CFA model, specifying error…
Descriptors: Factor Analysis, Models, Affective Measures, Validity
Langan, Anthony Mark; Dunleavy, Peter; Fielding, Alan – Education Sciences, 2013
Many countries use national-level surveys to capture student opinions about their university experiences. It is necessary to interpret survey results in an appropriate context to inform decision-making at many levels. To provide context to national survey outcomes, we describe patterns in the ratings of science and engineering subjects from the…
Descriptors: Models, National Surveys, Undergraduate Students, College Science
Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011
The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…
Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation
Advantages of the Rasch Measurement Model in Analysing Educational Tests: An Applicator's Reflection
Tormakangas, Kari – Educational Research and Evaluation, 2011
Educational achievement is a very important issue for parents, teachers, and the government. An accurate measurement plays a very important role in evaluating achievement fairly, and, therefore, analysis methods have been developed considerably in recent years. Education based on long-time learning processes forms a fruitful base for item tests,…
Descriptors: Test Items, Item Analysis, Learning Processes, Item Response Theory
Chen, Yi-Hsin; Gorin, Joanna S.; Thompson, Marilyn S.; Tatsuoka, Kikumi K. – International Journal of Testing, 2008
As with any test administered across linguistically and culturally diverse groups, evidence suggesting the equivalence of score meaning across countries is needed for valid comparisons. The current study examines the cross-cultural equivalence of score interpretations from the Trends in International Mathematics and Science Study (TIMSS)-1999 from…
Descriptors: Construct Validity, Mathematics Tests, Foreign Countries, Equated Scores
Forehand, Garlie A. – 1982
Problems in validating ability tests for handicapped students and research approaches to predictive validity are discussed. Validity for handicapped persons tested under regular conditions; for applicants to special programs, and for tests taken under special administrative conditions are considered. Item analysis and the construction of new…
Descriptors: Academic Ability, Disabilities, Evaluation Criteria, Measures (Individuals)
Woolley, Kristin K. – 1996
The theory of score validity has undergone several revisions within the measurement community. The current consensus among professionals is a rejection of the trinitarian doctrine (J. P. Guion, 1980) of score validity and the recognition of a unified view that includes social consequences of test interpretation and use. While some aspects of the…
Descriptors: Models, Scores, Standards, Test Interpretation
McNamara, Tim – Language Assessment Quarterly, 2006
The thought of Samuel Messick has influenced language testing in 2 main ways: in proposing a new understanding of how inferences made based on tests must be challenged, and in drawing attention to the consequences of test use. The former has had a powerful impact on language-testing research, most notably in Bachman's work on validity and the…
Descriptors: Test Use, Testing, Language Tests, Validity

Haertel, Edward – Review of Educational Research, 1985
A unified framework for validating criterion-referenced test (CRT) interpretation is proposed. Using functional literacy as an illustration, the instructional outcomes assessed are called achievement constructs, and described in psychological and behavioral terms. The proposed construct validation methods can yield tests more closely linked to…
Descriptors: Achievement Tests, Competence, Criterion Referenced Tests, Educational Objectives

Ackerman, Terry A. – Applied Measurement in Education, 1994
When item response data do not satisfy the unidimensionality assumption, multidimensional item response theory (MIRT) should be used to model the item-examinee interaction. This article presents and discusses MIRT analyses designed to give better insight into what individual items are measuring. (SLD)
Descriptors: Evaluation Methods, Item Response Theory, Measurement Techniques, Models