Publication Date
In 2025 | 3 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 12 |
Since 2006 (last 20 years) | 43 |
Descriptor
Evaluation Methods | 362 |
Test Interpretation | 362 |
Elementary Secondary Education | 84 |
Student Evaluation | 82 |
Test Validity | 82 |
Test Construction | 81 |
Educational Assessment | 61 |
Testing | 60 |
Test Results | 56 |
Test Use | 56 |
Measurement Techniques | 53 |
More ▼ |
Source
Author
Linn, Robert L. | 5 |
Padzensky, Herb | 3 |
Schafer, William D. | 3 |
Beach, David P. | 2 |
Elmore, Patricia B. | 2 |
Fleming, Dan B. | 2 |
Geisinger, Kurt F. | 2 |
Hood, Albert B. | 2 |
House, Gary D. | 2 |
Johnson, Richard W. | 2 |
Madaus, George F. | 2 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 50 |
Teachers | 18 |
Administrators | 12 |
Researchers | 10 |
Policymakers | 4 |
Students | 4 |
Counselors | 3 |
Community | 2 |
Parents | 1 |
Location
Canada | 8 |
United Kingdom | 8 |
Australia | 6 |
United States | 6 |
Connecticut | 3 |
South Africa | 3 |
United Kingdom (England) | 3 |
California | 2 |
Indiana | 2 |
Michigan | 2 |
Missouri | 2 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 12 |
No Child Left Behind Act 2001 | 2 |
Americans with Disabilities… | 1 |
Education Consolidation… | 1 |
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Hana Svobodová; Petr Trahorsch – International Research in Geographical and Environmental Education, 2025
Geographical Olympiads are disciplinary competitions that can be a tool for assessing geographical knowledge and skills in different countries of the world. This article aims to analyse the results of the national and international geography Olympiads and to identify their conditionality and interrelationship. The secondary aim is to find out…
Descriptors: Foreign Countries, Geography, Geography Instruction, Evaluation Methods
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Chao Han; Binghan Zheng; Mingqing Xie; Shirong Chen – Interpreter and Translator Trainer, 2024
Human raters' assessment of interpreting is a complex process. Previous researchers have mainly relied on verbal reports to examine this process. To advance our understanding, we conducted an empirical study, collecting raters' eye-movement and retrospection data in a computerised interpreting assessment in which three groups of raters (n = 35)…
Descriptors: Foreign Countries, College Students, College Graduates, Interrater Reliability
Stephen M. Leach; Jason C. Immekus; Jeffrey C. Valentine; Prathiba Batley; Dena Dossett; Tamara Lewis; Thomas Reece – Assessment for Effective Intervention, 2025
Educators commonly use school climate survey scores to inform and evaluate interventions for equitably improving learning and reducing educational disparities. Unfortunately, validity evidence to support these (and other) score uses often falls short. In response, Whitehouse et al. proposed a collaborative, two-part validity testing framework for…
Descriptors: School Surveys, Measurement, Hierarchical Linear Modeling, Educational Environment
Eirini M. Mitropoulou; Leonidas A. Zampetakis; Ioannis Tsaousis – Evaluation Review, 2024
Unfolding item response theory (IRT) models are important alternatives to dominance IRT models in describing the response processes on self-report tests. Their usage is common in personality measures, since they indicate potential differentiations in test score interpretation. This paper aims to gain a better insight into the structure of trait…
Descriptors: Foreign Countries, Adults, Item Response Theory, Personality Traits
Wind, Stefanie A. – Educational Measurement: Issues and Practice, 2020
Researchers have documented the impact of rater effects, or raters' tendencies to give different ratings than would be expected given examinee achievement levels, in performance assessments. However, the degree to which rater effects influence person fit, or the reasonableness of test-takers' achievement estimates given their response patterns,…
Descriptors: Performance Based Assessment, Evaluators, Achievement, Influences
An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022
Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…
Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies
Dumas, Denis; McNeish, Daniel; Greene, Jeffrey A. – Educational Psychologist, 2020
Scholars have lamented that current methods of assessing student performance do not align with contemporary views of learning as situated within students, contexts, and time. Here, we introduce and describe one theoretical--psychometric paradigm--termed "dynamic measurement"--designed to provide a valid representation of the way students…
Descriptors: Alternative Assessment, Psychometrics, Educational Psychology, Student Evaluation
Fitzpatrick, Tess; Clenton, Jon – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2017
This article offers a solution to a significant problem for teachers and researchers of language learning that confounds their interpretations and expectations of test data: The apparent simplicity of tests of vocabulary knowledge masks the complexity of the constructs they claim to measure. The authors first scrutinise task elements in two widely…
Descriptors: Language Tests, Vocabulary Development, Difficulty Level, Performance Factors
Michelle M. Neumann; Jason L. Anthony; Noé A. Erazo; David L. Neumann – Grantee Submission, 2019
The framework and tools used for classroom assessment can have significant impacts on teacher practices and student achievement. Getting assessment right is an important component in creating positive learning experiences and academic success. Recent government reports (e.g., United States, Australia) call for the development of systems that use…
Descriptors: Early Childhood Education, Futures (of Society), Educational Assessment, Evaluation Methods
Thummaphan, Phonraphee – ProQuest LLC, 2017
The present study aimed to represent the innovative assessments that support students' learning in STEM education through using the integrative framework for Cognitive Diagnostic Modeling (CDM). This framework is based on three components, cognition, observation, and interpretation (National Research Council, 2001). Specifically, this dissertation…
Descriptors: STEM Education, Cognitive Processes, Observation, Psychometrics
Hays, Danica G. – American Counseling Association, 2017
The latest edition of this perennial bestseller instructs and updates students and clinicians on the basic principles of psychological assessment and measurement, recent changes in assessment procedures, and the most widely used tests in counseling practice today. Dr. Danica Hays guides counselors in the appropriate selection, interpretation, and…
Descriptors: Evaluation Methods, Psychological Evaluation, Psychological Testing, Test Selection
Berliner, David C. – Teachers College Record, 2015
Trying to understand PISA is analogous to the parable of the blind men and the elephant. There are many facets of the PISA program, and thus many ways to both applaud and critique this ambitious international program of assessment that has gained enormous importance in the crafting of contemporary educational policy. One of the facets discussed in…
Descriptors: Achievement Tests, Standardized Tests, Educational Assessment, Educational Indicators
Plucker, Jonathan A.; Qian, Meihua; Schmalensee, Stephanie L. – Creativity Research Journal, 2014
In recent years, the social sciences have seen a resurgence in the study of divergent thinking (DT) measures. However, many of these recent advances have focused on abstract, decontextualized DT tasks (e.g., list as many things as you can think of that have wheels). This study provides a new perspective by exploring the reliability and validity…
Descriptors: Creative Thinking, Creativity Tests, Scoring Formulas, Evaluation Methods
Choi, Ick Kyu – ProQuest LLC, 2013
At the University of California, Los Angeles, the Test of Oral Proficiency (TOP), an internally developed oral proficiency test, is administered to international teaching assistant (ITA) candidates to ensure an appropriate level of academic oral English proficiency. Test taker performances are rated live by two raters according to four subscales.…
Descriptors: Screening Tests, Profiles, Oral Language, English