Susan K. Johnsen – Gifted Child Today, 2025
The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…
Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement
Janice Kinghorn; Katherine McGuire; Bethany L. Miller; Aaron Zimmerman – Assessment Update, 2024
In this article, the authors share their reflections on how different experiences and paradigms have broadened their understanding of the work of assessment in higher education. As they collaborated to create a panel for the 2024 International Conference on Assessing Quality in Higher Education, they recognized that they, as assessment…
Descriptors: Higher Education, Assessment Literacy, Evaluation Criteria, Evaluation Methods
Tenko Raykov; Bingsheng Zhang – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Multidimensional measuring instruments are often used in behavioral, social, educational, marketing, and biomedical research. For these scales, the paper discusses how to find the optimal score based on their components that is associated with the highest possible reliability. Within the framework of structural equation modeling, an approach to…
Descriptors: Multidimensional Scaling, Measurement Equipment, Measurement Techniques, Test Reliability
Claire Timperley; Kate Schick – Teaching in Higher Education, 2025
Traditional authentic assessment tasks are frequently tied to future work and enmeshed in neoliberal and capitalist visions of education. We advocate an alternative approach where authenticity signifies meaningful learning outside the confines of the classroom to promote deep learning that 'sticks'. We proffer an understanding of "assessment…
Descriptors: Performance Based Assessment, Philosophy, World Views, Instruction
Jiangang Hao; Alina A. von Davier; Victoria Yaneva; Susan Lottridge; Matthias von Davier; Deborah J. Harris – Educational Measurement: Issues and Practice, 2024
The remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting-edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely,…
Descriptors: Evaluation Methods, Artificial Intelligence, Educational Change, Computer Software
Caspar J. Van Lissa; Eli-Boaz Clapper; Rebecca Kuiper – Research Synthesis Methods, 2024
The product Bayes factor (PBF) synthesizes evidence for an informative hypothesis across heterogeneous replication studies. It can be used when fixed- or random-effects meta-analysis falls short, for example, when effect sizes are incomparable and cannot be pooled, or when studies diverge significantly in the populations, study designs, and…
Descriptors: Hypothesis Testing, Evaluation Methods, Replication (Evaluation), Sample Size
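As a rough illustration of the idea in the entry above, the sketch below combines per-study Bayes factors by multiplying them, which is the basic operation the name "product Bayes factor" refers to. It assumes the studies are independent and that each Bayes factor compares the same informative hypothesis against the same alternative. The function name `product_bayes_factor` and the input values are hypothetical and are not taken from the article.

```python
import math


def product_bayes_factor(bayes_factors):
    """Multiply per-study Bayes factors into a single product.

    Assumption (not from the source): every value is a positive Bayes
    factor for the same hypothesis pair, from independent studies.
    The product is computed on the log scale to limit overflow and
    underflow when many studies are combined.
    """
    if not bayes_factors:
        raise ValueError("at least one Bayes factor is required")
    if any(bf <= 0 for bf in bayes_factors):
        raise ValueError("Bayes factors must be positive")
    log_product = sum(math.log(bf) for bf in bayes_factors)
    return math.exp(log_product)


if __name__ == "__main__":
    # Made-up values for three hypothetical replication studies.
    example = [2.0, 3.0, 0.5]
    print(product_bayes_factor(example))
```

A product above 1 would indicate that, taken together, the studies favor the informative hypothesis; a single study with a Bayes factor well below 1 can pull the product down sharply.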
Wolkowitz, Amanda A. – Journal of Educational Measurement, 2021
Decision consistency (DC) is the reliability of a classification decision based on a test score. In professional credentialing, the decision is often a high-stakes pass/fail decision. The current methods for estimating DC are computationally complex. The purpose of this research is to provide a computationally and conceptually simple method for…
Descriptors: Decision Making, Reliability, Classification, Scores
Roessger, Kevin M. – Adult Learning, 2020
Practitioners often struggle to assess reflective learning in the workplace because of difficulties conceptualizing reflection and its effects in the workplace. This article addresses this problem by offering a pragmatic approach to assessment that asks practitioners to specify why they are using reflection, what they are hoping to gain from it,…
Descriptors: Workplace Learning, Evaluation Methods, Reflection, Adult Education
Williamson, Joanna; Child, Simon – Journal of Vocational Education and Training, 2022
School- and college-based vocational and technical qualifications (VTQs) in England are required to award successful candidates a grade rather than simple pass or fail. Ensuring the reliability and validity of these grades is considered vital, particularly in light of the high-stakes purposes for which school assessment results in England are…
Descriptors: Foreign Countries, Vocational Education, Qualifications, Student Evaluation
Price, Heather E.; Smith, Christian – Field Methods, 2021
To identify the dominant cultural models among parents transmitting faith to their children, we find few methodological guidelines to guide coding and analysis of semi-structured interviews. We thus developed a three-phase procedure for our research team. Phase-one follows Campbell et al. by unitizing on meanings rather than words/pages, including…
Descriptors: Semi Structured Interviews, Parents, Religion, Reliability
Talan, Teri N.; Bella, Jill M.; Bloom, Paula Jorde – Teachers College Press, 2022
The "Program Administration Scale" (PAS) is designed to reliably measure and improve the leadership and management practices of center-based programs--the only instrument of its kind to focus exclusively on organization-wide administrative issues. In the third edition, the authors share updated information supporting the reliability and…
Descriptors: Program Administration, Evaluation Methods, Leadership, Early Childhood Education
Wesolowski, Brian C.; Wind, Stefanie A. – Journal of Educational Measurement, 2019
Rater-mediated assessments are a common methodology for measuring persons, investigating rater behavior, and/or defining latent constructs. The purpose of this article is to provide a pedagogical framework for examining rater variability in the context of rater-mediated assessments using three distinct models. The first model is the observation…
Descriptors: Interrater Reliability, Models, Observation, Measurement
Buckley, Jeffrey; Seery, Niall; Gumaelius, Lena; Canty, Donal; Doyle, Andrew; Pears, Arnold – International Journal of Technology and Design Education, 2021
Design is a core element of general technology education internationally. While there is a degree of contention with regard to its treatment, there is general consensus that the inclusion of design in some form is important, if not characteristic, of the subject area. Acknowledging that design is important, there are many questions which need to be…
Descriptors: Alignment (Education), Design, Guidelines, Learning Theories
Cumming, Tammie; Miller, M. David; Leshchinskaya, Isana – Change: The Magazine of Higher Learning, 2023
In 2021, the Council for Higher Education Accreditation (CHEA) made a monumental move to require postsecondary institutions to evaluate and document their actions to ensure fairness in admissions, an inclusive learning environment, and equitable student outcomes. Around the same time, a team comprising educational measurement experts, diversity…
Descriptors: Diversity, Equal Education, Inclusion, Postsecondary Education
Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
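Several of the entries above mention internal consistency. A minimal sketch of one common estimate, coefficient alpha, is given here under stated assumptions: scores are arranged as one row per respondent and one column per item, and population variances are used throughout, in which case the result for 0/1 items matches the KR-20 formula. The function name `cronbach_alpha` and the sample data are hypothetical and do not come from any of the listed articles.

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """Coefficient alpha for a respondents-by-items score table.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score))

    Assumption (not from the source): population variances are used for
    both the items and the total, so dichotomous items give KR-20.
    """
    if not scores:
        raise ValueError("at least one respondent is required")
    n_items = len(scores[0])
    if n_items < 2:
        raise ValueError("at least two items are required")
    if any(len(row) != n_items for row in scores):
        raise ValueError("every respondent needs a score on every item")

    # Variance of each item (column) across respondents.
    item_variances = [
        pvariance([row[j] for row in scores]) for j in range(n_items)
    ]
    # Variance of each respondent's total score.
    total_variance = pvariance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores have no variance; alpha is undefined")

    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


if __name__ == "__main__":
    # Made-up responses: four respondents, three items scored 0 or 1.
    responses = [
        [1, 1, 1],
        [0, 0, 0],
        [1, 1, 1],
        [0, 0, 0],
    ]
    print(cronbach_alpha(responses))
```

In this made-up table every item ranks respondents identically, so the estimate sits at the top of its range; real data would normally give a lower value.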