ERIC - Search Results

Publication Date

In 2025	2
Since 2024	6
Since 2021 (last 5 years)	16
Since 2016 (last 10 years)	35
Since 2006 (last 20 years)	134

Descriptor

Evaluation Methods	292
Test Reliability	150
Reliability	116
Test Validity	106
Student Evaluation	79
Validity	66
Higher Education	49
Test Construction	40
Elementary Secondary Education	39
Interrater Reliability	39
Measurement Techniques	38
Foreign Countries	36
Psychometrics	29
Educational Assessment	27
Evaluation Criteria	27
Performance Based Assessment	24
Program Evaluation	23
Models	21
Measures (Individuals)	19
Teacher Evaluation	19
Scores	18
Academic Achievement	17
Research Methodology	17
Teaching Methods	17
College Students	16

Publication Type

Reports - Descriptive	292
Journal Articles	226
Speeches/Meeting Papers	20
Opinion Papers	18
Tests/Questionnaires	12
Information Analyses	10
Reports - Evaluative	9
Reports - Research	5
Guides - Classroom - Teacher	4
Guides - Non-Classroom	4
Numerical/Quantitative Data	3
Books	1
ERIC Publications	1
Guides - General	1
Translations	1

Education Level

Higher Education	39
Elementary Secondary Education	25
Postsecondary Education	21
Adult Education	10
Elementary Education	10
Secondary Education	8
Early Childhood Education	7
High Schools	5
Middle Schools	4
Preschool Education	3
Primary Education	2
Adult Basic Education	1
Junior High Schools	1
Kindergarten	1
Two Year Colleges	1

Audience

Practitioners	16
Researchers	10
Teachers	10
Administrators	5
Policymakers	4
Counselors	1

Location

United Kingdom	9
Australia	5
United Kingdom (England)	5
Vermont	5
Florida	3
Massachusetts	3
New York	3
United States	3
Connecticut	2
Nebraska	2
New Hampshire	2
Rhode Island	2
Arkansas (Little Rock)	1
Austria	1
California	1
California (Oakland)	1
Canada	1
Colorado (Denver)	1
India	1
Ireland	1
Japan	1
Kansas	1
Malaysia	1
Mexico	1
Minnesota	1

Laws, Policies, & Programs

Every Student Succeeds Act…	4
No Child Left Behind Act 2001	4
Education Amendments 1974	1
Elementary and Secondary…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1

Showing 1 to 15 of 292 results

Technical Adequacy-Reliability

Peer reviewed

Susan K. Johnsen – Gifted Child Today, 2025

The author provides information about reliability and areas that educators should examine in determining if an assessment is consistent and trustworthy for use, and how it should be interpreted in making decisions about students. Reliability areas that are discussed in the column include internal consistency, test-retest or stability, inter-scorer…

Descriptors: Test Reliability, Academically Gifted, Student Evaluation, Error of Measurement

The Value of Expanding Perspectives on Assessment

Peer reviewed

Janice Kinghorn; Katherine McGuire; Bethany L. Miller; Aaron Zimmerman – Assessment Update, 2024

In this article, the authors share their reflections on how different experiences and paradigms have broadened their understanding of the work of assessment in higher education. As they collaborated to create a panel for the 2024 International Conference on Assessing Quality in Higher Education, they recognized that they, as assessment…

Descriptors: Higher Education, Assessment Literacy, Evaluation Criteria, Evaluation Methods

Evaluation of Maximal Reliability for Multidimensional Measuring Instruments Using Structural Equation Modeling

Peer reviewed

Tenko Raykov; Bingsheng Zhang – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Multidimensional measuring instruments are often used in behavioral, social, educational, marketing, and biomedical research. For these scales, the paper discusses how to find the optimal score based on their components that is associated with the highest possible reliability. Within the framework of structural equation modeling, an approach to…

Descriptors: Multidimensional Scaling, Measurement Equipment, Measurement Techniques, Test Reliability

Assessment as Pedagogy: Inviting Authenticity through Relationality, Vulnerability and Wonder

Peer reviewed

Claire Timperley; Kate Schick – Teaching in Higher Education, 2025

Traditional authentic assessment tasks are frequently tied to future work and enmeshed in neoliberal and capitalist visions of education. We advocate an alternative approach where authenticity signifies meaningful learning outside the confines of the classroom to promote deep learning that 'sticks'. We proffer an understanding of "assessment…

Descriptors: Performance Based Assessment, Philosophy, World Views, Instruction

Transforming Assessment: The Impacts and Implications of Large Language Models and Generative AI

Peer reviewed

Jiangang Hao; Alina A. von Davier; Victoria Yaneva; Susan Lottridge; Matthias von Davier; Deborah J. Harris – Educational Measurement: Issues and Practice, 2024

The remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting-edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely,…

Descriptors: Evaluation Methods, Artificial Intelligence, Educational Change, Computer Software

A Tutorial on Aggregating Evidence from Conceptual Replication Studies Using the Product Bayes Factor

Peer reviewed

Caspar J. Van Lissa; Eli-Boaz Clapper; Rebecca Kuiper – Research Synthesis Methods, 2024

The product Bayes factor (PBF) synthesizes evidence for an informative hypothesis across heterogeneous replication studies. It can be used when fixed- or random-effects meta-analysis falls short. For example, when effect sizes are incomparable and cannot be pooled, or when studies diverge significantly in the populations, study designs, and…

Descriptors: Hypothesis Testing, Evaluation Methods, Replication (Evaluation), Sample Size

A Computationally Simple Method for Estimating Decision Consistency

Peer reviewed

Wolkowitz, Amanda A. – Journal of Educational Measurement, 2021

Decision consistency (DC) is the reliability of a classification decision based on a test score. In professional credentialing, the decision is often a high-stakes pass/fail decision. The current methods for estimating DC are computationally complex. The purpose of this research is to provide a computationally and conceptually simple method for…

Descriptors: Decision Making, Reliability, Classification, Scores

Assessment Strategies for Reflective Learning in the Workplace: A Pragmatic Approach

Peer reviewed

Roessger, Kevin M. – Adult Learning, 2020

Practitioners often struggle to assess reflective learning in the workplace because of difficulties conceptualizing reflection and its effects in the workplace. This article addresses this problem by offering a pragmatic approach to assessment that asks practitioners to specify why they are using reflection, what they are hoping to gain from it,…

Descriptors: Workplace Learning, Evaluation Methods, Reflection, Adult Education

Mark Scheme Design for School- and College-Based Assessment in VTQs

Peer reviewed

Williamson, Joanna; Child, Simon – Journal of Vocational Education and Training, 2022

School- and college-based vocational and technical qualifications (VTQs) in England are required to award successful candidates a grade rather than simple pass or fail. Ensuring the reliability and validity of these grades is considered vital, particularly in light of the high-stakes purposes for which school assessment results in England are…

Descriptors: Foreign Countries, Vocational Education, Qualifications, Student Evaluation

Procedures for Reliable Cultural Model Analysis Using Semi-Structured Interviews

Peer reviewed

Price, Heather E.; Smith, Christian – Field Methods, 2021

To identify the dominant cultural models among parents transmitting faith to their children, we find few methodological guidelines to guide coding and analysis of semi-structured interviews. We thus developed a three-phase procedure for our research team. Phase-one follows Campbell et al. by unitizing on meanings rather than words/pages, including…

Descriptors: Semi Structured Interviews, Parents, Religion, Reliability

Program Administration Scale (PAS): Measuring Whole Leadership in Early Childhood Centers, Third Edition

Talan, Teri N.; Bella, Jill M.; Bloom, Paula Jorde – Teachers College Press, 2022

The "Program Administration Scale" (PAS) is designed to reliably measure and improve the leadership and management practices of center-based programs--the only instrument of its kind to focus exclusively on organization-wide administrative issues. In the third edition, the authors share updated information supporting the reliability and…

Descriptors: Program Administration, Evaluation Methods, Leadership, Early Childhood Education

Pedagogical Considerations for Examining Rater Variability in Rater-Mediated Assessments: A Three-Model Framework

Peer reviewed

Wesolowski, Brian C.; Wind, Stefanie A. – Journal of Educational Measurement, 2019

Rater-mediated assessments are a common methodology for measuring persons, investigating rater behavior, and/or defining latent constructs. The purpose of this article is to provide a pedagogical framework for examining rater variability in the context of rater-mediated assessments using three distinct models. The first model is the observation…

Descriptors: Interrater Reliability, Models, Observation, Measurement

Framing the Constructive Alignment of Design within Technology Subjects in General Education

Peer reviewed

Buckley, Jeffrey; Seery, Niall; Gumaelius, Lena; Canty, Donal; Doyle, Andrew; Pears, Arnold – International Journal of Technology and Design Education, 2021

Design is a core element of general technology education internationally. While there is a degree of contention with regards to its treatment, there is general consensus that the inclusion of design in some form is important, if not characteristic, of the subject area. Acknowledging that design is important, there are many questions which need to be…

Descriptors: Alignment (Education), Design, Guidelines, Learning Theories

DEI Institutionalization: Measuring Diversity, Equity, and Inclusion in Postsecondary Education

Peer reviewed

Cumming, Tammie; Miller, M. David; Leshchinskaya, Isana – Change: The Magazine of Higher Learning, 2023

In 2021, the Council for Higher Education Accreditation (CHEA) made a monumental move to require postsecondary institutions to evaluate and document their actions to ensure fairness in admissions, an inclusive learning environment, and equitable student outcomes. Around the same time, a team comprising educational measurement experts, diversity…

Descriptors: Diversity, Equal Education, Inclusion, Postsecondary Education

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

Source

Educational and Psychological…	9
Assessment & Evaluation in…	4
International Journal of…	4
International Journal of…	4
American Journal of Evaluation	3
Assessment Update	3
Diagnostique	3
Educational Measurement:…	3
Journal of Teacher Education	3
American Journal of Distance…	2
American Journal of…	2
Appalachia Educational…	2
Assessment and Accountability…	2
British Educational Research…	2
British Journal of…	2
Career Development Quarterly	2
Child Abuse & Neglect: The…	2
Educational Evaluation and…	2
Educational Technology	2
Gifted Child Today	2
Journal of Autism and…	2
Journal of Educational…	2
Journal of Educational and…	2
Measurement in Physical…	2
New Directions for Evaluation	2

Author

Halle, Tamara	3
Darling-Hammond, Linda	2
Dietel, Ronald	2
Epstein, Michael H.	2
Herman, Joan L.	2
Hughes, Georgia K.	2
Moodie, Shannon	2
Osmundson, Ellen	2
Aaron Zimmerman	1
Abedi, Jamal	1
Adams, Stephanie G.	1
Alina A. von Davier	1
Almeida, M. Joao C. A.	1
Alonso, Ariel	1
Amidon, Edmund	1
Anderson, Cynthia M.	1
Anderson, William L.	1
Antony, Martin M.	1
Appleton, James J.	1
Ari, Omer	1
Atkins, David C.	1
Atkinson, Nancy L.	1
Austin, Christy R.	1
Avery, Marybell	1

Assessments and Surveys

National Assessment of…	3
ACT Assessment	2
Dynamic Indicators of Basic…	2
New York State Regents…	2
Autism Diagnostic Observation…	1
Battelle Developmental…	1
Bayley Scales of Infant…	1
Behavioral and Emotional…	1
College Level Academic Skills…	1
College Level Examination…	1
College Student Experiences…	1
Collegiate Assessment of…	1
Denver Developmental…	1
Developmental Indicators for…	1
Early Childhood Environment…	1
Florida Comprehensive…	1
Graduate Management Admission…	1
Infant Toddler Environment…	1
Iowa Tests of Basic Skills	1
National Assessment of Adult…	1
Praxis Series	1
Preliminary Scholastic…	1
Program for International…	1
Scales of Independent Behavior	1
Self Directed Search	1