Timothy R. Konold; Elizabeth A. Sanders; Kelvin Afolabi – Structural Equation Modeling: A Multidisciplinary Journal, 2025
Measurement invariance (MI) is an essential part of validity evidence concerned with ensuring that tests function similarly across groups, contexts, and time. Most evaluations of MI involve multigroup confirmatory factor analyses (MGCFA) that assume simple structure. However, recent research has shown that constraining non-target indicators to…
Descriptors: Evaluation Methods, Error of Measurement, Validity, Monte Carlo Methods
Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2025
This study argues that readability formulas should be used as one measure of linguistic equivalence when adapting psychometric scales from one language to another. Assuming that the psychological structure being measured was not changed, it was observed that calculated readability levels and interpretations were different for two different…
Descriptors: Psychometrics, Evaluation Methods, Readability, Media Adaptation
Conor O. Chandler; Irina Proskorovsky – Research Synthesis Methods, 2024
In health technology assessment, matching-adjusted indirect comparison (MAIC) is the most common method for pairwise comparisons that control for imbalances in baseline characteristics across trials. One of the primary challenges in MAIC is the need to properly account for the additional uncertainty introduced by the matching process. Limited…
Descriptors: Predictor Variables, Influence of Technology, Evaluation Methods, Methods Research
Yuting Han; Zhehan Jiang; Lingling Xu; Fen Cai – AERA Online Paper Repository, 2024
To address the computational constraints of parameter estimation in the polytomous Cognitive Diagnosis Model (pCDM) in large-scale, high-data-volume settings, this study proposes two two-stage polytomous attribute estimation methods: P_max and P_linear. The effects of the two-stage methods were studied via a Monte Carlo simulation study, and the…
Descriptors: Medical Education, Licensing Examinations (Professions), Measurement Techniques, Statistical Data
Robert Kwadwo Siemoh; Prince Duku; Sampson Boye – Discover Education, 2025
Pedagogical content knowledge (PCK) is important for teachers' instructional effectiveness. This study investigated the overall self-reported level of PCK among in-service elementary school science teachers in a municipality in Ghana. An In-Service Elementary Science Teachers Self-Reported PCK (IEST-SR-PCK) scale was…
Descriptors: Science Teachers, Science Instruction, Pedagogical Content Knowledge, Factor Analysis
Stefanie A. Wind; Benjamin Lugu; Yurou Wang – International Journal of Testing, 2025
Mokken Scale Analysis (MSA) is a nonparametric approach that offers exploratory tools for understanding the nature of item responses while emphasizing invariance requirements. MSA is often discussed as it relates to Rasch measurement theory, which also emphasizes invariance, but uses parametric models. Researchers who have compared and combined…
Descriptors: Item Response Theory, Scaling, Surveys, Evaluation Methods
Paschalis Karakasis; Konstantinos I. Bougioukas; Konstantinos Pamporis; Nikolaos Fragakis; Anna-Bettina Haidich – Research Synthesis Methods, 2024
This study aimed to assess the methods and outcomes of A MeaSurement Tool to Assess systematic Reviews (AMSTAR) 2 appraisals in overviews of reviews (overviews) of interventions in the cardiovascular field and to identify factors that are associated with these outcomes. MEDLINE, Scopus, and the Cochrane Database of Systematic Reviews were searched…
Descriptors: Human Body, Intervention, Literature Reviews, Measurement Techniques
Jahabar, Jahangeer Mohamed; Toh, Tin Lam; Tay, Eng Guan; Tong, Cherng Luen – Mathematics Education Research Group of Australasia, 2023
Big Ideas in school mathematics can be seen as overarching concepts that occur in various mathematical topics in a syllabus. For teachers, this knowledge can be used to help students develop a better understanding of mathematics by making visible the central ideas and the connections across topics and across levels. For students, this knowledge can…
Descriptors: Mathematics Education, Mathematics Instruction, Mathematics Curriculum, Teaching Methods
Pearce, Jacob; Chiavaroli, Neville; Tavares, Walter – Advances in Health Sciences Education, 2023
This paper is motivated by a desire to advance assessment in the health professions through encouraging the judicious and productive use of metaphors. Through five specific examples (pixels, driving lesson/test, jury deliberations, signal processing, and assessment as a toolbox), we interrogate how metaphors are being used in assessment to…
Descriptors: Figurative Language, Evaluation Methods, Measurement Techniques, Allied Health Occupations
Justin Kompf; Ryan Rhodes – Measurement in Physical Education and Exercise Science, 2024
The measurement of resistance training (RT) is often based on adaptations of aerobic physical activity measures which may not contain the elements necessary to assess RT. The purpose of this systematic review was to examine what measures are used to assess RT and appraise their composition. Specifically, the inclusion of frequency, duration,…
Descriptors: Physical Fitness, Training, Muscular Strength, Evaluation Methods
Stefanie A. Wind; Benjamin Lugu – Applied Measurement in Education, 2024
Researchers who use measurement models for evaluation purposes often select models with stringent requirements, such as Rasch models, which are parametric. Mokken Scale Analysis (MSA) offers a theory-driven nonparametric modeling approach that may be more appropriate for some measurement applications. Researchers have discussed using MSA as a…
Descriptors: Item Response Theory, Data Analysis, Simulation, Nonparametric Statistics
María Pilar García-Rodríguez; Sara Conde-Velez; Manuel Delgado-García; José Carmona Márquez – Learning Environments Research, 2024
We present the validation of a questionnaire for compulsory secondary school students (seventh to tenth grade), designated "Educational learning environments for ESO pupils" (CEApA_ESO), for the purpose of evaluating learning environments. Although many instruments have been developed in this area, our work attempts to comprehensively…
Descriptors: Educational Environment, Compulsory Education, Secondary Education, Grade 7
Steffen Zitzmann; Lisa Bardach; Kai T. Horstmann; Matthias Ziegler; Martin Hecht – Structural Equation Modeling: A Multidisciplinary Journal, 2024
We investigated three different approaches for quantifying individual change and reporting it back to persons: (a) the common change score, which is obtained by first computing scale scores from two consecutive measurements and then subtracting these scores from one another, (b) the ad-hoc approach, which is similar to the former approach but uses…
Descriptors: Personality Change, Personality Measures, Regression (Statistics), Evaluation Methods
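The common change score described as approach (a) in the record above can be illustrated with a minimal sketch. It assumes that a scale score is the mean of a person's item responses and that change is taken as the later score minus the earlier one; neither detail is stated in the abstract, and the function names and sample responses are hypothetical.

```python
from statistics import mean


def scale_score(item_responses):
    """Scale score, assumed here to be the mean of the item responses."""
    return mean(item_responses)


def change_score(items_t1, items_t2):
    """Common change score: scale score at time 2 minus scale score at time 1.

    The direction of subtraction is an assumption made for this sketch.
    """
    return scale_score(items_t2) - scale_score(items_t1)


# Hypothetical responses of one person to a four-item scale at two time points.
time1 = [2, 3, 3, 4]
time2 = [3, 4, 4, 5]
delta = change_score(time1, time2)
print(delta)
```

A positive value indicates a higher scale score at the second measurement under these assumptions.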
Tenko Raykov; Bingsheng Zhang – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Multidimensional measuring instruments are often used in behavioral, social, educational, marketing, and biomedical research. For these scales, the paper discusses how to find the optimal score based on their components that is associated with the highest possible reliability. Within the framework of structural equation modeling, an approach to…
Descriptors: Multidimensional Scaling, Measurement Equipment, Measurement Techniques, Test Reliability
Ha Pho; Marian A. Dyer; Jaime Vallejos; Jill Hendrickson Lohmeier – American Journal of Evaluation, 2024
Although most evaluators are familiar with participatory evaluation (PE), the ability to measure stakeholder participation in an evaluation remains challenging. Based on Cousins and Whitmore's (1998) PE theoretical model, Daigneault and Jacob (2009, 2012, 2014) developed an instrument for measuring the degree to which an evaluation can be…
Descriptors: Program Evaluation, Public School Teachers, Evaluation Methods, Participation