Publication Date
In 2025 | 12 |
Since 2024 | 45 |
Since 2021 (last 5 years) | 101 |
Since 2016 (last 10 years) | 177 |
Since 2006 (last 20 years) | 339 |
Descriptor
Evaluation Methods | 441 |
Foreign Countries | 441 |
Test Reliability | 215 |
Reliability | 163 |
Test Validity | 163 |
Student Evaluation | 129 |
Validity | 90 |
Interrater Reliability | 78 |
Student Attitudes | 65 |
Test Construction | 63 |
Higher Education | 58 |
More ▼ |
Source
Author
Bourke, Sid | 2 |
Boyle, Michael H. | 2 |
Bramley, Tom | 2 |
Cunningham, Charles E. | 2 |
Darling-Hammond, Linda | 2 |
Eng, Lin Siew | 2 |
Gamliel, Eyal | 2 |
Gillis, Shelley | 2 |
Ginns, Paul | 2 |
Godbout, Paul | 2 |
Heldsinger, Sandra | 2 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 7 |
Researchers | 6 |
Teachers | 4 |
Administrators | 2 |
Location
Australia | 44 |
United Kingdom | 41 |
Canada | 28 |
China | 28 |
Turkey | 27 |
United Kingdom (England) | 26 |
Netherlands | 19 |
Israel | 16 |
United States | 15 |
Spain | 12 |
Taiwan | 12 |
More ▼ |
Laws, Policies, & Programs
Every Student Succeeds Act… | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Constructing a Roadmap to Measure the Quality of Business Assessments Aimed at Curriculum Management
Silva, Thanuci; Santos, Regiane dos; Mallet, Débora – Journal of Education for Business, 2023
Assuring the quality of education is a concern of learning institutions. To do so, it is necessary to have assertive learning management, with consistent data on students' outcomes. This research provides associate deans and researchers, a roadmap with which to gather evidence to improve the quality of open-ended assessments. Based on statistical…
Descriptors: Student Evaluation, Evaluation Methods, Business Education, Higher Education
Riana Nurhayati; Suranto Aw; Siti Irene Astuti Dwiningrum; Mami Hajaroh; Herwin Herwin – International Journal of Educational Methodology, 2024
Evaluation of child-friendly school (CFS) policies is essential to determine the achievements of school efforts in reducing violence cases. This research aims to proving the reliability and validity of CFS policy evaluation instruments in elementary schools with different locations. This investigation uses the Context Input Process Product (CIPP)…
Descriptors: Validity, Reliability, School Policy, Program Evaluation
Shasha Chen; Shaohui Chi; Zuhao Wang – Journal of Baltic Science Education, 2025
Interdisciplinary thinking is critical for equipping students to apply scientific knowledge and tackle societal challenges across various disciplines, which has been recognized as a key objective of twenty-first century science education. However, research on effective interdisciplinary assessment in secondary school science education is still…
Descriptors: Thinking Skills, Interdisciplinary Approach, Science Instruction, Grade 7
Qiong Wu; Liping Gu – Sociological Methods & Research, 2024
Family income questions in general purpose surveys are usually collected with either a single-question summary design or a multiple-question disaggregation design. It is unclear how estimates from the two approaches agree with each other. The current paper takes advantage of a large-scale survey that has collected family income with both methods.…
Descriptors: Foreign Countries, Family Income, Questionnaires, Research Design
Hua Yuan; Yunmei Wu; Hui Tao; Jun Yin; Ying Fang; Junjie Zhang; Yun Zhang – International Journal of Technology and Design Education, 2025
This paper introduces a framework aimed at assessing the sustainability of fashion designers, intending to evaluate their proficiency in sustainability and enhance higher education in design. To establish a system for assessing and evaluating sustainable design competence, we initiated interviews with both designers and fashion design students.…
Descriptors: Clothing, Design, Sustainability, Reliability
Lucy Chambers; Sylvia Vitello; Carmen Vidal Rodeiro – Assessment in Education: Principles, Policy & Practice, 2024
In England, some secondary-level qualifications comprise non-exam assessments which need to undergo moderation before grading. Currently, moderation is conducted at centre (school) level. This raises challenges for maintaining the standard across centres. Recent technological advances enable novel moderation methods that are no longer bound by…
Descriptors: Foreign Countries, Evaluation Methods, Comparative Analysis, Grading
Marjahan Begum; Pontus Haglund; Ari Korhonen; Violetta Lonati; Mattia Monga; Filip Strömbäck; Artturi Tilanterä – Informatics in Education, 2024
There can be many reasons why students fail to answer correctly to summative tests in advanced computer science courses: often the cause is a lack of prerequisites or misconceptions about topics presented in previous courses. One of the ITiCSE 2020 working groups investigated the possibility of designing assessments suitable for differentiating…
Descriptors: Foreign Countries, College Students, Prerequisites, Computer Science Education
Ilona Rinne – Assessment & Evaluation in Higher Education, 2024
It is widely acknowledged in research that common criteria and aligned standards do not result in consistent assessment of such a complex performance as the final undergraduate thesis. Assessment is determined by examiners' understanding of rubrics and their views on thesis quality. There is still a gap in the research literature about how…
Descriptors: Foreign Countries, Undergraduate Students, Teacher Education Programs, Evaluation Criteria
Yang Yang – Shanlax International Journal of Education, 2024
This paper explores the reliability of using ChatGPT in evaluating EFL writing by assessing its intra- and inter-rater reliability. Eighty-two compositions were randomly sampled from the Written English Corpus of Chinese Learners. These compositions were rated by three experienced raters with regard to 'language', 'content', and 'organization'.…
Descriptors: English (Second Language), Second Language Instruction, Writing (Composition), Evaluation Methods
Juan M. Sanchez – Journal of Biological Education, 2024
Bias assessment (systematic errors) is fundamental in industry and service laboratories, where reliable results must be obtained to give correct answers to specific problems. Therefore, knowledge and practice in quality methodologies is of fundamental importance for students. Unfortunately, laboratory lessons often focus on connecting theory and…
Descriptors: Achievement Tests, Science Laboratories, Biology, Science Education
Simon Massey – International Journal of Social Research Methodology, 2024
The UK-based article develops a quantitative method for measuring 8-9-year-old children's Gender Ability Beliefs through drawings, assessing the reliability and validity of the measure and its association with respondents' self-reported gender. The measure, originally used in the US by Beilock et al. (2010), required respondents to draw two…
Descriptors: Children, Sex, Childrens Attitudes, Gender Differences
Pinar Mihci Türker; Ömer Kirmaci; Emrah Kayabasi; Erinç Karatas; Ebru Kiliç Çakmak; Serçin Karatas – Journal of Educational Technology and Online Learning, 2024
The COVID-19 epidemic has precipitated a rapid and widespread adoption of online education, leading to its normalization in contemporary society. Online education is evident across several educational levels. However, assessing the efficacy and effectiveness of these training programs can only be achieved by implementing a suitable evaluation…
Descriptors: Online Courses, Distance Education, Evaluation Methods, Test Construction
Marine Simon; Alexandra Budke – Journal of Geography in Higher Education, 2024
Comparison is an important geographic method and a common task in geography education. Mastering comparison is a complex competency and written comparisons are challenging tasks both for students and assessors. As yet, however, there is no set test for evaluating comparison competency nor tool for enhancing it. Moreover, little is known about…
Descriptors: Geography Instruction, Student Evaluation, Comparative Analysis, Reliability
Delia Leuenberger; Elisabeth Moser Opitz; Noemi Gloor – Journal of Numerical Cognition, 2024
Computation competence (CC) in simple addition and subtraction using non-counting (NC) strategies is an important learning objective in Grade 1 mathematics but many children, especially low achievers in mathematics, struggle to acquire these skills. To provide these students with the support they need, it is important to have valid and reliable…
Descriptors: Computation, Mathematics Skills, Addition, Subtraction
Chao Han; Binghan Zheng; Mingqing Xie; Shirong Chen – Interpreter and Translator Trainer, 2024
Human raters' assessment of interpreting is a complex process. Previous researchers have mainly relied on verbal reports to examine this process. To advance our understanding, we conducted an empirical study, collecting raters' eye-movement and retrospection data in a computerised interpreting assessment in which three groups of raters (n = 35)…
Descriptors: Foreign Countries, College Students, College Graduates, Interrater Reliability