Publication Date
| In 2026 | 0 |
| Since 2025 | 4 |
| Since 2022 (last 5 years) | 13 |
| Since 2017 (last 10 years) | 36 |
| Since 2007 (last 20 years) | 64 |
Descriptor
| Foreign Countries | 86 |
| Performance Based Assessment | 86 |
| Test Reliability | 41 |
| Reliability | 33 |
| Test Validity | 30 |
| Student Evaluation | 28 |
| Evaluation Methods | 23 |
| Interrater Reliability | 19 |
| Validity | 17 |
| Test Construction | 13 |
| Comparative Analysis | 12 |
| More ▼ | |
Source
Author
| Darling-Hammond, Linda | 2 |
| Gillis, Shelley | 2 |
| Godbout, Paul | 2 |
| Yorke, Mantz | 2 |
| Admiraal, Wilfried | 1 |
| Ahmed, Tamim | 1 |
| Ajjawi, Rola | 1 |
| Akkanat, Cigdem | 1 |
| Aktas, Mehtap | 1 |
| Amirhossein Rasooli | 1 |
| Amy Jackson | 1 |
| More ▼ | |
Publication Type
Education Level
Audience
Location
| United Kingdom | 15 |
| Australia | 14 |
| Netherlands | 8 |
| United Kingdom (England) | 7 |
| Japan | 6 |
| Canada | 5 |
| Turkey | 5 |
| Indonesia | 4 |
| Connecticut | 3 |
| India | 3 |
| Singapore | 3 |
| More ▼ | |
Laws, Policies, & Programs
| Every Student Succeeds Act… | 2 |
Assessments and Surveys
| National Assessment of… | 2 |
| New York State Regents… | 2 |
| Raven Progressive Matrices | 1 |
| Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Stefan K. Schauber; Anne O. Olsen; Erik L. Werner; Morten Magelssen – Advances in Health Sciences Education, 2024
Introduction: Research in various areas indicates that expert judgment can be highly inconsistent. However, expert judgment is indispensable in many contexts. In medical education, experts often function as examiners in rater-based assessments. Here, disagreement between examiners can have far-reaching consequences. The literature suggests that…
Descriptors: Medical Students, Performance Based Assessment, Expertise, Interrater Reliability
Rafikul Islam; Azilah Anis; Md Siddique E. Azam – Journal of Applied Research in Higher Education, 2025
Purpose: SETARA is a well-known university rating tool in Malaysia. The study aims to enhance the transparency, accuracy, and reliability of SETARA assessment instrument by improving its weighting scheme for the domains, sub-domains, criteria, and indicators. Design/methodology/approach: The study utilized a quantitative research design and…
Descriptors: Foreign Countries, Test Reliability, Higher Education, Performance Based Assessment
Gübes, Nese Öztürk – Participatory Educational Research, 2021
The aim of this study is to show how a many-facet Rasch measurement model (MFRM) can be used for quality control whilst monitoring a musical aptitude examination. The data used in this study was gathered from a musical aptitude examination which was applied in 2019-2020 academic year for selecting teacher candidates to a music education department…
Descriptors: Foreign Countries, Music Education, Teacher Education Programs, Preservice Teacher Education
Stephanie Baines; Pauldy Otermans; David Tree; Nicholas Worsfold – Teaching in Higher Education, 2025
Authentic assessments are seen as a promising response to many of the challenges currently facing Higher Education. Studies have identified shared characteristics of authentic assessments, but it is also argued that the term is vague and subjective. Drawing on existing frameworks we have established a standardised measure to evaluate authenticity…
Descriptors: Foreign Countries, College Faculty, College Students, Performance Based Assessment
Castillo-Diaz, Marcio Alexander; Gomes, Cristiano Mauro Assis; Jelihovschi, Enio Galinkin – International Journal of Educational Methodology, 2022
The field of studies in metacognition points to some limitations in the way the construct has traditionally been measured and shows a near absence of performance-based tests. The Meta-Text is a performance-based test recently created to assess components of cognition regulation: planning, monitoring, and judgment. This study presents the first…
Descriptors: Schemata (Cognition), Decision Making, Undergraduate Students, Foreign Countries
Zoe Stephenson; Amy Jackson; Victoria Wilkes – Assessment & Evaluation in Higher Education, 2024
The closed-door PhD and doctoral viva voce--the approach adopted in the United Kingdom--is esteemed by some as being a valuable academic tradition. However, an increasing body of literature and research has raised concerns about the quality, transparency, reliability and validity of this viva format. This systematic literature review aims to…
Descriptors: Foreign Countries, Doctoral Students, Doctoral Dissertations, Persuasive Discourse
Héctor J. Pijeira-Díaz; Shashank Subramanya; Janneke van de Pol; Anique de Bruin – Journal of Computer Assisted Learning, 2024
Background: When learning causal relations, completing causal diagrams enhances students' comprehension judgements to some extent. To potentially boost this effect, advances in natural language processing (NLP) enable real-time formative feedback based on the automated assessment of students' diagrams, which can involve the correctness of both the…
Descriptors: Learning Analytics, Automation, Student Evaluation, Causal Models
Tanaka, Mitsuko; Ross, Steven J. – Assessment in Education: Principles, Policy & Practice, 2023
Raters vary from each other in their severity and leniency in rating performance. This study examined the factors affecting rater severity in peer assessments of oral presentations in English as a Foreign Language (EFL), focusing on peer raters' self-construal and presentation abilities. Japanese university students enrolled in EFL classes…
Descriptors: Evaluators, Interrater Reliability, Item Response Theory, Peer Evaluation
Yuichiro Yokouchi – Language Testing in Asia, 2025
The performance decision tree (PDT; Fulcher et al., 2011) is a rubric style that is applicable to performance assessment, with origins in Upshur and Turner's (1995) empirically derived binary-choice, boundary-definition (EBB) scale. It is easier for raters to assess performance by evaluating multiple binary-choice descriptors. Additionally,…
Descriptors: Scoring Rubrics, Second Language Learning, Second Language Instruction, Language Teachers
Rizos, Spiridon; Sfakianaki, Eleni; Kakouris, Andreas – European Journal of Educational Management, 2022
This study investigates the quality of higher education institutes' (HEIs') administrative services by assessing student satisfaction in the context of Total Quality Management (TQM). Differences between students' perceptions and expectations of administrative service quality are examined and discussed. A questionnaire survey was developed…
Descriptors: College Administration, Educational Quality, Total Quality Management, College Students
Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018
The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…
Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability
Geraldine O’Neill – International Journal of Work-Integrated Learning, 2024
Work-integrated learning (WIL) has become an increasingly common feature of higher education curricula. Two aspects of WIL, authenticity and consistency, are valued in different ways by the stakeholders involved. Authenticity, by its very nature, supports the idea of learning being personalized and unique. Consistency, on the other hand, is…
Descriptors: Work Experience Programs, Higher Education, Reliability, Performance Based Assessment
Amirhossein Rasooli; Jim Turner; Tünde Varga-Atkins; Edd Pitt; Shaghayegh Asgari; Will Moindrot – Assessment & Evaluation in Higher Education, 2025
Groupwork is a crucial aspect of work contexts and a key twenty first century skill. Assessment of groupwork provides a persistent challenge for educators in university contexts with students reporting experiences of unfairness from their peers during groupwork. This study developed a novel Peer Assessment Fairness Instrument to explore factors…
Descriptors: Foreign Countries, Undergraduate Students, Student Attitudes, College Faculty
Hakelind, Camilla; Sundström, Anna E. – Psychology Learning and Teaching, 2022
Finding valid and reliable ways to assess complex clinical skills within psychology is a challenge. Recently, there have been some examples of applying Objective Structured Clinical Examinations (OSCEs) in psychology for making such assessments. The aim of this study was to examine students' and examiners' perceptions of a digital OSCE in…
Descriptors: Graduate Students, Masters Programs, Clinical Psychology, Student Evaluation
Bearman, Margaret; Ajjawi, Rola; Bennett, Sue; Boud, David – Advances in Health Sciences Education, 2021
Objective Structured Clinical Examinations (OSCEs) have become ubiquitous as a form of assessment in medical education but involve substantial resource demands and considerable local variation. A detailed understanding of the processes by which OSCEs are designed and administered could improve feasibility and sustainability. This exploration of…
Descriptors: Performance Based Assessment, Medical Education, Test Construction, Testing

Peer reviewed
Direct link
