NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 1 to 15 of 70 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Zeynep Uzun; Tuncay Ögretmen – Large-scale Assessments in Education, 2025
This study aimed to evaluate the item model fit by equating the forms of the PISA 2018 mathematics subtest with concurrent common items equating in samples from Türkiye, the UK, and Italy. The answers given in mathematics subtest Forms 2, 8, and 12 were used in this context. Analyzes were performed using the Dichotomous Rasch Model in the WINSTEPS…
Descriptors: Item Response Theory, Test Items, Foreign Countries, Mathematics Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Selim Dasçioglu; Tuncay Ögretmen – International Journal of Assessment Tools in Education, 2024
The purpose of this research is to determine whether PISA 2018 mathematical literacy test items show a differential item functioning across countries. For this purpose, only the items in booklet number three were examined using the MIMIC method with Latent Class Analysis (LCA) approach. PISA 2018 tests are mostly developed in English. Therefore,…
Descriptors: Test Items, Item Analysis, Mathematics Tests, Literacy
Green, Clare; Hughes, Sarah – Cambridge University Press & Assessment, 2022
The Digital High Stakes Assessment Programme in Cambridge University Press & Assessment is developing digital assessments for UK and global teachers and learners. In one development, the team are making decisions about the assessment models to use to assess computing systems knowledge and understanding. This research took place as part of the…
Descriptors: Test Items, Computer Science, Achievement Tests, Objective Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Sagoo, Mandeep Gill; Vorstenbosch, Marc A.T.M.; Bazira, Peter J.; Ellis, Harold; Kambouri, Maria; Owen, Charlie – Anatomical Sciences Education, 2021
Anatomical examinations have been designed to assess topographical and/or applied knowledge of anatomy with or without the inclusion of visual resources such as cadaveric specimens or images, radiological images, and/or clinical photographs. Multimedia learning theories have advanced the understanding of how words and images are processed during…
Descriptors: Anatomy, Computer Assisted Testing, Visual Aids, Medical Students
Peer reviewed Peer reviewed
Direct linkDirect link
Kunal Sareen – Innovations in Education and Teaching International, 2024
This study examines the proficiency of Chat GPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement" Test…
Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software
Peer reviewed Peer reviewed
Direct linkDirect link
Bimpeh, Yaw; Pointer, William; Smith, Ben Alexander; Harrison, Liz – Applied Measurement in Education, 2020
Many high-stakes examinations in the United Kingdom (UK) use both constructed-response items and selected-response items. We need to evaluate the inter-rater reliability for constructed-response items that are scored by humans. While there are a variety of methods for evaluating rater consistency across ratings in the psychometric literature, we…
Descriptors: Scoring, Generalizability Theory, Interrater Reliability, Foreign Countries
Gill, Tim – Research Matters, 2022
In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…
Descriptors: Comparative Analysis, Decision Making, Scripts, Standards
Peer reviewed Peer reviewed
Direct linkDirect link
Jose A. Diaz; Steven M. Nelson; A. Alexander Beaujean; Adam E. Green; Michael K. Scullin – Creativity Research Journal, 2024
The compound Remote Associates Test (RAT) is a classic measure of creativity. Participants are shown three cue words (sore-shoulder-sweat) and asked to generate a word that connects them (cold). Theoretical views of RAT performance differ in the degree to which they conceptualize performance as depending on automatic spreading activation across…
Descriptors: Test Items, Creative Thinking, Creativity Tests, Performance
Peer reviewed Peer reviewed
Direct linkDirect link
Stefan O'Grady – International Journal of Listening, 2025
Language assessment is increasingly computermediated. This development presents opportunities with new task formats and equally a need for renewed scrutiny of established conventions. Recent recommendations to increase integrated skills assessment in lecture comprehension tests is premised on empirical research that demonstrates enhanced construct…
Descriptors: Language Tests, Lecture Method, Listening Comprehension Tests, Multiple Choice Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yi Zou; Ying Zheng; Jingwen Wang – International Journal of Language Testing, 2025
The Pearson Test of English Academic (PTE-A), a widely used high-stakes language proficiency test for university admissions and migration purposes, underwent a notable change from a three-hour to a two-hour version in November 2021. The implementation of the new version has prompted inquiries into the washback effects on various stakeholders.…
Descriptors: Testing Problems, Test Preparation, High Stakes Tests, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
Sullivan, Alice – International Journal of Social Research Methodology, 2020
The UK census authorities have proposed guidance for the 2021 census indicating that the sex question may be answered according to subjective gender identity. This raises issues about the measurement of sex and gender identity which other data collection exercises are also contending with. This paper addresses the questions that have arisen…
Descriptors: Foreign Countries, National Surveys, Census Figures, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
McElwee, Sarah; Y. F. Cheung, Kevin; R. T. Cromie, Stephen; Shannon, Mark; Gallacher, Tom – Assessment in Education: Principles, Policy & Practice, 2021
The BioMedical Admissions Test (BMAT) has been used to select students for healthcare courses for 15 years. Recently, the candidature has included an increasing number of test takers who did not complete their schooling in the UK. In line with responsibilities to promote widening participation, a revision of the Section 2 Scientific Knowledge and…
Descriptors: Foreign Countries, Medical Education, College Admission, Medical Schools
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Krell, Moritz; Samia Khan; Jan van Driel – Education Sciences, 2021
The development and evaluation of valid assessments of scientific reasoning are an integral part of research in science education. In the present study, we used the linear logistic test model (LLTM) to analyze how item features related to text complexity and the presence of visual representations influence the overall item difficulty of an…
Descriptors: Cognitive Processes, Difficulty Level, Science Tests, Logical Thinking
Peer reviewed Peer reviewed
Direct linkDirect link
Liu, Yuan; Hau, Kit-Tai – Educational and Psychological Measurement, 2020
In large-scale low-stake assessment such as the Programme for International Student Assessment (PISA), students may skip items (missingness) which are within their ability to complete. The detection and taking care of these noneffortful responses, as a measure of test-taking motivation, is an important issue in modern psychometric models.…
Descriptors: Response Style (Tests), Motivation, Test Items, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Sullivan, Alice – International Journal of Social Research Methodology, 2020
This article replies to the responses to my article on "Sex and the Census: Why surveys should not conflate sex and gender identity". Fugard conflates sex itself with the characteristics associated with sex, such as finger length ratios, leading to the erroneous implication that binary sex is not a useful explanatory variable. Hines…
Descriptors: Foreign Countries, National Surveys, Census Figures, Test Items
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5