ERIC - Search Results

Publication Date

In 2025	3
Since 2024	6
Since 2021 (last 5 years)	14
Since 2016 (last 10 years)	40
Since 2006 (last 20 years)	59

Descriptor

Foreign Countries	70
Test Items	70
Test Construction	22
Test Validity	13
Computer Assisted Testing	12
Difficulty Level	11
Questionnaires	11
Scores	10
Test Format	10
Achievement Tests	9
Mathematics Tests	9
Scoring	9
Secondary School Students	9
Correlation	8
English (Second Language)	8
Item Analysis	8
Multiple Choice Tests	8
Science Tests	8
Undergraduate Students	8
Higher Education	7
National Surveys	7
Second Language Learning	7
Student Attitudes	7
College Students	6
Computation	6
More ▼

Publication Type

Journal Articles	65
Reports - Research	52
Reports - Evaluative	9
Tests/Questionnaires	5
Reports - Descriptive	4
Information Analyses	3
Books	1
Collected Works - General	1
Guides - Classroom - Teacher	1
Non-Print Media	1
Opinion Papers	1
Reports - General	1
More ▼

Education Level

Higher Education	26
Postsecondary Education	19
Secondary Education	12
Elementary Secondary Education	3
High Schools	3

Audience

Practitioners	3
Policymakers	2
Community	1
Researchers	1
Teachers	1

Location

United Kingdom	70
United States	11
Australia	7
Canada	5
China	5
Japan	5
France	3
South Korea	3
Germany	2
Hong Kong	2
Ireland	2
Italy	2
Netherlands	2
Russia	2
Singapore	2
Spain	2
Taiwan	2
Turkey	2
Austria	1
Belgium	1
Chile	1
Cyprus	1
Czech Republic	1
Denmark	1
Estonia	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

Program for International…	5
Dyadic Adjustment Scale	1
International English…	1
Locke Wallace Marital…	1
National Assessment of…	1
Pearson Test of English…	1
Remote Associates Test	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 70 results Save | Export

Impact of Differential Item Functioning on Item Model Fit Using Concurrent Equating Method

Peer reviewed

Direct link

Zeynep Uzun; Tuncay Ögretmen – Large-scale Assessments in Education, 2025

This study aimed to evaluate the item model fit by equating the forms of the PISA 2018 mathematics subtest with concurrent common items equating in samples from Türkiye, the UK, and Italy. The answers given in mathematics subtest Forms 2, 8, and 12 were used in this context. Analyzes were performed using the Dichotomous Rasch Model in the WINSTEPS…

Descriptors: Item Response Theory, Test Items, Foreign Countries, Mathematics Tests

Detection of Differential Item Functioning with Latent Class Analysis: PISA 2018 Mathematical Literacy Test

Peer reviewed
PDF on ERIC

Download full text

Selim Dasçioglu; Tuncay Ögretmen – International Journal of Assessment Tools in Education, 2024

The purpose of this research is to determine whether PISA 2018 mathematical literacy test items show a differential item functioning across countries. For this purpose, only the items in booklet number three were examined using the MIMIC method with Latent Class Analysis (LCA) approach. PISA 2018 tests are mostly developed in English. Therefore,…

Descriptors: Test Items, Item Analysis, Mathematics Tests, Literacy

Item Types and Demand: What Is the Impact on Demand of Manipulating Item Types in Computer Science GCSE and IGCSE? Research Report

Download full text

Green, Clare; Hughes, Sarah – Cambridge University Press & Assessment, 2022

The Digital High Stakes Assessment Programme in Cambridge University Press & Assessment is developing digital assessments for UK and global teachers and learners. In one development, the team are making decisions about the assessment models to use to assess computing systems knowledge and understanding. This research took place as part of the…

Descriptors: Test Items, Computer Science, Achievement Tests, Objective Tests

Online Assessment of Applied Anatomy Knowledge: The Effect of Images on Medical Students' Performance

Peer reviewed

Direct link

Sagoo, Mandeep Gill; Vorstenbosch, Marc A.T.M.; Bazira, Peter J.; Ellis, Harold; Kambouri, Maria; Owen, Charlie – Anatomical Sciences Education, 2021

Anatomical examinations have been designed to assess topographical and/or applied knowledge of anatomy with or without the inclusion of visual resources such as cadaveric specimens or images, radiological images, and/or clinical photographs. Multimedia learning theories have advanced the understanding of how words and images are processed during…

Descriptors: Anatomy, Computer Assisted Testing, Visual Aids, Medical Students

Assessing the Ethical Capabilities of Chat GPT in Healthcare: A Study on Its Proficiency in Situational Judgement Test

Peer reviewed

Direct link

Kunal Sareen – Innovations in Education and Teaching International, 2024

This study examines the proficiency of Chat GPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement" Test…

Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software

Evaluating Human Scoring Using Generalizability Theory

Peer reviewed

Direct link

Bimpeh, Yaw; Pointer, William; Smith, Ben Alexander; Harrison, Liz – Applied Measurement in Education, 2020

Many high-stakes examinations in the United Kingdom (UK) use both constructed-response items and selected-response items. We need to evaluate the inter-rater reliability for constructed-response items that are scored by humans. While there are a variety of methods for evaluating rater consistency across ratings in the psychometric literature, we…

Descriptors: Scoring, Generalizability Theory, Interrater Reliability, Foreign Countries

The Concurrent Validity of Comparative Judgement Outcomes Compared with Marks

Download full text

Gill, Tim – Research Matters, 2022

In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…

Descriptors: Comparative Analysis, Decision Making, Scripts, Standards

The Impact of Adding a Fourth Item to the Traditional 3-Item Remote Associates Test

Peer reviewed

Direct link

Jose A. Diaz; Steven M. Nelson; A. Alexander Beaujean; Adam E. Green; Michael K. Scullin – Creativity Research Journal, 2024

The compound Remote Associates Test (RAT) is a classic measure of creativity. Participants are shown three cue words (sore-shoulder-sweat) and asked to generate a word that connects them (cold). Theoretical views of RAT performance differ in the degree to which they conceptualize performance as depending on automatic spreading activation across…

Descriptors: Test Items, Creative Thinking, Creativity Tests, Performance

Investigating the Role of Response Format in Computer-Based Lecture Comprehension Tasks

Peer reviewed

Direct link

Stefan O'Grady – International Journal of Listening, 2025

Language assessment is increasingly computermediated. This development presents opportunities with new task formats and equally a need for renewed scrutiny of established conventions. Recent recommendations to increase integrated skills assessment in lecture comprehension tests is premised on empirical research that demonstrates enhanced construct…

Descriptors: Language Tests, Lecture Method, Listening Comprehension Tests, Multiple Choice Tests

A Case Study of Washback and Test Preparation of the New Version of PTE Academic

Peer reviewed
PDF on ERIC

Download full text

Yi Zou; Ying Zheng; Jingwen Wang – International Journal of Language Testing, 2025

The Pearson Test of English Academic (PTE-A), a widely used high-stakes language proficiency test for university admissions and migration purposes, underwent a notable change from a three-hour to a two-hour version in November 2021. The implementation of the new version has prompted inquiries into the washback effects on various stakeholders.…

Descriptors: Testing Problems, Test Preparation, High Stakes Tests, English (Second Language)

Sex and the Census: Why Surveys Should Not Conflate Sex and Gender Identity

Peer reviewed

Direct link

Sullivan, Alice – International Journal of Social Research Methodology, 2020

The UK census authorities have proposed guidance for the 2021 census indicating that the sex question may be answered according to subjective gender identity. This raises issues about the measurement of sex and gender identity which other data collection exercises are also contending with. This paper addresses the questions that have arisen…

Descriptors: Foreign Countries, National Surveys, Census Figures, Test Items

Revising the BioMedical Admissions Test (BMAT) to Improve Impact and Washback for Candidates and Support Fair Access to Test Preparation

Peer reviewed

Direct link

McElwee, Sarah; Y. F. Cheung, Kevin; R. T. Cromie, Stephen; Shannon, Mark; Gallacher, Tom – Assessment in Education: Principles, Policy & Practice, 2021

The BioMedical Admissions Test (BMAT) has been used to select students for healthcare courses for 15 years. Recently, the candidature has included an increasing number of test takers who did not complete their schooling in the UK. In line with responsibilities to promote widening participation, a revision of the Section 2 Scientific Knowledge and…

Descriptors: Foreign Countries, Medical Education, College Admission, Medical Schools

Analyzing Cognitive Demands of a Scientific Reasoning Test Using the Linear Logistic Test Model (LLTM)

Peer reviewed
PDF on ERIC

Download full text

Krell, Moritz; Samia Khan; Jan van Driel – Education Sciences, 2021

The development and evaluation of valid assessments of scientific reasoning are an integral part of research in science education. In the present study, we used the linear logistic test model (LLTM) to analyze how item features related to text complexity and the presence of visual representations influence the overall item difficulty of an…

Descriptors: Cognitive Processes, Difficulty Level, Science Tests, Logical Thinking

Measuring Motivation to Take Low-Stakes Large-Scale Test: New Model Based on Analyses of "Participant-Own-Defined" Missingness

Peer reviewed

Direct link

Liu, Yuan; Hau, Kit-Tai – Educational and Psychological Measurement, 2020

In large-scale low-stake assessment such as the Programme for International Student Assessment (PISA), students may skip items (missingness) which are within their ability to complete. The detection and taking care of these noneffortful responses, as a measure of test-taking motivation, is an important issue in modern psychometric models.…

Descriptors: Response Style (Tests), Motivation, Test Items, Statistical Analysis

Response to Fugard and Hines

Peer reviewed

Direct link

Sullivan, Alice – International Journal of Social Research Methodology, 2020

This article replies to the responses to my article on "Sex and the Census: Why surveys should not conflate sex and gender identity". Fugard conflates sex itself with the characteristics associated with sex, such as finger length ratios, leading to the erroneous implication that binary sex is not a useful explanatory variable. Hines…

Descriptors: Foreign Countries, National Surveys, Census Figures, Test Items

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

International Journal of…	4
Applied Measurement in…	3
Assessment & Evaluation in…	3
Assessment in Education:…	3
British Journal of…	3
Educational and Psychological…	3
Research Matters	3
Educational Research	2
Evaluation & Research in…	2
Research Papers in Education	2
School Science Review	2
Studies in Higher Education	2
Anatomical Sciences Education	1
Australian Journal of…	1
British Journal of Learning…	1
Cambridge University Press &…	1
Creativity Research Journal	1
Education Sciences	1
Educational Research and…	1
Educational Studies	1
Educational Studies in…	1
Field Methods	1
Innovations in Education and…	1
International Journal of…	1
International Journal of…	1
More ▼

Crisp, Victoria	4
Bramley, Tom	3
Sullivan, Alice	2
Tuncay Ögretmen	2
A. Alexander Beaujean	1
Adam E. Green	1
Andrich, David	1
Barber, Jill	1
Bazira, Peter J.	1
Beggs, Jim	1
Bellin, W.	1
Bimpeh, Yaw	1
Black, Beth	1
Blair, Bernadette	1
Bligh, J. G.	1
Blumberg, Fran	1
Breen, Sinéad	1
Brooks, Michelle	1
Brooks, Mike	1
Brown, Anna	1
Brunfaut, Tineke	1
Burfitt, Joan	1
Carrick, Tessa	1
Chia, Lian Sai	1
More ▼