Publication Date
In 2025 | 6 |
Since 2024 | 11 |
Since 2021 (last 5 years) | 14 |
Since 2016 (last 10 years) | 28 |
Since 2006 (last 20 years) | 51 |
Descriptor
Computer Assisted Testing | 59 |
Evaluation Criteria | 40 |
Foreign Countries | 22 |
English (Second Language) | 13 |
Second Language Learning | 13 |
Evaluation Methods | 12 |
Scores | 12 |
Test Items | 12 |
Adaptive Testing | 11 |
Admission Criteria | 10 |
Language Tests | 10 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 59 |
Journal Articles | 53 |
Reports - Descriptive | 2 |
Speeches/Meeting Papers | 2 |
Education Level
Higher Education | 24 |
Postsecondary Education | 21 |
Secondary Education | 7 |
High Schools | 2 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 4 | 1 |
Intermediate Grades | 1 |
Audience
Location
Germany | 4 |
China | 3 |
Indonesia | 3 |
Taiwan | 2 |
Turkey | 2 |
Australia | 1 |
Colombia | 1 |
Denmark | 1 |
Egypt | 1 |
Finland | 1 |
Indiana | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
K. Talman; J. Vierula; T. Karihtala; E. Laakkonen; J. Engblom; E. Haavisto – Higher Education Quarterly, 2025
Higher education institutions need to develop valid, fair, and objective selection methods. Current literature reporting the development and validation of new national large-scale selection tests is scarce. This two-phased study aimed to (1) develop and (2) evaluate the validity of the Finnish digital Universities of Applied Sciences Entrance…
Descriptors: Admission Criteria, Test Construction, Test Validity, Computer Assisted Testing
Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025
Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…
Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment
Jussi S. Jauhiainen; Agustín Garagorry Guerra – Innovations in Education and Teaching International, 2025
The study highlights ChatGPT-4's potential in educational settings for the evaluation of university students' open-ended written examination responses. ChatGPT-4 evaluated 54 written responses, ranging from 24 to 256 words in English. It assessed each response using five criteria and assigned a grade on a six-point scale from fail to excellent,…
Descriptors: Artificial Intelligence, Technology Uses in Education, Student Evaluation, Writing Evaluation
Wen Xin Zhang; John J. H. Lin; Ying-Shao Hsu – Journal of Computer Assisted Learning, 2025
Background Study: Assessing learners' inquiry-based skills is challenging as social, political, and technological dimensions must be considered. The advanced development of artificial intelligence (AI) makes it possible to address these challenges and shape the next generation of science education. Objectives: The present study evaluated the SSI…
Descriptors: Artificial Intelligence, Computer Assisted Testing, Inquiry, Active Learning
Imanudin Kudus; Heru Nurasa; Ida Widianingsih; Nina Karlina; Jayum Anak Jawan – Cogent Education, 2024
Currently, Indonesia has 122 State Universities (PTN) under the Ministry of Education and Culture and other ministries. Improving the quality of the selection process for new student admissions at PTN is critical for Indonesia's human resources development. Then in 2019, there was a transformation with the implementation of the exam becoming a…
Descriptors: Foreign Countries, Higher Education, Public Colleges, Organizational Climate
Blair Lehman; Jesse R. Sparks; Jonathan Steinberg – ETS Research Report Series, 2024
Over the last 20 years, many methods have been proposed to use process data (e.g., response time) to detect changes in engagement during the test-taking process. However, many of these methods were developed and evaluated in highly similar testing contexts: 30 or more single-select multiple-choice items presented in a linear, fixed sequence in…
Descriptors: National Competency Tests, Secondary School Mathematics, Secondary School Students, Mathematics Tests
Ezi Apino; Edi Istiyono; Heri Retnawati; Widihastuti Widihastuti; Kana Hidayati – Journal of Pedagogical Research, 2024
Assessment of attitudes towards statistics [ATS] is needed to support the success of statistics education in tertiary institutions, so measuring instruments with high accuracy is required. However, existing instruments to measure ATS have not considered the use of technology as an essential variable affecting success in statistics education. The…
Descriptors: Foreign Countries, College Students, College Faculty, Statistics Education
Celeste Combrinck; Nelé Loubser – Discover Education, 2025
Written assignments for large classes pose a far more significant challenge in the age of the GenAI revolution. Suggestions such as oral exams and formative assessments are not always feasible with many students in a class. Therefore, we conducted a study in South Africa and involved 280 Honors students to explore the usefulness of Turnitin's AI…
Descriptors: Foreign Countries, Artificial Intelligence, Large Group Instruction, Alternative Assessment
Shinta Estri Wahyuningrum; Gilles van Luijtelaar; Augustina Sulastri; Marc P. H. Hendriks; Ridwan Sanjaya; Tom Heskes – SAGE Open, 2024
Visual Reproduction is a condition to measure Visual Spatial Memory as one of the cognitive domains commonly used to measure visuo-spatial memory. Geometric figures serve as stimulus material, and probands have to reproduce the figures from memory through a hand drawing. The scoring of the drawing has subjective elements. This study aims to…
Descriptors: Automation, Scores, Geometry, Visual Aids
Gerd Kortemeyer; Julian Nöhl; Daria Onishchuk – Physical Review Physics Education Research, 2024
[This paper is part of the Focused Collection in Artificial Intelligence Tools in Physics Teaching and Physics Education Research.] Using a high-stakes thermodynamics exam as the sample (252 students, four multipart problems), we investigate the viability of four workflows for AI-assisted grading of handwritten student solutions. We find that the…
Descriptors: Grading, Physics, Science Instruction, Artificial Intelligence
Yishen Song; Liming Guo; Qinhua Zheng – Education and Information Technologies, 2025
Scientific inquiry ability is closely related to the process of hands-on inquiry practice. However, its assessment is often separated from this practice due to the limitation of technical basis and labor cost. The development of multimodal data analysis provides a new opportunity to realize automated assessment based on hands-on practice.…
Descriptors: Elementary School Students, Grade 4, Hands on Science, Experiential Learning
Lin, Chuan-Ju; Chang, Hua-Hua – Educational and Psychological Measurement, 2019
For item selection in cognitive diagnostic computerized adaptive testing (CD-CAT), ideally, a single item selection index should be created to simultaneously regulate precision, exposure status, and attribute balancing. For this purpose, in this study, we first proposed an attribute-balanced item selection criterion, namely, the standardized…
Descriptors: Test Items, Selection Criteria, Computer Assisted Testing, Adaptive Testing
Penny Smith; Tracey Carlyon – Assessment Matters, 2023
Learning and assessment that drives learner success should be a key tenet of all initial teacher education programmes. Initial teacher education providers in Aotearoa New Zealand must use an assessment framework to ensure that graduating teachers meet the Teaching Council standards. As a part of a review of their assessment practices, academic…
Descriptors: Foreign Countries, Beginning Teachers, Beginning Teacher Induction, Teacher Education
A Sequential Bayesian Changepoint Detection Procedure for Aberrant Behaviors in Computerized Testing
Jing Lu; Chun Wang; Jiwei Zhang; Xue Wang – Grantee Submission, 2023
Changepoints are abrupt variations in a sequence of data in statistical inference. In educational and psychological assessments, it is pivotal to properly differentiate examinees' aberrant behaviors from solution behavior to ensure test reliability and validity. In this paper, we propose a sequential Bayesian changepoint detection algorithm to…
Descriptors: Bayesian Statistics, Behavior Patterns, Computer Assisted Testing, Accuracy
Gandini, Elena A. M.; Horák, Tania – Language Learning in Higher Education, 2020
This contribution reports on the developing and piloting of a computer-based version of the test of English as a foreign language produced by the University of Central Lancashire (UCLan), where it is currently used for the admission of international students and the subsequent evaluation of their language progress. Among other benefits,…
Descriptors: Computer Assisted Testing, Feedback (Response), Foreign Countries, English (Second Language)