Publication Date
In 2025 | 5 |
Since 2024 | 11 |
Since 2021 (last 5 years) | 17 |
Since 2016 (last 10 years) | 37 |
Since 2006 (last 20 years) | 88 |
Descriptor
Computer Assisted Testing | 88 |
Evaluation Criteria | 64 |
Foreign Countries | 26 |
Evaluation Methods | 23 |
Second Language Learning | 19 |
Test Items | 18 |
English (Second Language) | 17 |
Scores | 17 |
Test Construction | 17 |
Student Evaluation | 16 |
Language Tests | 15 |
More ▼ |
Source
Author
Chang, Hua-Hua | 4 |
Lin, Chuan-Ju | 4 |
Attali, Yigal | 3 |
Ramineni, Chaitanya | 3 |
Williamson, David M. | 3 |
Faurer, Judson C. | 2 |
Hwang, Gwo-Jen | 2 |
Lottridge, Susan | 2 |
Wang, Wen-Chung | 2 |
Wood, Scott | 2 |
Yin, Peng-Yeng | 2 |
More ▼ |
Publication Type
Education Level
Higher Education | 35 |
Postsecondary Education | 31 |
Secondary Education | 10 |
Elementary Secondary Education | 9 |
High Schools | 4 |
Adult Education | 1 |
Elementary Education | 1 |
Grade 12 | 1 |
Grade 4 | 1 |
Grade 8 | 1 |
Intermediate Grades | 1 |
More ▼ |
Audience
Administrators | 1 |
Researchers | 1 |
Teachers | 1 |
Location
Germany | 5 |
Australia | 3 |
China | 3 |
Indonesia | 3 |
Turkey | 3 |
Denmark | 2 |
Egypt | 2 |
Japan | 2 |
Singapore | 2 |
Taiwan | 2 |
United Kingdom | 2 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
K. Talman; J. Vierula; T. Karihtala; E. Laakkonen; J. Engblom; E. Haavisto – Higher Education Quarterly, 2025
Higher education institutions need to develop valid, fair, and objective selection methods. Current literature reporting the development and validation of new national large-scale selection tests is scarce. This two-phased study aimed to (1) develop and (2) evaluate the validity of the Finnish digital Universities of Applied Sciences Entrance…
Descriptors: Admission Criteria, Test Construction, Test Validity, Computer Assisted Testing
Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025
Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…
Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment
Wen Xin Zhang; John J. H. Lin; Ying-Shao Hsu – Journal of Computer Assisted Learning, 2025
Background Study: Assessing learners' inquiry-based skills is challenging as social, political, and technological dimensions must be considered. The advanced development of artificial intelligence (AI) makes it possible to address these challenges and shape the next generation of science education. Objectives: The present study evaluated the SSI…
Descriptors: Artificial Intelligence, Computer Assisted Testing, Inquiry, Active Learning
Imanudin Kudus; Heru Nurasa; Ida Widianingsih; Nina Karlina; Jayum Anak Jawan – Cogent Education, 2024
Currently, Indonesia has 122 State Universities (PTN) under the Ministry of Education and Culture and other ministries. Improving the quality of the selection process for new student admissions at PTN is critical for Indonesia's human resources development. Then in 2019, there was a transformation with the implementation of the exam becoming a…
Descriptors: Foreign Countries, Higher Education, Public Colleges, Organizational Climate
Blair Lehman; Jesse R. Sparks; Jonathan Steinberg – ETS Research Report Series, 2024
Over the last 20 years, many methods have been proposed to use process data (e.g., response time) to detect changes in engagement during the test-taking process. However, many of these methods were developed and evaluated in highly similar testing contexts: 30 or more single-select multiple-choice items presented in a linear, fixed sequence in…
Descriptors: National Competency Tests, Secondary School Mathematics, Secondary School Students, Mathematics Tests
Kirsch, Irwin; Braun, Henry – Large-scale Assessments in Education, 2020
Mounting concerns about the levels and distributions of human capital, as well as how they are associated with outcomes for individuals and societies, have contributed to an increase in the number of national and international surveys. These surveys not only examine skills among school-age and adult populations, they also facilitate evaluation of…
Descriptors: International Assessment, Computer Assisted Testing, Human Capital, Program Evaluation
Ezi Apino; Edi Istiyono; Heri Retnawati; Widihastuti Widihastuti; Kana Hidayati – Journal of Pedagogical Research, 2024
Assessment of attitudes towards statistics [ATS] is needed to support the success of statistics education in tertiary institutions, so measuring instruments with high accuracy is required. However, existing instruments to measure ATS have not considered the use of technology as an essential variable affecting success in statistics education. The…
Descriptors: Foreign Countries, College Students, College Faculty, Statistics Education
Celeste Combrinck; Nelé Loubser – Discover Education, 2025
Written assignments for large classes pose a far more significant challenge in the age of the GenAI revolution. Suggestions such as oral exams and formative assessments are not always feasible with many students in a class. Therefore, we conducted a study in South Africa and involved 280 Honors students to explore the usefulness of Turnitin's AI…
Descriptors: Foreign Countries, Artificial Intelligence, Large Group Instruction, Alternative Assessment
Shinta Estri Wahyuningrum; Gilles van Luijtelaar; Augustina Sulastri; Marc P. H. Hendriks; Ridwan Sanjaya; Tom Heskes – SAGE Open, 2024
Visual Reproduction is a condition to measure Visual Spatial Memory as one of the cognitive domains commonly used to measure visuo-spatial memory. Geometric figures serve as stimulus material, and probands have to reproduce the figures from memory through a hand drawing. The scoring of the drawing has subjective elements. This study aims to…
Descriptors: Automation, Scores, Geometry, Visual Aids
Gerd Kortemeyer; Julian Nöhl; Daria Onishchuk – Physical Review Physics Education Research, 2024
[This paper is part of the Focused Collection in Artificial Intelligence Tools in Physics Teaching and Physics Education Research.] Using a high-stakes thermodynamics exam as the sample (252 students, four multipart problems), we investigate the viability of four workflows for AI-assisted grading of handwritten student solutions. We find that the…
Descriptors: Grading, Physics, Science Instruction, Artificial Intelligence
Wood, Scott; Yao, Erin; Haisfield, Lisa; Lottridge, Susan – ACT, Inc., 2021
For assessment professionals who are also automated scoring (AS) professionals, there is no single set of standards of best practice. This paper reviews the assessment and AS literature to identify key standards of best practice and ethical behavior for AS professionals and codifies those standards in a single resource. Having a unified set of AS…
Descriptors: Standards, Best Practices, Computer Assisted Testing, Scoring
Yishen Song; Liming Guo; Qinhua Zheng – Education and Information Technologies, 2025
Scientific inquiry ability is closely related to the process of hands-on inquiry practice. However, its assessment is often separated from this practice due to the limitation of technical basis and labor cost. The development of multimodal data analysis provides a new opportunity to realize automated assessment based on hands-on practice.…
Descriptors: Elementary School Students, Grade 4, Hands on Science, Experiential Learning
Lin, Chuan-Ju; Chang, Hua-Hua – Educational and Psychological Measurement, 2019
For item selection in cognitive diagnostic computerized adaptive testing (CD-CAT), ideally, a single item selection index should be created to simultaneously regulate precision, exposure status, and attribute balancing. For this purpose, in this study, we first proposed an attribute-balanced item selection criterion, namely, the standardized…
Descriptors: Test Items, Selection Criteria, Computer Assisted Testing, Adaptive Testing
Rotou, Ourania; Rupp, André A. – ETS Research Report Series, 2020
This research report provides a description of the processes of evaluating the "deployability" of automated scoring (AS) systems from the perspective of large-scale educational assessments in operational settings. It discusses a comprehensive psychometric evaluation that entails analyses that take into consideration the specific purpose…
Descriptors: Computer Assisted Testing, Scoring, Educational Assessment, Psychometrics
Suzumura, Nana – Language Assessment Quarterly, 2022
The present study is part of a larger mixed methods project that investigated the speaking section of the Advanced Placement (AP) Japanese Language and Culture Exam. It investigated assumptions for the evaluation inference through a content analysis of test taker responses. Results of the content analysis were integrated with those of a many-facet…
Descriptors: Content Analysis, Test Wiseness, Advanced Placement, Computer Assisted Testing