Publication Date
In 2025 | 1 |
Since 2024 | 7 |
Descriptor
Computer Assisted Testing | 7 |
Error of Measurement | 7 |
Adaptive Testing | 3 |
Test Reliability | 3 |
Accuracy | 2 |
Evaluation Methods | 2 |
Student Evaluation | 2 |
Test Validity | 2 |
Thinking Skills | 2 |
Adults | 1 |
Aptitude Tests | 1 |
More ▼ |
Source
ProQuest LLC | 2 |
British Educational Research… | 1 |
ETS Research Institute | 1 |
Education and Information… | 1 |
Journal of Autism and… | 1 |
Journal of Computer Assisted… | 1 |
Author
Amit Sevak | 1 |
Andreas Mühling | 1 |
Christopher F. Chabris | 1 |
Daniel Fishtein | 1 |
Edi Istiyono | 1 |
Ikkyu Choi | 1 |
Jesse Sparks | 1 |
Jonas Flodén | 1 |
Markus Gebhardt | 1 |
Matt I. Brown | 1 |
Morten Bastian | 1 |
More ▼ |
Publication Type
Journal Articles | 4 |
Reports - Research | 4 |
Dissertations/Theses -… | 2 |
Reports - Evaluative | 1 |
Education Level
Elementary Education | 1 |
Elementary Secondary Education | 1 |
High Schools | 1 |
Higher Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
Armed Services Vocational… | 1 |
What Works Clearinghouse Rating
Jonas Flodén – British Educational Research Journal, 2025
This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…
Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring
Yu Wang – ProQuest LLC, 2024
The multiple-choice (MC) item format has been widely used in educational assessments across diverse content domains. MC items purportedly allow for collecting richer diagnostic information. The effectiveness and economy of administering MC items may have further contributed to their popularity not just in educational assessment. The MC item format…
Descriptors: Multiple Choice Tests, Cognitive Tests, Cognitive Measurement, Educational Diagnosis
Rizki Zakwandi; Edi Istiyono; Wipsar Sunu Brams Dwandaru – Education and Information Technologies, 2024
Computational Thinking (CT) skill was a part of the global framework of reference on Digital Literacy for Indicator 4.4.2, widely developed in mathematics and science learning. This study aimed to promote an assessment tool using a two-tier Computerized Adaptive Test (CAT). The study used the Design and Development Research (DDR) method with four…
Descriptors: Computer Assisted Testing, Adaptive Testing, Student Evaluation, Computation
Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024
The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…
Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability
Nikola Ebenbeck; Morten Bastian; Andreas Mühling; Markus Gebhardt – Journal of Computer Assisted Learning, 2024
Background: Computerised adaptive tests (CATs) are tests that provide personalised, efficient and accurate measurement while reducing testing time, depending on the desired level of precision. Schools have different types of assessments that can benefit from a significant reduction in testing time to varying degrees, depending on the area of…
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Public Schools, Special Schools
Stefan Lorenz – ProQuest LLC, 2024
This dissertation develops and applies sophisticated Item Response Theory (IRT) methods to address fundamental measurement challenges in cognitive testing, focusing on the Armed Services Vocational Aptitude Battery (ASVAB) data from the National Longitudinal Survey of Youth (NLSY). The first chapter implements a confirmatory multidimensional IRT…
Descriptors: Human Capital, Item Response Theory, Vocational Aptitude, Armed Forces
Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international largescale assessments of cognitive and…
Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias