ERIC - Search Results

Publication Date

In 2026	0
Since 2025	17
Since 2022 (last 5 years)	96
Since 2017 (last 10 years)	232

Descriptor

Computer Assisted Testing	232
Test Reliability	177
Test Validity	126
Foreign Countries	98
Test Construction	53
Elementary School Students	44
Language Tests	44
Scores	43
Scoring	43
Student Evaluation	37
Evaluation Methods	33
Reliability	33
Second Language Learning	33
Test Items	32
Psychometrics	31
English (Second Language)	30
Interrater Reliability	30
Test Format	29
Item Response Theory	27
Undergraduate Students	27
Comparative Analysis	23
Reading Tests	23
Student Attitudes	23
College Students	20
Adaptive Testing	19
More ▼

Publication Type

Journal Articles	190
Reports - Research	175
Reports - Evaluative	18
Reports - Descriptive	16
Dissertations/Theses -…	11
Tests/Questionnaires	10
Information Analyses	6
Numerical/Quantitative Data	4
Speeches/Meeting Papers	4
Guides - General	3
Guides - Non-Classroom	3
Collected Works - Proceedings	2
Books	1
Collected Works - General	1
Opinion Papers	1
More ▼

Education Level

Higher Education	75
Postsecondary Education	67
Elementary Education	54
Secondary Education	42
Middle Schools	25
Early Childhood Education	21
High Schools	19
Intermediate Grades	18
Junior High Schools	18
Primary Education	18
Grade 5	14
Elementary Secondary Education	12
Grade 3	12
Grade 4	12
Grade 6	8
Grade 7	8
Grade 2	7
Kindergarten	7
Grade 8	6
Grade 1	4
Adult Education	3
Grade 9	2
Preschool Education	2
Grade 10	1
Grade 11	1
More ▼

Audience

Administrators	6
Researchers	2
Practitioners	1

Location

Turkey	10
China	9
New York	9
Australia	8
Germany	5
Indonesia	5
Singapore	5
California	4
France	4
Japan	4
Taiwan	4
United Kingdom	4
Canada	3
Europe	3
Hungary	3
Illinois	3
India	3
Israel	3
Malaysia	3
Sweden	3
United Kingdom (England)	3
United States	3
Connecticut	2
Maryland	2
Nebraska	2
More ▼

Laws, Policies, & Programs

Every Student Succeeds Act…	2
Pell Grant Program	1

What Works Clearinghouse Rating

Showing 1 to 15 of 232 results Save | Export

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Direct link

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Direct link

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed
PDF on ERIC

Download full text

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

Technology-Based Assessment of Phonological Awareness in Kindergarten

Peer reviewed

Direct link

Renáta Kiss; Beno Csapó – International Journal of Early Childhood, 2025

Previous research has shown that phonological awareness is one of the most important prerequisites for early reading. Monitoring its development requires reliable, easy-to-use instruments especially in the last years of kindergarten. The present study aims to explore the potential for assessing phonological awareness and some of its subskills…

Descriptors: Phonological Awareness, Kindergarten, Reading Skills, Student Evaluation

What Predicts Variation in Reliability and Validity of Online Peer Assessment? A Large-Scale Cross-Context Study

Peer reviewed

Direct link

Xiong, Yao; Schunn, Christian D.; Wu, Yong – Journal of Computer Assisted Learning, 2023

Background: For peer assessment, reliability (i.e., consistency in ratings across peers) and validity (i.e., consistency of peer ratings with instructors or experts) are frequently examined in the research literature to address a central concern of instructors and students. Although the average levels are generally promising, both reliability and…

Descriptors: Peer Evaluation, Computer Assisted Testing, Test Reliability, Test Validity

How Reliable Is Assessment of Children's Sentence Comprehension Using a Self-Directed App? A Comparison of Supported versus Independent Use

Peer reviewed

Direct link

Pauline Frizelle; Ana Buckley; Tricia Biancone; Anna Ceroni; Darren Dahly; Paul Fletcher; Dorothy V. M. Bishop; Cristina McKean – Journal of Child Language, 2024

This study reports on the feasibility of using the Test of Complex Syntax- Electronic (TECS-E), as a self-directed app, to measure sentence comprehension in children aged 4 to 5 ½ years old; how testing apps might be adapted for effective independent use; and agreement levels between face-to-face supported computerized and independent computerized…

Descriptors: Language Processing, Computer Software, Language Tests, Syntax

Reliability, Validity and Acceptability of the PEDI-CAT with ASD Scales for Australian Children and Youth on the Autism Spectrum

Peer reviewed

Direct link

Angela Chamberlain; Emily D'Arcy; Andrew J. O. Whitehouse; Kerry Wallace; Maya Hayden-Evans; Sonya Girdler; Benjamin Milbourn; Sven Bölte; Kiah Evans – Journal of Autism and Developmental Disorders, 2025

Purpose: The PEDI-CAT (ASD) is used to assess functioning of children and youth on the autism spectrum; however, current psychometric evidence is limited. This study aimed to explore the reliability, validity and acceptability of the PEDI-CAT (ASD) using a large Australian sample. Methods: Caregivers of 134 children and youth on the spectrum…

Descriptors: Autism Spectrum Disorders, Children, Youth, Test Reliability

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Direct link

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Utilizing Real-Time Test Data to Solve Attenuation Paradox in Computerized Adaptive Testing to Enhance Optimal Design

Peer reviewed

Direct link

Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024

To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…

Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design

Exploring the Opportunities for Online Assessment of Phonological Awareness at the Beginning of Schooling

Peer reviewed

Direct link

Ágnes Hódi; Edit Tóth – International Journal of Early Childhood, 2024

Phonological awareness plays a key role in learning to read; therefore, its assessment has received a lot of attention. Research in the domain of phonological awareness has been characterized by attempts to develop reliable and valid assessment tools for diverse populations. Over the past few decades, phonological awareness assessment has gone…

Descriptors: Phonological Awareness, Computer Assisted Testing, Hungarian, Native Language

Improvised Progressive Model Based on Automatic Calibration of Difficulty Level: A Practical Solution of Competitive-Based Examination

Peer reviewed

Direct link

Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024

Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…

Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction

Development and Initial Validation of the Computer-Based Orthographic Processing Assessment Short Form: An Application of Cognitive Diagnostic Modeling

Peer reviewed

Direct link

Yi-Jui I. Chen; Yi-Jhen Wu; Yi-Hsin Chen; Robin Irey – Journal of Psychoeducational Assessment, 2025

A short form of the 60-item computer-based orthographic processing assessment (long-form COPA or COPA-LF) was developed. The COPA-LF consists of five skills, including rapid perception, access, differentiation, correction, and arrangement. Thirty items from the COPA-LF were selected for the short-form COPA (COPA-SF) based on cognitive diagnostic…

Descriptors: Computer Assisted Testing, Test Length, Test Validity, Orthographic Symbols

Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System

Peer reviewed

Direct link

Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025

In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…

Descriptors: Automation, Grading, Computer Assisted Testing, Scoring

Electronic Assessment Anxiety Scale: Development, Validity and Reliability

Peer reviewed
PDF on ERIC

Download full text

Osman Tat; Abdullah Faruk Kilic – Turkish Online Journal of Distance Education, 2024

The widespread availability of internet access in daily life has resulted in a greater acceptance of online assessment methods. E-assessment platforms offer various features such as randomizing questions and answers, utilizing extensive question banks, setting time limits, and managing access during online exams. Electronic assessment enables…

Descriptors: Test Construction, Test Validity, Test Reliability, Anxiety

Psychometrics of Art -- Validation of "RizbA," a Quantitative Rating Instrument for Pictorial Expression

Peer reviewed

Direct link

Schoch, Kerstin; Ostermann, Thomas – Creativity Research Journal, 2022

Although art has been subject to psychological research for some time, the artwork itself received little attention in quantitative research. The rating instrument for two-dimensional pictorial works ("RizbA") fills this gap by providing a tool for formal picture analysis. This study validates the questionnaire on 294 images created by…

Descriptors: Psychometrics, Art, Measures (Individuals), Visual Arts

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 16

Grantee Submission	10
ProQuest LLC	9
ETS Research Report Series	8
Education and Information…	7
Journal of Computer Assisted…	7
Language Assessment Quarterly	7
Journal of Speech, Language,…	6
Language Testing	6
New York State Education…	6
Journal of Psychoeducational…	5
Journal of Intelligence	4
Online Submission	4
Advances in Physiology…	3
International Association for…	3
International Journal of…	3
International Journal of…	3
Journal of Educational…	3
Advances in Health Sciences…	2
Assessment for Effective…	2
Canadian Journal of School…	2
Creativity Research Journal	2
Electronic Journal of…	2
IEEE Transactions on Learning…	2
International Educational…	2
International Journal of…	2
More ▼

McKown, Clark	4
Petscher, Yaacov	3
Tock, Jamie	3
Ackerman, Debra J.	2
Amit Sevak	2
Anna-Maria Fall	2
Beula M. Magimairaj	2
Biancarosa, Gina	2
Carlson, Sarah E.	2
Casabianca, Jodi M.	2
Chen, Guanhua	2
Cristina McKean	2
Daniel Fishtein	2
Darling-Hammond, Linda	2
Davis, Marcia H.	2
Davison, Mark L.	2
Divayana, Dewa Gede Hendra	2
Doewes, Afrizal	2
Dorothy V. M. Bishop	2
Ecalle, Jean	2
Erdemir, Mustafa	2
Galaczi, Evelina	2
Goodwin, Amanda P.	2
Greg Roberts	2
Hock, Michael	2
More ▼

Test of English as a Foreign…	5
Measures of Academic Progress	4
Gates MacGinitie Reading Tests	3
Woodcock Johnson Tests of…	3
MacArthur Communicative…	2
National Assessment of…	2
New York State Regents…	2
Peabody Picture Vocabulary…	2
Program for International…	2
ACT Assessment	1
ACTFL Oral Proficiency…	1
Autism Diagnostic Observation…	1
Clinical Evaluation of…	1
Computer Attitude Scale	1
Dynamic Indicators of Basic…	1
International Association for…	1
International English…	1
Iowa Tests of Basic Skills	1
Mullen Scales of Early…	1
Pediatric Evaluation of…	1
Progress in International…	1
Raven Progressive Matrices	1
Social Skills Improvement…	1
Stanford Achievement Tests	1
Test of English for…	1
More ▼