Publication Date
  In 2025: 3
  Since 2024: 5
  Since 2021 (last 5 years): 10
  Since 2016 (last 10 years): 26
  Since 2006 (last 20 years): 31
Descriptor
  Automation: 37
  Test Validity: 37
  Scoring: 18
  Test Construction: 16
  Test Reliability: 14
  Computer Assisted Testing: 13
  Test Items: 11
  Testing: 10
  Psychometrics: 8
  Quality Control: 8
  Scores: 7
Author
  Bejar, Isaac I.: 2
  Okan Bulut: 2
  Abhishek Dasgupta: 1
  Aryadoust, Vahid: 1
  Barbi Svetec: 1
  Barghaus, Katherine M.: 1
  Bastian, Amy J.: 1
  Belur, Vinetha: 1
  Ben-Simon, Anat: 1
  Bin Tan: 1
  Blaženka Divjak: 1
Education Level
  Higher Education: 7
  Secondary Education: 6
  Early Childhood Education: 5
  Elementary Education: 5
  High Schools: 5
  Grade 3: 4
  Grade 4: 4
  Grade 5: 4
  Grade 6: 4
  Grade 7: 4
  Grade 9: 4
Laws, Policies, & Programs
  No Child Left Behind Act 2001: 1
Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025
This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…
Descriptors: Artificial Intelligence, Test Items, Automation, Test Format
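As a rough illustration of the generation step these studies share, the sketch below prompts an instruction-tuned LLM for a single item through the Hugging Face transformers pipeline; the model name, prompt wording, and generation settings are assumptions made for illustration, not details drawn from the review.

from transformers import pipeline

# Minimal sketch of LLM-based item generation; the checkpoint and
# prompt below are illustrative assumptions, not a reviewed setup.
generator = pipeline("text-generation",
                     model="meta-llama/Meta-Llama-3-8B-Instruct")

prompt = ("Write one multiple-choice question assessing the concept of "
          "test reliability. Provide four options labeled A-D and state "
          "the correct answer.")

# Each generated item would still need quality review before use.
item = generator(prompt, max_new_tokens=200)[0]["generated_text"]
print(item)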
Sandra Nilsson; Elisabet Östlund; Yvonne Thalén; Ulrika Löfkvist – Journal of Speech, Language, and Hearing Research, 2025
Purpose: The Language ENvironment Analysis (LENA) is a technological tool designed for comprehensive recordings and automated analysis of young children's daily language and auditory environments. LENA recordings play a crucial role in both clinical interventions and research, offering insights into the amount of spoken language children are…
Descriptors: Foreign Countries, Family Environment, Toddlers, Oral Language
Blaženka Divjak; Barbi Svetec; Damir Horvat – Journal of Computer Assisted Learning, 2024
Background: Sound learning design should be based on the constructive alignment of intended learning outcomes (LOs), teaching and learning activities, and formative and summative assessment. Assessment validity strongly relies on its alignment with LOs. Valid and reliable formative assessment can be analysed as a predictor of students' academic…
Descriptors: Automation, Formative Evaluation, Test Validity, Test Reliability
Huawei, Shi; Aryadoust, Vahid – Education and Information Technologies, 2023
Automated writing evaluation (AWE) systems are developed based on interdisciplinary research and technological advances such as natural language processing, computer science, and latent semantic analysis. Despite a steady increase in research publications in this area, the results of AWE investigations are often mixed, and their validity may be…
Descriptors: Writing Evaluation, Writing Tests, Computer Assisted Testing, Automation
Falcão, Filipe; Pereira, Daniela Marques; Gonçalves, Nuno; De Champlain, Andre; Costa, Patrício; Pêgo, José Miguel – Advances in Health Sciences Education, 2023
Automatic Item Generation (AIG) refers to the process of using cognitive models to generate test items via computer modules. It is a new but rapidly evolving research area in which cognitive and psychometric theory are combined into a digital framework. However, assessment of the item quality, usability, and validity of AIG relative to traditional…
Descriptors: Computer Assisted Testing, Test Construction, Test Items, Automation
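In template-based AIG of the kind this literature describes, a cognitive model is typically operationalized as an item model: a stem with variable slots that software fills systematically. A toy sketch, with a template and value ranges invented for illustration:

import itertools

# Toy item model: one stem, two numeric slots, key computed per fill.
template = "A train travels {d} km in {t} hours. What is its average speed in km/h?"

for d, t in itertools.product([120, 180, 240], [2, 3]):
    print(template.format(d=d, t=t), f"(key: {d / t:.0f})")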
Han, Chao – Language Testing, 2022
Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…
Descriptors: Translation, Language Tests, Testing, Evaluation Methods
Ryoo, Ji Hoon; Park, Sunhee; Suh, Hongwook; Choi, Jaehwa; Kwon, Jongkyum – SAGE Open, 2022
Measurement of cognitive ability has played a key role in cognitive science's developing understanding of human intelligence and the mind. To reflect data-scientific developments related to cognitive neuroscience, there has been demand for a measure that captures cognition over short, repeated time periods. This…
Descriptors: Cognitive Ability, Psychometrics, Test Validity, Test Construction
Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025
Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet the evaluation of item quality remains a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large language models, specifically Llama 3-8B, for…
Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation
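The screening step investigated here can be pictured as an LLM-as-judge loop: the model rates each generated item against a rubric, and the rating is parsed from its reply. A minimal sketch assuming a Hugging Face checkpoint and an invented rubric prompt; neither is the study's actual protocol.

import re
from transformers import pipeline

# Illustrative judge model; the study used Llama 3-8B, but this exact
# checkpoint, prompt, and parsing rule are assumptions for the sketch.
judge = pipeline("text-generation",
                 model="meta-llama/Meta-Llama-3-8B-Instruct")

def rate_item(item_text):
    prompt = ("Rate the following test item for clarity and content "
              "accuracy on a scale of 1 to 5. Answer with one digit.\n\n"
              f"{item_text}\n\nRating:")
    reply = judge(prompt, max_new_tokens=5)[0]["generated_text"]
    match = re.search(r"Rating:\s*([1-5])", reply)
    return int(match.group(1)) if match else None  # None = unparseable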
Charles Hulme; Joshua McGrane; Mihaela Duta; Gillian West; Denise Cripps; Abhishek Dasgupta; Sarah Hearne; Rachel Gardner; Margaret Snowling – Language, Speech, and Hearing Services in Schools, 2024
Purpose: Oral language skills provide a critical foundation for formal education and especially for the development of children's literacy (reading and spelling) skills. It is therefore important for teachers to be able to assess children's language skills, especially if they are concerned about their learning. We report the development and…
Descriptors: Automation, Language Tests, Standardized Tests, Test Construction
Lottridge, Sue; Burkhardt, Amy; Boyer, Michelle – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Sue Lottridge, Amy Burkhardt, and Dr. Michelle Boyer provide an overview of automated scoring. Automated scoring is the use of computer algorithms to score unconstrained open-ended test items by mimicking human scoring. The use of automated scoring is increasing in educational assessment programs because it allows…
Descriptors: Computer Assisted Testing, Scoring, Automation, Educational Assessment
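The module's definition of automated scoring (computer algorithms mimicking human scoring) amounts, at its core, to supervised learning on human-scored responses. A minimal sketch with invented data, using TF-IDF features and ridge regression as stand-ins for a production feature set:

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline

# Invented training data: responses paired with human rater scores.
responses = ["The water cycle starts with evaporation ...",
             "Rain falls because clouds ...",
             "Evaporation and condensation drive ..."]
human_scores = [3.0, 1.0, 2.0]

# Fit a model to mimic the human raters, then score unseen responses.
scorer = make_pipeline(TfidfVectorizer(), Ridge())
scorer.fit(responses, human_scores)
print(scorer.predict(["Condensation forms clouds, then ..."]))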
Rao, Dhawaleswar; Saha, Sujan Kumar – IEEE Transactions on Learning Technologies, 2020
Automatic multiple-choice question (MCQ) generation from text is a popular research area. MCQs are widely accepted for large-scale assessment in various domains and applications. However, manual generation of MCQs is expensive and time-consuming. Therefore, researchers have been drawn to automatic MCQ generation since the late 1990s.…
Descriptors: Multiple Choice Tests, Test Construction, Automation, Computer Software
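Early rule-based systems of the kind this survey covers typically built cloze items: select a sentence, blank out a key term, and add distractors drawn from related terms. A toy sketch with hand-picked inputs; real systems select terms and distractors automatically.

import random

def make_cloze_mcq(sentence, key_term, distractors):
    # Blank the key term out of the sentence to form the stem.
    stem = sentence.replace(key_term, "_____")
    options = distractors + [key_term]
    random.shuffle(options)
    return stem, options, options.index(key_term)

stem, options, answer = make_cloze_mcq(
    "Mitochondria produce most of the cell's ATP.",
    "Mitochondria",
    ["Ribosomes", "Lysosomes", "Chloroplasts"])
print(stem, options, "answer:", chr(65 + answer))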
Ziwei Zhou – ProQuest LLC, 2020
In light of the ever-increasing capability of computer technology and advances in speech and natural language processing techniques, automated speech scoring of constructed responses is gaining popularity in many high-stakes assessment and low-stakes educational settings. Automated scoring is a highly interdisciplinary and complex subject, and…
Descriptors: Certification, Speech Skills, Automation, Scoring
Doris Zahner; Jeffrey T. Steedle; James Soland; Catherine Welch; Qi Qin; Kathryn Thompson; Richard Phelps – Online Submission, 2023
The "Standards for Educational and Psychological Testing" have served as a cornerstone for best practices in assessment. As the field evolves, so must these standards, with regular revisions ensuring they reflect current knowledge and practice. The National Council on Measurement in Education (NCME) conducted a survey to gather feedback…
Descriptors: Standards, Educational Assessment, Psychological Testing, Best Practices
Martínez-Huertas, José Á.; Jastrzebska, Olga; Olmos, Ricardo; León, José A. – Assessment & Evaluation in Higher Education, 2019
Automated summary evaluation is proposed as an alternative to rubrics and multiple-choice tests in knowledge assessment. The inbuilt rubric method is a recent Latent Semantic Analysis (LSA) technique that implements rubrics in an artificially generated semantic space. It was compared with classical LSA's cosine-based methods for assessing knowledge in a…
Descriptors: Automation, Scoring Rubrics, Alternative Assessment, Test Reliability
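The classical cosine-based LSA scoring used as the baseline here can be sketched briefly: project texts into a reduced semantic space and take a summary's similarity to reference material as its score. The corpus and dimensionality below are placeholders.

from sklearn.decomposition import TruncatedSVD
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity
from sklearn.pipeline import make_pipeline

# Placeholder texts; a real semantic space is built from a large corpus.
texts = ["reference passage about photosynthesis and light energy",
         "student summary mentioning photosynthesis and light",
         "student summary about an unrelated topic entirely"]

lsa = make_pipeline(TfidfVectorizer(), TruncatedSVD(n_components=2))
vectors = lsa.fit_transform(texts)

# Cosine similarity of each summary to the reference acts as its score.
print(cosine_similarity(vectors[:1], vectors[1:]))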
Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018
In the current study, two pools of 250 essays, all written in response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation of each essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…
Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing
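The validity check described here has a simple computational shape: average many raters' marks as a proxy for the true score, produce machine scores by cross-validation, and measure agreement. A sketch with invented miniature data; the real study used 250-essay pools and a purpose-built AES system.

import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_predict
from sklearn.pipeline import make_pipeline

essays = ["first sample essay text ...",
          "second sample essay text ...",
          "third sample essay text ...",
          "fourth sample essay text ..."]
# Rows are essays, columns are raters; the mean approximates true score.
rater_scores = np.array([[4, 5, 4], [2, 3, 2], [5, 5, 4], [3, 3, 3]])
true_scores = rater_scores.mean(axis=1)

# Cross-validated machine scores, then agreement with the human proxy.
aes = make_pipeline(TfidfVectorizer(), Ridge())
machine_scores = cross_val_predict(aes, essays, true_scores, cv=2)
print(np.corrcoef(machine_scores, true_scores)[0, 1])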