Showing all 7 results
Peer reviewed
Andreea Dutulescu; Stefan Ruseti; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2024
Assessing the difficulty of reading comprehension questions is crucial to educational methodologies and language understanding technologies. Traditional methods of assessing question difficulty frequently rely on human judgments or shallow metrics, often failing to capture the intricate cognitive demands of answering a question. This…
Descriptors: Difficulty Level, Reading Tests, Test Items, Reading Comprehension
Peer reviewed
Malik, Ali; Wu, Mike; Vasavada, Vrinda; Song, Jinpeng; Coots, Madison; Mitchell, John; Goodman, Noah; Piech, Chris – International Educational Data Mining Society, 2021
Access to high-quality education at scale is limited by the difficulty of providing student feedback on open-ended assignments in structured domains like programming, graphics, and short response questions. This problem has proven to be exceptionally difficult: for humans, it requires large amounts of manual work, and for computers, until…
Descriptors: Grading, Accuracy, Computer Assisted Testing, Automation
Peer reviewed
Lu, Chang; Cutumisu, Maria – International Educational Data Mining Society, 2021
Digitalization and automation of test administration, score reporting, and feedback provision have the potential to benefit large-scale and formative assessments. Many studies on automated essay scoring (AES) and feedback generation systems were published in the last decade, but few connected AES and feedback generation within a unified framework.…
Descriptors: Learning Processes, Automation, Computer Assisted Testing, Scoring
Price, Beth; Steinle, Vicki; Stacey, Kaye; Gvozdenko, Eugene – Mathematics Education Research Group of Australasia, 2014
This study reports on the use of formative, diagnostic online assessments for the topic of percentages. Two new item formats (drag-drop and slider) are described. About one-third of the school students (Years 7 to 9) could, using a slider, estimate "80% more than" a given length, in contrast with over two-thirds who could estimate "90%…
Descriptors: Computation, Mathematical Concepts, Formative Evaluation, Diagnostic Tests
Wang, Shudong; Jiao, Hong – Online Submission, 2011
For decades, researchers and practitioners have made a great deal of effort to study a variety of methods for increasing parameter accuracy, but only recently have researchers begun to improve parameter estimation by using a joint model that incorporates RT and student information as CI. Given that many tests are currently…
Descriptors: Reaction Time, Item Response Theory, Computer Assisted Testing, Computation
Wang, Shudong; Jiao, Hong; He, Wei – Online Submission, 2011
The ability estimation procedure is one of the most important components of a computerized adaptive testing (CAT) system. Currently, all CATs that provide K-12 student scores are based on item response theory (IRT) models; yet such application directly violates the assumption of independent person samples in IRT models because ability…
Descriptors: Accuracy, Computation, Computer Assisted Testing, Adaptive Testing
He, Wei; Reckase, Mark – Online Submission, 2008
Test security has been a concern for computerized adaptive tests (CAT) due to the nature of continuous testing. This concern becomes especially severe with increasingly easy access to the World Wide Web, where some examinees post their recollections of the items they are administered, leaving future examinees with opportunities to…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Item Banks