ERIC - Search Results

Showing 1 to 15 of 23 results

Assessing the Ethical Capabilities of Chat GPT in Healthcare: A Study on Its Proficiency in Situational Judgement Test

Peer reviewed

Kunal Sareen – Innovations in Education and Teaching International, 2024

This study examines the proficiency of Chat GPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement" Test…

Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Chatbot Responses Suggest That Hypothetical Biology Questions Are Harder than Realistic Ones

Peer reviewed

Gregory J. Crowther; Usha Sankar; Leena S. Knight; Deborah L. Myers; Kevin T. Patton; Lekelia D. Jenkins; Thomas A. Knight – Journal of Microbiology & Biology Education, 2023

The biology education literature includes compelling assertions that unfamiliar problems are especially useful for revealing students' true understanding of biology. However, there is only limited evidence that such novel problems have different cognitive requirements than more familiar problems. Here, we sought additional evidence by using…

Descriptors: Science Instruction, Artificial Intelligence, Scoring, Molecular Structure

Measuring Original Thinking in Elementary School: Development and Validation of a Computational Psychometric Approach

Peer reviewed

Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024

Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…

Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

SARM: A Computer Program for Estimating Speed-Accuracy Response Models for Dichotomous Items. Research Report. ETS RR-18-15

Peer reviewed

van Rijn, Peter W.; Ali, Usama S. – ETS Research Report Series, 2018

A computer program was developed to estimate speed-accuracy response models for dichotomous items. This report describes how the models are estimated and how to specify data and input files. An example using data from a listening section of an international language test is described to illustrate the modeling approach and features of the computer…

Descriptors: Computer Software, Computation, Reaction Time, Timed Tests

A Review of Digital Formative Assessment Tools: Features and Future Directions

Peer reviewed

Çekiç, Ahmet; Bakla, Arif – International Online Journal of Education and Teaching, 2021

The Internet and the software stores for mobile devices come with a huge number of digital tools for any task, and those intended for digital formative assessment (DFA) have burgeoned exponentially in the last decade. These tools vary in terms of their functionality, pedagogical quality, cost, operating systems and so forth. Teachers and learners…

Descriptors: Formative Evaluation, Futures (of Society), Computer Assisted Testing, Guidance

Computerized Adaptive Test (CAT) Applications and Item Response Theory Models for Polytomous Items

Peer reviewed

Aybek, Eren Can; Demirtasli, R. Nukhet – International Journal of Research in Education and Science, 2017

This article aims to provide a theoretical framework for computerized adaptive tests (CAT) and item response theory models for polytomous items. Besides that, it aims to introduce the simulation and live CAT software to the related researchers. Computerized adaptive test algorithm, assumptions of item response theory models, nominal response…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Test Items

TIMSS 2023 Assessment Frameworks

Mullis, Ina V. S., Ed.; Martin, Michael O., Ed.; von Davier, Matthias, Ed. – International Association for the Evaluation of Educational Achievement, 2021

TIMSS (Trends in International Mathematics and Science Study) is a long-standing international assessment of mathematics and science at the fourth and eighth grades that has been collecting trend data every four years since 1995. About 70 countries use TIMSS trend data for monitoring the effectiveness of their education systems in a global…

Descriptors: Achievement Tests, International Assessment, Science Achievement, Mathematics Achievement

Item Response Data Analysis Using Stata Item Response Theory Package

Peer reviewed

Yang, Ji Seung; Zheng, Xiaying – Journal of Educational and Behavioral Statistics, 2018

The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from Programme of International Student Assessment, we review the IRT package from…

Descriptors: Item Response Theory, Item Analysis, Computer Software, Statistical Analysis

How Accurately Can the Google Web Speech API Recognize and Transcribe Japanese L2 English Learners' Oral Production?

Peer reviewed

Ashwell, Tim; Elam, Jesse R. – JALT CALL Journal, 2017

The ultimate aim of our research project was to use the Google Web Speech API to automate scoring of elicited imitation (EI) tests. However, in order to achieve this goal, we had to take a number of preparatory steps. We needed to assess how accurate this speech recognition tool is in recognizing native speakers' production of the test items; we…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests

Proceedings of the International Conference on Educational Data Mining (EDM) (16th, Bengaluru, India, July 11-14, 2023)

Peer reviewed

Feng, Mingyu, Ed.; Käser, Tanja, Ed.; Talukdar, Partha, Ed. – International Educational Data Mining Society, 2023

The Indian Institute of Science is proud to host the fully in-person sixteenth iteration of the International Conference on Educational Data Mining (EDM) during July 11-14, 2023. EDM is the annual flagship conference of the International Educational Data Mining Society. The theme of this year's conference is "Educational data mining for…

Descriptors: Information Retrieval, Data Analysis, Computer Assisted Testing, Cheating

Technical Report of the Survey of Adult Skills (PIAAC)

OECD Publishing, 2013

The Programme for the International Assessment of Adult Competencies (PIAAC) has been planned as an ongoing program of assessment. The first cycle of the assessment has involved two "rounds." The first round, which is covered by this report, took place over the period of January 2008-October 2013. The main features of the first cycle of…

Descriptors: International Assessment, Adults, Skills, Test Construction

Investigating the Suitability of Implementing the "e-rater"® Scoring Engine in a Large-Scale English Language Testing Program. Research Report. ETS RR-13-36

Peer reviewed

Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013

In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…

Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests

Evaluating IRT- and CTT-Based Methods of Estimating Classification Consistency and Accuracy Indices from Single Administrations

Deng, Nina – ProQuest LLC, 2011

Three decision consistency and accuracy (DC/DA) methods, the Livingston and Lewis (LL) method, LEE method, and the Hambleton and Han (HH) method, were evaluated. The purposes of the study were: (1) to evaluate the accuracy and robustness of these methods, especially when their assumptions were not well satisfied, (2) to investigate the "true"…

Descriptors: Item Response Theory, Test Theory, Computation, Classification
