ERIC - Search Results

Publication Date

In 2025	1
Since 2024	5
Since 2021 (last 5 years)	24
Since 2016 (last 10 years)	43
Since 2006 (last 20 years)	62

Descriptor

Computer Assisted Testing	109
Scoring	109
Test Items	109
Test Construction	44
Adaptive Testing	42
Test Format	27
Item Response Theory	20
Comparative Analysis	18
Item Analysis	17
Foreign Countries	16
Multiple Choice Tests	16
Psychometrics	15
Test Validity	15
Computer Software	14
Models	14
Simulation	14
Item Banks	13
Language Tests	13
Mathematics Tests	13
Test Reliability	13
College Students	12
Difficulty Level	12
Higher Education	12
Scores	12
Accuracy	10
More ▼

Publication Type

Journal Articles	52
Reports - Research	45
Reports - Evaluative	34
Speeches/Meeting Papers	15
Reports - Descriptive	13
Books	7
Collected Works - General	7
Dissertations/Theses -…	5
Tests/Questionnaires	5
Numerical/Quantitative Data	4
Collected Works - Proceedings	2
Information Analyses	2
Book/Product Reviews	1
Guides - Non-Classroom	1
Non-Print Media	1
Opinion Papers	1
Reports - General	1
More ▼

Education Level

Higher Education	10
Postsecondary Education	10
Secondary Education	9
Elementary Education	8
Elementary Secondary Education	7
Junior High Schools	4
Middle Schools	4
Grade 8	2
High Schools	2
Intermediate Grades	2
Early Childhood Education	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
More ▼

Audience

Practitioners	2
Researchers	1
Students	1

Location

Canada	3
Japan	3
Australia	2
Czech Republic	2
France	2
Germany	2
Nebraska	2
Netherlands	2
South Korea	2
United Kingdom	2
Austria	1
Belgium	1
Chile	1
China	1
Cyprus	1
Denmark	1
Estonia	1
Iran	1
Ireland	1
Israel	1
Italy	1
Maryland	1
Norway	1
Poland	1
Russia	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	3
Test of English as a Foreign…	3
Advanced Placement…	2
Graduate Record Examinations	2
Trends in International…	2
Computer Attitude Scale	1
International Association for…	1
Preliminary Scholastic…	1
Program for International…	1
Progress in International…	1
SAT (College Admission Test)	1
Torrance Tests of Creative…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 109 results Save | Export

The Impact of Scoring Later on Mixed Format Adaptive Testing

Direct link

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

Online Calibration in Multidimensional Computerized Adaptive Testing with Polytomously Scored Items

Peer reviewed

Direct link

Yuan, Lu; Huang, Yingshi; Li, Shuhang; Chen, Ping – Journal of Educational Measurement, 2023

Online calibration is a key technology for item calibration in computerized adaptive testing (CAT) and has been widely used in various forms of CAT, including unidimensional CAT, multidimensional CAT (MCAT), CAT with polytomously scored items, and cognitive diagnostic CAT. However, as multidimensional and polytomous assessment data become more…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Test Items

Automated Marking of Longer Computational Questions in Engineering Subjects

Peer reviewed

Direct link

Pearson, Christopher; Penna, Nigel – Assessment & Evaluation in Higher Education, 2023

E-assessments are becoming increasingly common and progressively more complex. Consequently, how these longer, more complex questions are designed and marked is imperative. This article uses the NUMBAS e-assessment tool to investigate the best practice for creating longer questions and their mark schemes on surveying modules taken by engineering…

Descriptors: Automation, Scoring, Engineering Education, Foreign Countries

Automatic Question Generation and Answer Assessment: A Survey

Direct link

Das, Bidyut; Majumder, Mukta; Phadikar, Santanu; Sekh, Arif Ahmed – Research and Practice in Technology Enhanced Learning, 2021

Learning through the internet becomes popular that facilitates learners to learn anything, anytime, anywhere from the web resources. Assessment is most important in any learning system. An assessment system can find the self-learning gaps of learners and improve the progress of learning. The manual question generation takes much time and labor.…

Descriptors: Automation, Test Items, Test Construction, Computer Assisted Testing

Identifying Enemy Item Pairs Using Natural Language Processing

Peer reviewed

Direct link

Becker, Kirk A.; Kao, Shu-chuan – Journal of Applied Testing Technology, 2022

Natural Language Processing (NLP) offers methods for understanding and quantifying the similarity between written documents. Within the testing industry these methods have been used for automatic item generation, automated scoring of text and speech, modeling item characteristics, automatic question answering, machine translation, and automated…

Descriptors: Item Banks, Natural Language Processing, Computer Assisted Testing, Scoring

Technology-Enhanced Items and Model-Data Misfit. Research Report. ETS RR-22-11

Peer reviewed
PDF on ERIC

Download full text

Carol Eckerly; Yue Jia; Paul Jewsbury – ETS Research Report Series, 2022

Testing programs have explored the use of technology-enhanced items alongside traditional item types (e.g., multiple-choice and constructed-response items) as measurement evidence of latent constructs modeled with item response theory (IRT). In this report, we discuss considerations in applying IRT models to a particular type of adaptive testlet…

Descriptors: Computer Assisted Testing, Test Items, Item Response Theory, Scoring

Assessing the Ethical Capabilities of Chat GPT in Healthcare: A Study on Its Proficiency in Situational Judgement Test

Peer reviewed

Direct link

Kunal Sareen – Innovations in Education and Teaching International, 2024

This study examines the proficiency of Chat GPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement" Test…

Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software

Beyond Semantic Distance: Automated Scoring of Divergent Thinking Greatly Improves with Large Language Models

Peer reviewed
PDF on ERIC

Download full text

Direct link

Peter Organisciak; Selcuk Acar; Denis Dumas; Kelly Berthiaume – Grantee Submission, 2023

Automated scoring for divergent thinking (DT) seeks to overcome a key obstacle to creativity measurement: the effort, cost, and reliability of scoring open-ended tests. For a common test of DT, the Alternate Uses Task (AUT), the primary automated approach casts the problem as a semantic distance between a prompt and the resulting idea in a text…

Descriptors: Automation, Computer Assisted Testing, Scoring, Creative Thinking

Evaluating Different Scoring Methods for Multiple Response Items Providing Partial Credit

Peer reviewed

Direct link

Betts, Joe; Muntean, William; Kim, Doyoung; Kao, Shu-chuan – Educational and Psychological Measurement, 2022

The multiple response structure can underlie several different technology-enhanced item types. With the increased use of computer-based testing, multiple response items are becoming more common. This response type holds the potential for being scored polytomously for partial credit. However, there are several possible methods for computing raw…

Descriptors: Scoring, Test Items, Test Format, Raw Scores

Towards Scalable, Diverse, and Secure Assessment in College STEM Education

Direct link

Binglin Chen – ProQuest LLC, 2022

Assessment is a key component of education. Routine grading of students' work, however, is time consuming. Automating the grading process allows instructors to spend more of their time helping their students learn and engaging their students with more open-ended, creative activities. One way to automate grading is through computer-based…

Descriptors: College Students, STEM Education, Student Evaluation, Grading

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Establishing a Physics Concept Inventory Using Computer Marked Free-Response Questions

Peer reviewed
PDF on ERIC

Download full text

Parker, Mark A. J.; Hedgeland, Holly; Jordan, Sally E.; Braithwaite, Nicholas St. J. – European Journal of Science and Mathematics Education, 2023

The study covers the development and testing of the alternative mechanics survey (AMS), a modified force concept inventory (FCI), which used automatically marked free-response questions. Data were collected over a period of three academic years from 611 participants who were taking physics classes at high school and university level. A total of…

Descriptors: Test Construction, Scientific Concepts, Physics, Test Reliability

Decoding Student Insights: Analyzing Response Change in NAEP Mathematics Constructed Response Items

Peer reviewed
PDF on ERIC

Download full text

Congning Ni; Bhashithe Abeysinghe; Juanita Hicks – International Electronic Journal of Elementary Education, 2025

The National Assessment of Educational Progress (NAEP), often referred to as The Nation's Report Card, offers a window into the state of U.S. K-12 education system. Since 2017, NAEP has transitioned to digital assessments, opening new research opportunities that were previously impossible. Process data tracks students' interactions with the…

Descriptors: Reaction Time, Multiple Choice Tests, Behavior Change, National Competency Tests

Machine Learning, Natural Language Processing, and Psychometrics. The MARCES Book Series

Direct link

Hong Jiao, Editor; Robert W. Lissitz, Editor – IAP - Information Age Publishing, Inc., 2024

With the exponential increase of digital assessment, different types of data in addition to item responses become available in the measurement process. One of the salient features in digital assessment is that process data can be easily collected. This non-conventional structured or unstructured data source may bring new perspectives to better…

Descriptors: Artificial Intelligence, Natural Language Processing, Psychometrics, Computer Assisted Testing

Young Children's Actions on Length Measurement Tasks: Strategies and Cognitive Attributes

Peer reviewed

Direct link

Clements, Douglas H.; Banse, Holland; Sarama, Julie; Tatsuoka, Curtis; Joswick, Candace; Hudyma, Aaron; Van Dine, Douglas W.; Tatsuoka, Kikumi K. – Mathematical Thinking and Learning: An International Journal, 2022

Researchers often develop instruments using correctness scores (and a variety of theories and techniques, such as Item Response Theory) for validation and scoring. Less frequently, observations of children's strategies are incorporated into the design, development, and application of assessments. We conducted individual interviews of 833…

Descriptors: Item Response Theory, Computer Assisted Testing, Test Items, Mathematics Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

ETS Research Report Series	6
Grantee Submission	5
ProQuest LLC	5
Journal of Educational…	4
Applied Psychological…	3
Educational and Psychological…	3
International Educational…	3
International Journal of…	3
Journal of Applied Testing…	3
International Association for…	2
Journal of Technology,…	2
Nebraska Department of…	2
Practical Assessment,…	2
Advanced Education	1
Applied Measurement in…	1
Assessment & Evaluation in…	1
Communique	1
Computers & Education	1
Delta Publishing Company	1
Education and Information…	1
Educational Assessment	1
Educational Measurement:…	1
Educational Testing Service	1
Electronic Journal of…	1
English Language Teaching	1
More ▼

Bennett, Randy Elliot	6
Anderson, Paul S.	4
Stocking, Martha L.	3
Davey, Tim	2
Denis Dumas	2
Kao, Shu-chuan	2
Kaplan, Randy M.	2
Kelly Berthiaume	2
Kim, Doyoung	2
Mills, Craig N.	2
Morley, Mary	2
Muntean, William	2
Peter Organisciak	2
Segall, Daniel O.	2
Selcuk Acar	2
Wainer, Howard	2
Wise, Steven L.	2
Woo, Ada	2
Yamamoto, Kentaro	2
Alderton, David L.	1
Ali, Usama S.	1
Ashwell, Tim	1
Aviad-Levitzky, Tami	1
Aybek, Eren Can	1
More ▼