Selcuk Acar; Peter Organisciak; Denis Dumas – Journal of Creative Behavior, 2025
In this three-study investigation, we applied various approaches to score drawings created in response to both Form A and Form B of the Torrance Tests of Creative Thinking-Figural (collectively, TTCT-F) as well as the Multi-Trial Creative Ideation task (MTCI). We focused on the TTCT-F in Study 1, and utilizing a random forest classifier, we achieved 79% and…
Descriptors: Scoring, Computer Assisted Testing, Models, Correlation
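The random-forest scoring approach named in this entry can be sketched in a few lines. This is a toy illustration only: it assumes each drawing has already been reduced to a numeric feature vector, and the random data, feature count, and accuracy printout are placeholders, not the authors' pipeline or their reported result.

# Minimal sketch: scoring drawings with a random forest classifier.
# Assumes drawings are already encoded as numeric feature vectors;
# the data below are random placeholders, not the authors' features.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 32))          # placeholder drawing features
y = rng.integers(0, 2, size=500)        # placeholder 0/1 creativity ratings

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

clf = RandomForestClassifier(n_estimators=200, random_state=0)
clf.fit(X_train, y_train)
print("held-out accuracy:", accuracy_score(y_test, clf.predict(X_test)))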
Ulrike Padó; Yunus Eryilmaz; Larissa Kirschner – International Journal of Artificial Intelligence in Education, 2024
Short-Answer Grading (SAG) is a time-consuming task for teachers that automated SAG models have long promised to make easier. However, there are three challenges for their broad-scale adoption: A technical challenge regarding the need for high-quality models, which is exacerbated for languages with fewer resources than English; a usability…
Descriptors: Grading, Automation, Test Format, Computer Assisted Testing
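As a rough illustration of the similarity-to-reference idea underlying many short-answer grading (SAG) systems, the sketch below scores a student answer by TF-IDF cosine similarity to a reference answer. The models discussed by Padó et al. are far more sophisticated; the reference text and answers here are invented.

# Toy short-answer grader: score an answer by cosine similarity to a
# reference answer. Illustrates only the basic idea, not a real SAG model.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

reference = "Photosynthesis converts light energy into chemical energy."
answers = [
    "Plants turn light into chemical energy by photosynthesis.",
    "It is a kind of respiration in animals.",
]

vec = TfidfVectorizer().fit([reference] + answers)
ref_vec = vec.transform([reference])
for ans in answers:
    sim = cosine_similarity(ref_vec, vec.transform([ans]))[0, 0]
    print(f"{sim:.2f}  {ans}")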
Anna Filighera; Sebastian Ochs; Tim Steuer; Thomas Tregel – International Journal of Artificial Intelligence in Education, 2024
Automatic grading models are valued for the time and effort saved during the instruction of large student bodies. Especially with the increasing digitization of education and interest in large-scale standardized testing, the popularity of automatic grading has risen to the point where commercial solutions are widely available and used. However,…
Descriptors: Cheating, Grading, Form Classes (Languages), Computer Software
Yan Jin; Jason Fan – Language Assessment Quarterly, 2023
In language assessment, AI technology has been incorporated in task design, assessment delivery, automated scoring of performance-based tasks, score reporting, and provision of feedback. AI technology is also used for collecting and analyzing performance data in language assessment validation. Research has been conducted to investigate the…
Descriptors: Language Tests, Artificial Intelligence, Computer Assisted Testing, Test Format
Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019
Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…
Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability
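For readers unfamiliar with the cited model, the 2PL nested logit model of Suh and Bolt (2010) combines a two-parameter logistic function for the probability of a correct response with a multinomial logit for distractor choice conditional on an incorrect response. A sketch of the equations, in notation that may differ from the original papers:

P(y_i = 1 \mid \theta) = \frac{\exp\{a_i(\theta - b_i)\}}{1 + \exp\{a_i(\theta - b_i)\}},
\qquad
P(d_i = k \mid y_i = 0, \theta) = \frac{\exp(\zeta_{ik} + \lambda_{ik}\theta)}{\sum_{h=1}^{K_i} \exp(\zeta_{ih} + \lambda_{ih}\theta)}.

Recovering the distractor term is what lets the model extract information from which wrong option was chosen, rather than treating all incorrect responses as equivalent.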
Abass, Olalere A.; Olajide, Samuel A.; Samuel, Babafemi O. – Turkish Online Journal of Distance Education, 2017
The traditional method of assessment (examination) is often marred by leakage of examination questions and by human error in the marking of scripts and the recording of scores. Technological advancement in the field of computer science has made computer use necessary in nearly all areas of human life and endeavor, the education sector…
Descriptors: Computer Assisted Testing, Computer System Design, Test Format, Design Requirements
Edward Paul Getman – Online Submission, 2020
Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement
Diao, Qi; van der Linden, Wim J. – Applied Psychological Measurement, 2013
Automated test assembly uses the methodology of mixed integer programming to select an optimal set of items from an item bank. Automated test-form generation uses the same methodology to optimally order the items and format the test form. From an optimization point of view, production of fully formatted test forms directly from the item pool using…
Descriptors: Automation, Test Construction, Test Format, Item Banks
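The mixed integer programming formulation mentioned here reduces, in its simplest form, to choosing a 0/1 selection vector that maximizes test information subject to length and content constraints. A minimal sketch using the open-source PuLP solver; the item parameters, content labels, and constraint values are invented for illustration.

# Minimal automated test assembly sketch: pick 10 items maximizing Fisher
# information at theta = 0 subject to a content constraint.
import math
import pulp

n_items = 40
a = [0.8 + 0.02 * i for i in range(n_items)]          # discrimination
b = [-2.0 + 0.1 * i for i in range(n_items)]          # difficulty
content = [i % 2 for i in range(n_items)]             # 0 = algebra, 1 = geometry

def info(a_i, b_i, theta=0.0):
    p = 1.0 / (1.0 + math.exp(-a_i * (theta - b_i)))  # 2PL response probability
    return a_i ** 2 * p * (1 - p)                     # Fisher information

prob = pulp.LpProblem("test_assembly", pulp.LpMaximize)
x = [pulp.LpVariable(f"x{i}", cat="Binary") for i in range(n_items)]
prob += pulp.lpSum(info(a[i], b[i]) * x[i] for i in range(n_items))
prob += pulp.lpSum(x) == 10                           # test length
prob += pulp.lpSum(x[i] for i in range(n_items) if content[i] == 1) >= 4

prob.solve(pulp.PULP_CBC_CMD(msg=False))
print("selected:", [i for i in range(n_items) if x[i].value() == 1])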
Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André – Applied Measurement in Education, 2016
Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…
Descriptors: Psychometrics, Multiple Choice Tests, Test Items, Item Analysis
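Automatic item generation is often implemented as template substitution driven by a cognitive model: an item model fixes the stem structure and constrains the values that may fill it. The sketch below shows only the mechanical substitution step with a hypothetical dosage-calculation item model; the cognitive modeling that justifies the template, which this abstract emphasizes, is not shown.

# Toy automatic item generation: fill a parameterized item model
# (template) with values drawn from constrained ranges. Stem and
# options are purely illustrative.
import random

STEM = "A patient weighs {w} kg and the dose is {d} mg per kg. Total dose?"

def generate_item(rng):
    w = rng.randrange(40, 100, 5)       # weight in kg
    d = rng.choice([2, 4, 5, 10])       # dose in mg/kg
    key = w * d                         # correct answer
    distractors = {w + d, w * d * 2, w * d // 2}
    options = sorted(distractors | {key})
    return STEM.format(w=w, d=d), options, key

rng = random.Random(1)
for _ in range(3):
    stem, options, key = generate_item(rng)
    print(stem, options, "key =", key)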
Ihme, Jan Marten; Senkbeil, Martin; Goldhammer, Frank; Gerick, Julia – European Educational Research Journal, 2017
Combinations of different item formats are found quite often in large-scale assessments, and dimensionality analyses often indicate that tests are multidimensional with respect to task format. In ICILS 2013, three different item types (information-based response tasks, simulation tasks, and authoring tasks) were used to measure computer and…
Descriptors: Foreign Countries, Computer Literacy, Information Literacy, International Assessment
Becker, Kirk A.; Bergstrom, Betty A. – Practical Assessment, Research & Evaluation, 2013
The need for increased exam security, improved test formats, more flexible scheduling, better measurement, and more efficient administrative processes has caused testing agencies to consider converting the administration of their exams from paper-and-pencil to computer-based testing (CBT). Many decisions must be made in order to provide an optimal…
Descriptors: Testing, Models, Testing Programs, Program Administration
Hol, A. Michiel; Vorst, Harrie C. M.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 2007
In a randomized experiment (n = 515), a conventional computerized test and a computerized adaptive test (CAT) are compared. The item pool consists of 24 polytomous motivation items. Although items are carefully selected, calibration data show that Samejima's graded response model did not fit the data optimally. A simulation study is done to assess possible…
Descriptors: Student Motivation, Simulation, Adaptive Testing, Computer Assisted Testing
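Samejima's graded response model, which the abstract reports fit the motivation items suboptimally, specifies the cumulative probability of responding in category k or above and obtains category probabilities by differencing. In standard notation (which may differ from the paper's):

P^*_{ik}(\theta) = \frac{\exp\{a_i(\theta - b_{ik})\}}{1 + \exp\{a_i(\theta - b_{ik})\}},
\qquad
P(X_i = k \mid \theta) = P^*_{ik}(\theta) - P^*_{i,k+1}(\theta),

with P^*_{i0}(\theta) = 1 and P^*_{i,K_i+1}(\theta) = 0 for an item with response categories 0 through K_i.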
Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007
Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…
Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models
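Multistage testing adapts between modules rather than between items: after a routing stage, the examinee is sent to an easier or harder second-stage module. A toy router based on number-correct scoring, with invented cut scores:

# Toy MST router: send the examinee to an easy, medium, or hard
# second-stage module based on routing-module number-correct score.
def route(num_correct, cuts=(4, 8)):
    if num_correct < cuts[0]:
        return "easy module"
    if num_correct < cuts[1]:
        return "medium module"
    return "hard module"

print(route(3), route(6), route(9))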
Stocking, Martha L. – 1993
In the context of paper and pencil testing, the frequency of the exposure of items is usually controlled through policies that regulate both the reuse of test forms and the frequency with which a candidate may retake the test. In the context of computerized adaptive testing, where item pools are large and expensive to produce and testing can be on…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Models
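A common family of exposure-control methods of the kind this report addresses is probabilistic filtering in the style of Sympson and Hetter: the information-optimal item is administered only with some probability k_i, otherwise the next-best item is considered. The sketch below illustrates that filter with invented exposure parameters; Stocking's own procedures differ in detail.

# Sympson-Hetter-style exposure control: each candidate item, taken in
# order of information, is administered only with probability k[item].
import random

def select_item(ranked_items, k, rng):
    """ranked_items: item ids ordered by information at current theta;
    k: dict of exposure-control probabilities in [0, 1]."""
    for item in ranked_items:
        if rng.random() <= k[item]:
            return item
    return ranked_items[-1]             # fallback: last-ranked item

rng = random.Random(0)
k = {"i1": 0.3, "i2": 0.8, "i3": 1.0}   # heavily used items get smaller k
counts = {i: 0 for i in k}
for _ in range(1000):
    counts[select_item(["i1", "i2", "i3"], k, rng)] += 1
print(counts)   # i1 is shielded despite always ranking first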
Bizot, Elizabeth B.; Goldman, Steven H. – 1994
A study was conducted to evaluate the effects of choice of item response theory (IRT) model, parameter calibration group, starting ability estimate, and stopping criterion on the conversion of an 80-item vocabulary test to computer adaptive format. Three parameter calibration groups were tested: (1) a group of 1,000 high school seniors, (2) a…
Descriptors: Ability, Adaptive Testing, Computer Assisted Testing, Estimation (Mathematics)
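Stopping criteria of the kind varied in this study are typically a precision target on the ability estimate combined with a maximum test length. A minimal sketch with illustrative values:

# Toy CAT stopping rule: stop when the standard error of the ability
# estimate falls below a threshold or a maximum length is reached.
def should_stop(se_theta, items_given, se_target=0.30, max_items=30):
    return se_theta <= se_target or items_given >= max_items

print(should_stop(0.42, 12))   # False: keep testing
print(should_stop(0.28, 12))   # True: precision reached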