ERIC - Search Results

Showing all 6 results

Using Automated Procedures to Score Educational Essays Written in Three Languages

Peer reviewed

Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025

The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…

Descriptors: College Students, Slavic Languages, German, Italian

Simultaneous Constrained Adaptive Item Selection for Group-Based Testing

Peer reviewed

Bengs, Daniel; Kroehne, Ulf; Brefeld, Ulf – Journal of Educational Measurement, 2021

By tailoring test forms to the test-taker's proficiency, Computerized Adaptive Testing (CAT) enables substantial increases in testing efficiency over fixed forms testing. When used for formative assessment, the alignment of task difficulty with proficiency increases the chance that teachers can derive useful feedback from assessment data. The…

Descriptors: Computer Assisted Testing, Formative Evaluation, Group Testing, Program Effectiveness

Validity Arguments Meet Artificial Intelligence in Innovative Educational Assessment

Peer reviewed

Dorsey, David W.; Michaels, Hillary R. – Journal of Educational Measurement, 2022

We have dramatically advanced our ability to create rich, complex, and effective assessments across a range of uses through technology advancement. Artificial Intelligence (AI) enabled assessments represent one such area of advancement--one that has captured our collective interest and imagination. Scientists and practitioners within the domains…

Descriptors: Validity, Ethics, Artificial Intelligence, Evaluation Methods

Detecting Differential Speededness in Multistage Testing

Peer reviewed

van der Linden, Wim J.; Breithaupt, Krista; Chuah, Siang Chee; Zhang, Yanwei – Journal of Educational Measurement, 2007

A potential undesirable effect of multistage testing is differential speededness, which happens if some of the test takers run out of time because they receive subtests with items that are more time intensive than others. This article shows how a probabilistic response-time model can be used for estimating differences in time intensities and speed…

Descriptors: Adaptive Testing, Evaluation Methods, Test Items, Reaction Time

Guidelines for Computerized-Adaptive Test Development and Use in Education [Book Review].

Peer reviewed

Eignor, Daniel R. – Journal of Educational Measurement, 1997

The authors of the "Guidelines," a task force of eight, intend to present an organized list of features to be considered in reporting or evaluating computerized-adaptive assessments. Apart from a few weaknesses, the book is a useful and complete document that will be very helpful to test developers. (SLD)

Descriptors: Adaptive Testing, Computer Assisted Testing, Evaluation Methods, Guidelines

Comparing Methods of Assessing Differential Item Functioning in a Computerized Adaptive Testing Environment

Peer reviewed

Lei, Pui-Wa; Chen, Shu-Ying; Yu, Lan – Journal of Educational Measurement, 2006

Mantel-Haenszel and SIBTEST, which have known difficulty in detecting non-unidirectional differential item functioning (DIF), have been adapted with some success for computerized adaptive testing (CAT). This study adapts logistic regression (LR) and the item-response-theory-likelihood-ratio test (IRT-LRT), capable of detecting both unidirectional…

Descriptors: Evaluation Methods, Test Bias, Computer Assisted Testing, Multiple Regression Analysis