ERIC - Search Results

Publication Date

In 2025	1
Since 2024	10
Since 2021 (last 5 years)	20
Since 2016 (last 10 years)	20
Since 2006 (last 20 years)	20

Descriptor

Algorithms	48
Test Items	48
Computer Assisted Testing	18
Test Construction	17
Item Analysis	14
Adaptive Testing	13
Foreign Countries	10
Multiple Choice Tests	9
Item Banks	8
Item Response Theory	8
Latent Trait Theory	8
Mathematical Models	8
Accuracy	7
Difficulty Level	7
Models	7
Psychometrics	7
Selection	7
Simulation	7
Computation	6
Testing Problems	6
Guessing (Tests)	5
Higher Education	5
Test Format	5
Achievement Tests	4
Artificial Intelligence	4
More ▼

Source

Grantee Submission	4
Journal of Educational and…	4
Educational and Psychological…	3
Education and Information…	2
Journal of Educational…	2
Applied Psychological…	1
Educational Measurement:…	1
Field Methods	1
Informatics in Education	1
International Society for…	1
Journal for Research in…	1
Journal of Applied Measurement	1
Journal of Educational…	1
Large-scale Assessments in…	1
Measurement:…	1
NWEA	1
Psychometrika	1
More ▼

Publication Type

Reports - Research	48
Journal Articles	21
Speeches/Meeting Papers	7
Numerical/Quantitative Data	1
Opinion Papers	1

Education Level

Elementary Secondary Education	2
Higher Education	2
Postsecondary Education	2
Secondary Education	2
Elementary Education	1

Audience

Researchers	4
Practitioners	1
Teachers	1

Location

Mexico	1
New York (Rochester)	1
Portugal	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Big Five Inventory	1
Measures of Academic Progress	1
National Assessment of…	1
Program for International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 48 results Save | Export

Optimal Calibration of Items for Multidimensional Achievement Tests

Peer reviewed

Direct link

Mahmood Ul Hassan; Frank Miller – Journal of Educational Measurement, 2024

Multidimensional achievement tests are recently gaining more importance in educational and psychological measurements. For example, multidimensional diagnostic tests can help students to determine which particular domain of knowledge they need to improve for better performance. To estimate the characteristics of candidate items (calibration) for…

Descriptors: Multidimensional Scaling, Achievement Tests, Test Items, Test Construction

Exploring Quality Criteria and Evaluation Methods in Automated Question Generation: A Comprehensive Survey

Peer reviewed

Direct link

Guher Gorgun; Okan Bulut – Education and Information Technologies, 2024

In light of the widespread adoption of technology-enhanced learning and assessment platforms, there is a growing demand for innovative, high-quality, and diverse assessment questions. Automatic Question Generation (AQG) has emerged as a valuable solution, enabling educators and assessment developers to efficiently produce a large volume of test…

Descriptors: Computer Assisted Testing, Test Construction, Test Items, Automation

Item Selection Algorithm Based on Collaborative Filtering for Item Exposure Control

Peer reviewed

Direct link

Pan, Yiqin; Livne, Oren; Wollack, James A.; Sinharay, Sandip – Educational Measurement: Issues and Practice, 2023

In computerized adaptive testing, overexposure of items in the bank is a serious problem and might result in item compromise. We develop an item selection algorithm that utilizes the entire bank well and reduces the overexposure of items. The algorithm is based on collaborative filtering and selects an item in two stages. In the first stage, a set…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms

Latent Variable Forests for Latent Variable Score Estimation

Peer reviewed

Direct link

Franz Classe; Christoph Kern – Educational and Psychological Measurement, 2024

We develop a "latent variable forest" (LV Forest) algorithm for the estimation of latent variable scores with one or more latent variables. LV Forest estimates unbiased latent variable scores based on "confirmatory factor analysis" (CFA) models with ordinal and/or numerical response variables. Through parametric model…

Descriptors: Algorithms, Item Response Theory, Artificial Intelligence, Factor Analysis

Using Attributes of Survey Items to Predict Response Times May Benefit Survey Research

Peer reviewed

Direct link

Schneider, Stefan; Jin, Haomiao; Orriens, Bart; Junghaenel, Doerte U.; Kapteyn, Arie; Meijer, Erik; Stone, Arthur A. – Field Methods, 2023

Researchers have become increasingly interested in response times to survey items as a measure of cognitive effort. We used machine learning to develop a prediction model of response times based on 41 attributes of survey items (e.g., question length, response format, linguistic features) collected in a large, general population sample. The…

Descriptors: Surveys, Response Rates (Questionnaires), Test Items, Artificial Intelligence

A Psychometric Framework for Evaluating Fairness in Algorithmic Decision Making: Differential Algorithmic Functioning

Peer reviewed

Direct link

Youmi Suk; Kyung T. Han – Journal of Educational and Behavioral Statistics, 2024

As algorithmic decision making is increasingly deployed in every walk of life, many researchers have raised concerns about fairness-related bias from such algorithms. But there is little research on harnessing psychometric methods to uncover potential discriminatory bias inside decision-making algorithms. The main goal of this article is to…

Descriptors: Psychometrics, Ethics, Decision Making, Algorithms

Effects of the Quantity and Magnitude of Cross-Loading and Model Specification on MIRT Item Parameter Recovery

Peer reviewed

Direct link

Mostafa Hosseinzadeh; Ki Lynn Matlock Cole – Educational and Psychological Measurement, 2024

In real-world situations, multidimensional data may appear on large-scale tests or psychological surveys. The purpose of this study was to investigate the effects of the quantity and magnitude of cross-loadings and model specification on item parameter recovery in multidimensional Item Response Theory (MIRT) models, especially when the model was…

Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Algorithms

Development of a High-Accuracy and Effective Online Calibration Method in CD-CAT Based on Gini Index

Peer reviewed

Direct link

Tan, Qingrong; Cai, Yan; Luo, Fen; Tu, Dongbo – Journal of Educational and Behavioral Statistics, 2023

To improve the calibration accuracy and calibration efficiency of cognitive diagnostic computerized adaptive testing (CD-CAT) for new items and, ultimately, contribute to the widespread application of CD-CAT in practice, the current article proposed a Gini-based online calibration method that can simultaneously calibrate the Q-matrix and item…

Descriptors: Cognitive Tests, Computer Assisted Testing, Adaptive Testing, Accuracy

Efficiency of PROMIS MCAT Assessments for Orthopaedic Care

Peer reviewed

Direct link

Michael Bass; Scott Morris; Sheng Zhang – Measurement: Interdisciplinary Research and Perspectives, 2025

Administration of patient-reported outcome measures (PROs), using multidimensional computer adaptive tests (MCATs) has the potential to reduce patient burden, but the efficiency of MCAT depends on the degree to which an individual's responses fit the psychometric properties of the assessment. Assessing patients' symptom burden through the…

Descriptors: Adaptive Testing, Computer Assisted Testing, Patients, Outcome Measures

Multi-Group Regularized Gaussian Variational Estimation: Fast Detection of DIF

Peer reviewed

Direct link

Weicong Lyu; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Data harmonization is an emerging approach to strategically combining data from multiple independent studies, enabling addressing new research questions that are not answerable by a single contributing study. A fundamental psychometric challenge for data harmonization is to create commensurate measures for the constructs of interest across…

Descriptors: Data Analysis, Test Items, Psychometrics, Item Response Theory

Incorporating Test-Taking Engagement into the Item Selection Algorithm in Low-Stakes Computerized Adaptive Tests

Peer reviewed

Direct link

Gorgun, Guher; Bulut, Okan – Large-scale Assessments in Education, 2023

In low-stakes assessment settings, students' performance is not only influenced by students' ability level but also their test-taking engagement. In computerized adaptive tests (CATs), disengaged responses (e.g., rapid guesses) that fail to reflect students' true ability levels may lead to the selection of less informative items and thereby…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms

Progress Is Impossible without Change: Implementing Automatic Item Generation in Medical Knowledge Progress Testing

Peer reviewed

Direct link

Filipe Manuel Vidal Falcão; Daniela S.M. Pereira; José Miguel Pêgo; Patrício Costa – Education and Information Technologies, 2024

Progress tests (PT) are a popular type of longitudinal assessment used for evaluating clinical knowledge retention and long-life learning in health professions education. Most PTs consist of multiple-choice questions (MCQs) whose development is costly and time-consuming. Automatic Item Generation (AIG) generates test items through algorithms,…

Descriptors: Automation, Test Items, Progress Monitoring, Medical Education

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Generating Multiple Choice Questions from a Textbook: LLMs Match Human Performance on Most Metrics

Peer reviewed
PDF on ERIC

Download full text

Andrew M. Olney – Grantee Submission, 2023

Multiple choice questions are traditionally expensive to produce. Recent advances in large language models (LLMs) have led to fine-tuned LLMs that generate questions competitive with human-authored questions. However, the relative capabilities of ChatGPT-family models have not yet been established for this task. We present a carefully-controlled…

Descriptors: Test Construction, Multiple Choice Tests, Test Items, Algorithms

Changing the Success Probability in Computerized Adaptive Testing: A Monte-Carlo Simultion on the Open Matrices Item Bank

Peer reviewed
PDF on ERIC

Download full text

Hanif Akhtar – International Society for Technology, Education, and Science, 2023

For efficiency, Computerized Adaptive Test (CAT) algorithm selects items with the maximum information, typically with a 50% probability of being answered correctly. However, examinees may not be satisfied if they only correctly answer 50% of the items. Researchers discovered that changing the item selection algorithms to choose easier items (i.e.,…

Descriptors: Success, Probability, Computer Assisted Testing, Adaptive Testing

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Chun Wang	3
Roid, Gale	3
Boekkooi-Timminga, Ellen	2
Gongjun Xu	2
van der Linden, Wim J.	2
Allan S. Cohen	1
Anderson, Lorin W.	1
Anderson, Ronald E.	1
Andrew M. Olney	1
Bejar, Isaac I.	1
Blando, John A.	1
Bock, R. Darrell	1
Bowles, Ryan	1
Bulut, Okan	1
Cai, Yan	1
Chengyu Cui	1
Chiu, Chia-Yi	1
Choppin, Bruce	1
Christoph Kern	1
Daniela S.M. Pereira	1
Dogan, Dilek	1
Filipe Manuel Vidal Falcão	1
Finn, Patrick	1
Frank Miller	1
More ▼