Publication Date
In 2025: 3
Since 2024: 7
Since 2021 (last 5 years): 23
Since 2016 (last 10 years): 48
Since 2006 (last 20 years): 87
Descriptor
Item Response Theory: 94
Computer Software: 93
Models: 38
Test Items: 30
Foreign Countries: 23
Comparative Analysis: 22
Computation: 22
Statistical Analysis: 20
Simulation: 19
Correlation: 16
Monte Carlo Methods: 16
Author
Wang, Wen-Chung: 8
DeMars, Christine E.: 4
Jin, Kuan-Yu: 4
Wang, Chun: 4
Jiao, Hong: 3
Luo, Yong: 3
Wilson, Mark: 3
Engelhard, George, Jr.: 2
Huang, Hung-Yu: 2
Kalkan, Ömür Kaya: 2
Kelecioglu, Hülya: 2
Publication Type
Reports - Research: 94
Journal Articles: 78
Speeches/Meeting Papers: 13
Information Analyses: 1
Audience
Teachers: 1
Location
Taiwan: 4
Canada: 2
Hong Kong: 2
Asia: 1
China: 1
Florida: 1
Germany: 1
Japan: 1
Japan (Tokyo): 1
Oman: 1
Saudi Arabia: 1
Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025
Educational tests often contain a cluster of items linked by a common stimulus (a "testlet"). In such a design, the dependencies induced among items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…
Descriptors: Models, Test Items, Educational Assessment, Scores
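For readers unfamiliar with testlet effects, here is a minimal sketch of a Rasch-style testlet model in Python; the function and parameter names are illustrative, and the directional effect studied in the paper would additionally let earlier responses feed into later items.

```python
import numpy as np

def p_correct(theta, b, gamma):
    """Rasch testlet model: theta = ability, b = item difficulty,
    gamma = person-specific effect shared by items in one testlet."""
    return 1.0 / (1.0 + np.exp(-(theta - b + gamma)))

# Two items in the same testlet share gamma, which induces the
# within-testlet dependence described above.
theta = 0.5
b = np.array([0.0, 0.3])
gamma = -0.4
print(p_correct(theta, b, gamma))
```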
Paganin, Sally; Paciorek, Christopher J.; Wehrhahn, Claudia; Rodríguez, Abel; Rabe-Hesketh, Sophia; de Valpine, Perry – Journal of Educational and Behavioral Statistics, 2023
Item response theory (IRT) models typically rely on a normality assumption for subject-specific latent traits, which is often unrealistic in practice. Semiparametric extensions based on Dirichlet process mixtures (DPMs) offer a more flexible representation of the unknown distribution of the latent trait. However, the use of such models in the IRT…
Descriptors: Bayesian Statistics, Item Response Theory, Guidance, Evaluation Methods
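As an illustration of the Dirichlet process mixture idea mentioned in the abstract, the sketch below draws latent traits from a truncated stick-breaking mixture of normals; the truncation level, concentration parameter, and component scale are arbitrary choices, not values from the paper.

```python
import numpy as np

rng = np.random.default_rng(1)

def stick_breaking(alpha, K):
    """Truncated stick-breaking weights of a Dirichlet process."""
    v = rng.beta(1.0, alpha, size=K)
    return v * np.cumprod(np.concatenate(([1.0], 1.0 - v[:-1])))

K = 20
w = stick_breaking(alpha=1.0, K=K)           # mixture weights
mu = rng.normal(0.0, 1.0, size=K)            # component means
z = rng.choice(K, p=w / w.sum(), size=1000)  # component labels
theta = rng.normal(mu[z], 0.5)               # non-normal latent traits
```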
Nianbo Dong; Benjamin Kelcey; Jessaca Spybrook; Yanli Xie; Dung Pham; Peilin Qiu; Ning Sui – Grantee Submission, 2024
Multisite trials that randomize individuals (e.g., students) within sites (e.g., schools) or clusters (e.g., teachers/classrooms) within sites (e.g., schools) are commonly used for program evaluation because they provide opportunities to learn about treatment effects as well as their heterogeneity across sites and subgroups (defined by moderating…
Descriptors: Statistical Analysis, Randomized Controlled Trials, Educational Research, Effect Size
Philip I. Pavlik; Luke G. Eglington – Grantee Submission, 2023
This paper presents a tool for creating student models in logistic regression. Creating student models has typically been done by expert selection of the appropriate terms, beginning with models as simple as IRT or AFM but more recently with highly complex models like BestLR. While alternative methods exist to select the appropriate predictors for…
Descriptors: Students, Models, Regression (Statistics), Alternative Assessment
Philip I. Pavlik; Luke G. Eglington – International Educational Data Mining Society, 2023
This paper presents a tool for creating student models in logistic regression. Creating student models has typically been done by expert selection of the appropriate terms, beginning with models as simple as IRT or AFM but more recently with highly complex models like BestLR. While alternative methods exist to select the appropriate predictors for…
Descriptors: Students, Models, Regression (Statistics), Alternative Assessment
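Both versions of this paper concern logistic-regression student models such as AFM. A minimal AFM-style fit is sketched below on a hand-built toy design matrix; the paper's actual tool and its predictor-selection method are not reproduced here.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Rows are (student, skill, opportunity) observations; columns are
# student dummies, a skill-easiness dummy, and the practice count.
X = np.array([
    [1, 0, 1, 0],   # student A, skill 1, 0 prior practice opportunities
    [1, 0, 1, 1],   # student A, skill 1, 1 prior opportunity
    [0, 1, 1, 0],   # student B, skill 1, 0 prior opportunities
    [0, 1, 1, 2],   # student B, skill 1, 2 prior opportunities
])
y = np.array([0, 1, 0, 1])  # incorrect/correct

model = LogisticRegression().fit(X, y)  # L2-regularized by default
print(model.coef_)
```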
Ö. Emre C. Alagöz; Thorsten Meiser – Educational and Psychological Measurement, 2024
To improve the validity of self-report measures, researchers should control for response style (RS) effects, which can be achieved with IRTree models. A traditional IRTree model considers a response as a combination of distinct decision-making processes, where the substantive trait affects the decision on response direction, while decisions about…
Descriptors: Item Response Theory, Validity, Self Evaluation (Individuals), Decision Making
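The "distinct decision-making processes" of an IRTree model can be made concrete with the usual pseudo-item coding for a 5-point scale, sketched below; this particular tree is a common choice, not necessarily the one used in the paper.

```python
# Pseudo-item coding for a 5-point Likert response:
# m = midpoint chosen, d = response direction (1 = agree side),
# e = extreme category chosen; None means the node is not reached.
TREE = {
    1: (0, 0, 1),
    2: (0, 0, 0),
    3: (1, None, None),
    4: (0, 1, 0),
    5: (0, 1, 1),
}

def decompose(category):
    """Map an observed category to its (m, d, e) pseudo-items."""
    return TREE[category]

print(decompose(5))  # (0, 1, 1): not midpoint, agree side, extreme
```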
Ince Araci, F. Gul; Tan, Seref – International Journal of Assessment Tools in Education, 2022
Computerized Adaptive Testing (CAT) is a beneficial testing technique that decreases the number of items that need to be administered by selecting items according to each individual's ability level. After CAT applications were first constructed on the basis of unidimensional Item Response Theory (IRT), Multidimensional CAT (MCAT) applications have…
Descriptors: Adaptive Testing, Computer Assisted Testing, Simulation, Item Response Theory
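As background for the CAT machinery compared in the study, a bare-bones unidimensional CAT loop under a 2PL model is sketched below, with maximum-information item selection and a deliberately crude ability update; the item-bank size, test length, and simulee's true ability are arbitrary.

```python
import numpy as np

def p2pl(theta, a, b):
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def info(theta, a, b):
    p = p2pl(theta, a, b)
    return a**2 * p * (1.0 - p)

rng = np.random.default_rng(0)
a = rng.uniform(0.8, 2.0, 100)   # discriminations
b = rng.normal(0.0, 1.0, 100)    # difficulties
theta_hat, used = 0.0, []

for _ in range(20):  # fixed-length CAT
    j = max((k for k in range(100) if k not in used),
            key=lambda k: info(theta_hat, a[k], b[k]))  # max information
    used.append(j)
    x = rng.random() < p2pl(0.7, a[j], b[j])  # simulee, true ability 0.7
    p = p2pl(theta_hat, a[j], b[j])
    # crude stochastic-approximation update with a shrinking step size
    theta_hat += (x - p) * 2.0 / len(used)

print(round(theta_hat, 2))
```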
An Analysis of Differential Bundle Functioning in Multidimensional Tests Using the SIBTEST Procedure
Özdogan, Didem; Kelecioglu, Hülya – International Journal of Assessment Tools in Education, 2022
This study aims to analyze differential bundle functioning in multidimensional tests, with the specific purpose of detecting this effect by varying the location of the DIF item in the test, the correlation between the dimensions, the sample size, and the ratio of reference to focal group size. The first 10 items of the test that is…
Descriptors: Correlation, Sample Size, Test Items, Item Analysis
Grimm, Kevin J.; Fine, Kimberly; Stegmann, Gabriela – International Journal of Behavioral Development, 2021
Modeling within-person change over time and between-person differences in change over time is a primary goal in prevention science. When modeling change in an observed score over time with multilevel or structural equation modeling approaches, each observed score counts toward the estimation of model parameters equally. However, observed scores…
Descriptors: Error of Measurement, Weighted Scores, Accuracy, Item Response Theory
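The contrast drawn here, with every observed score counting equally even though scores differ in precision, is the usual motivation for inverse-variance weighting. Below is a minimal sketch of that general idea, not necessarily the authors' proposal; the values are illustrative.

```python
import numpy as np

scores = np.array([10.2, 11.5, 13.1])  # observed scores over time
ses = np.array([0.5, 1.8, 0.7])        # standard errors of measurement

# Equal weighting treats each score the same; inverse-variance
# weighting lets precisely measured scores count more.
w = 1.0 / ses**2
weighted_mean = np.sum(w * scores) / np.sum(w)
print(weighted_mean)
```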
Kyung-Mi O. – Language Testing in Asia, 2024
This study examines the efficacy of artificial intelligence (AI) in creating parallel test items compared to human-made ones. Two test forms were developed: one consisting of 20 existing human-made items and another with 20 new items generated with ChatGPT assistance. Expert reviews confirmed the content parallelism of the two test forms.…
Descriptors: Comparative Analysis, Artificial Intelligence, Computer Software, Test Items
Kalkan, Ömür Kaya – Measurement: Interdisciplinary Research and Perspectives, 2022
The four-parameter logistic (4PL) Item Response Theory (IRT) model has recently been reconsidered in the literature owing to advances in statistical modeling software and recent developments in the estimation of 4PL IRT model parameters. The current simulation study evaluated the performance of expectation-maximization (EM),…
Descriptors: Comparative Analysis, Sample Size, Test Length, Algorithms
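For reference, the 4PL item response function whose parameters are being estimated is simple to write down; the sketch below uses illustrative parameter values (c = lower asymptote for guessing, d = upper asymptote for slipping).

```python
import numpy as np

def p_4pl(theta, a, b, c, d):
    """Four-parameter logistic item response function."""
    return c + (d - c) / (1.0 + np.exp(-a * (theta - b)))

theta = np.linspace(-3, 3, 7)
print(p_4pl(theta, a=1.2, b=0.0, c=0.15, d=0.95))
```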
Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025
Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality, and the quality of AI-generated MCIs is comparable to that of MCIs written by human experts. However, whether the quality of AI-generated MCIs is equally good across various domain-…
Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks
Uto, Masaki; Okano, Masashi – IEEE Transactions on Learning Technologies, 2021
In automated essay scoring (AES), scores are automatically assigned to essays as an alternative to grading by humans. Traditional AES typically relies on handcrafted features, whereas recent studies have proposed AES models based on deep neural networks to obviate the need for feature engineering. Those AES models generally require training on a…
Descriptors: Essays, Scoring, Writing Evaluation, Item Response Theory
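The "handcrafted features" that traditional AES relies on can be as simple as the toy extractor below, fed to a linear scorer; the features and data are illustrative and stand in for neither the traditional systems nor the paper's neural model.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

def handcrafted_features(essay: str):
    """Toy feature extractor: length, type-token ratio, mean word length."""
    words = essay.split()
    n = max(len(words), 1)
    return [len(words), len(set(words)) / n,
            sum(len(w) for w in words) / n]

essays = ["short essay", "a longer essay with more varied vocabulary overall"]
scores = [1.0, 3.0]
X = np.array([handcrafted_features(e) for e in essays])
model = LinearRegression().fit(X, scores)
```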
Mimi Ismail; Ahmed Al-Badri; Said Al-Senaidi – Journal of Education and e-Learning Research, 2025
This study aimed to reveal differences in individuals' abilities, their standard errors, and the psychometric properties of the test across the two modes of test administration (electronic and paper). A descriptive approach was used to achieve the study's objectives. The study sample consisted of 74 male and female students at the…
Descriptors: Achievement Tests, Computer Assisted Testing, Psychometrics, Item Response Theory
Nagy, Gabriel; Ulitzsch, Esther – Educational and Psychological Measurement, 2022
Disengaged item responses pose a threat to the validity of the results provided by large-scale assessments. Several procedures for identifying disengaged responses on the basis of observed response times have been suggested, and item response theory (IRT) models for response engagement have been proposed. We outline that response time-based…
Descriptors: Item Response Theory, Hierarchical Linear Modeling, Predictor Variables, Classification
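A common response-time procedure of the kind this abstract refers to is the normative-threshold heuristic: flag a response as a rapid guess when it is faster than some fraction of the item's typical time. A minimal sketch follows; the 10% cutoff is illustrative.

```python
import numpy as np

def flag_rapid_guesses(rt, threshold_frac=0.10):
    """Flag responses faster than a fraction of the item's median RT."""
    rt = np.asarray(rt, dtype=float)
    return rt < threshold_frac * np.median(rt)

times = [42.0, 35.5, 2.1, 50.3, 1.8, 38.9]  # seconds on one item
print(flag_rapid_guesses(times))            # [False False  True False  True False]
```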