ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	19

Descriptor

Comparative Analysis	25
Educational Assessment	25
Simulation	18
Evaluation Methods	10
Models	9
Computer Assisted Testing	7
Computer Simulation	7
Computer Software	7
Item Response Theory	7
Test Items	7
Classification	6
Probability	6
Statistical Analysis	6
Bayesian Statistics	4
Computer Assisted Instruction	4
Correlation	4
Item Analysis	4
Performance Based Assessment	4
Programming	4
Scores	4
Student Evaluation	4
Academic Achievement	3
Accuracy	3
Computation	3
Data Analysis	3
More ▼

Source

Applied Measurement in…	3
ProQuest LLC	3
IEEE Transactions on Learning…	2
Journal of Educational…	2
Journal of Educational and…	2
Applied Psychological…	1
Computers & Education	1
International Association for…	1
International Journal of…	1
International Working Group…	1
Journal of Applied Testing…	1
Journal of the Learning…	1
Measurement:…	1
More ▼

Publication Type

Journal Articles	15
Reports - Research	11
Reports - Evaluative	7
Speeches/Meeting Papers	4
Dissertations/Theses -…	3
Collected Works - Proceedings	2
Reports - Descriptive	2

Education Level

Elementary Secondary Education	4
High Schools	3
Grade 9	2
Higher Education	2
Middle Schools	2
Postsecondary Education	2
Secondary Education	2
Adult Education	1
Elementary Education	1
Grade 10	1
Grade 12	1
Grade 4	1
Grade 7	1
Grade 8	1
Junior High Schools	1
More ▼

Audience

Location

Australia	2
Connecticut	2
Israel	2
Japan	2
Massachusetts	2
Netherlands	2
Pennsylvania	2
Spain	2
Asia	1
Brazil	1
Czech Republic	1
Denmark	1
Egypt	1
Estonia	1
Florida	1
Germany	1
Greece	1
Hawaii	1
Ireland	1
Italy	1
Kazakhstan	1
New York (New York)	1
North Carolina	1
Norway	1
Ohio	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

ACT Assessment	1
Massachusetts Comprehensive…	1
Program for International…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

Bayesian Diagnostic Classification Models for a Partially Known Q-Matrix

Peer reviewed

Direct link

Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025

This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…

Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Hybrid Maximum Clique Algorithm Using Parallel Integer Programming for Uniform Test Assembly

Peer reviewed

Direct link

Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022

Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…

Descriptors: Simulation, Efficiency, Test Items, Educational Assessment

GDINA and CDM Packages in R

Peer reviewed

Direct link

Rupp, André A.; van Rijn, Peter W. – Measurement: Interdisciplinary Research and Perspectives, 2018

We review the GIDNA and CDM packages in R for fitting cognitive diagnosis/diagnostic classification models. We first provide a summary of their core capabilities and then use both simulated and real data to compare their functionalities in practice. We found that the most relevant routines in the two packages appear to be more similar than…

Descriptors: Educational Assessment, Cognitive Measurement, Measurement, Computer Software

IRT Item Parameter Scaling for Developing New Item Pools

Peer reviewed

Direct link

Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua – Applied Measurement in Education, 2017

Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. The three scaling procedures are considered: (a) concurrent…

Descriptors: Item Response Theory, Accuracy, Educational Assessment, Test Items

Semiparametric Item Response Functions in the Context of Guessing

Peer reviewed

Direct link

Falk, Carl F.; Cai, Li – Journal of Educational Measurement, 2016

We present a logistic function of a monotonic polynomial with a lower asymptote, allowing additional flexibility beyond the three-parameter logistic model. We develop a maximum marginal likelihood-based approach to estimate the item parameters. The new item response model is demonstrated on math assessment data from a state, and a computationally…

Descriptors: Item Response Theory, Guessing (Tests), Mathematics Tests, Simulation

Detection of Invalid Test Scores: The Usefulness of Simple Nonparametric Statistics

Peer reviewed

Direct link

Tendeiro, Jorge N.; Meijer, Rob R. – Journal of Educational Measurement, 2014

In recent guidelines for fair educational testing it is advised to check the validity of individual test scores through the use of person-fit statistics. For practitioners it is unclear on the basis of the existing literature which statistic to use. An overview of relatively simple existing nonparametric approaches to identify atypical response…

Descriptors: Educational Assessment, Test Validity, Scores, Statistical Analysis

Posterior Predictive Model Checking in Bayesian Networks

Direct link

Crawford, Aaron – ProQuest LLC, 2014

This simulation study compared the utility of various discrepancy measures within a posterior predictive model checking (PPMC) framework for detecting different types of data-model misfit in multidimensional Bayesian network (BN) models. The investigated conditions were motivated by an applied research program utilizing an operational complex…

Descriptors: Bayesian Statistics, Networks, Models, Goodness of Fit

Parameter Recovery and Classification Accuracy under Conditions of Testlet Dependency: A Comparison of the Traditional 2PL, Testlet, and Bi-Factor Models

Peer reviewed

Direct link

Koziol, Natalie A. – Applied Measurement in Education, 2016

Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite their advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…

Descriptors: Classification, Accuracy, Comparative Analysis, Models

Maximum Clique Algorithm and Its Approximation for Uniform Test Form Assembly

Peer reviewed

Direct link

Ishii, Takatoshi; Songmuang, Pokpong; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2014

Educational assessments occasionally require uniform test forms for which each test form comprises a different set of items, but the forms meet equivalent test specifications (i.e., qualities indicated by test information functions based on item response theory). We propose two maximum clique algorithms (MCA) for uniform test form assembly. The…

Descriptors: Simulation, Efficiency, Test Items, Educational Assessment

Between-Person and Within-Person Subscore Reliability: Comparison of Unidimensional and Multidimensional IRT Models

Direct link

Bulut, Okan – ProQuest LLC, 2013

The importance of subscores in educational and psychological assessments is undeniable. Subscores yield diagnostic information that can be used for determining how each examinee's abilities/skills vary over different content domains. One of the most common criticisms about reporting and using subscores is insufficient reliability of subscores.…

Descriptors: Item Response Theory, Simulation, Correlation, Reliability

A Study of Assessments Designed for Student Success

Direct link

Delepine, Sidney G., III – ProQuest LLC, 2012

The purpose of this quantitative study is to compare a new assessment tool, the SkillsUSA Connect Assessment with the NOCTI assessment to determine which test results in more students achieving success. A quantitative study, designed to compare test scores of students taking the NOCTI assessment and new assessments from SkillsUSA, called the…

Descriptors: Educational Assessment, Academic Achievement, Scores, Comparative Analysis

A Comparison of IRT Linking Procedures

Peer reviewed

Direct link

Lee, Won-Chan; Ban, Jae-Chun – Applied Measurement in Education, 2010

Various applications of item response theory often require linking to achieve a common scale for item parameter estimates obtained from different groups. This article used a simulation to examine the relative performance of four different item response theory (IRT) linking procedures in a random groups equating design: concurrent calibration with…

Descriptors: Item Response Theory, Simulation, Comparative Analysis, Measurement Techniques

Differential Item Functioning Analysis Using Rasch Item Information Functions

Peer reviewed

Direct link

Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009

Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…

Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment

Multilevel Assessment for Discourse, Understanding, and Achievement

Peer reviewed

Direct link

Hickey, Daniel T.; Zuiker, Steven J. – Journal of the Learning Sciences, 2012

Evaluating the impact of instructional innovations and coordinating instruction, assessment, and testing present complex tensions. Many evaluation and coordination efforts aim to address these tensions by using the coherence provided by modern cognitive science perspectives on domain-specific learning. This paper introduces an alternative…

Descriptors: Program Effectiveness, Achievement Tests, Performance Based Assessment, Genetics

Previous Page | Next Page »

Pages: 1 | 2

Ishii, Takatoshi	2
Ueno, Maomi	2
Allan S. Cohen	1
Ban, Jae-Chun	1
Barnes, Tiffany, Ed.	1
Barr, James	1
Black, John B.	1
Bulut, Okan	1
Cai, Li	1
Chang, Hua-Hua	1
Chapman, Dane M.	1
Chassapis, Constantin	1
Corter, James E.	1
Crawford, Aaron	1
Delepine, Sidney G., III	1
Desmarais, Michel, Ed.	1
Dhanidina, Lutaf	1
Dost, Marcia A.	1
Esche, Sven K.	1
Falk, Carl F.	1
Finch, F. L.	1
Fuchimoto, Kazuma	1
Hickey, Daniel T.	1
Higgins, Jennifer	1
More ▼