ERIC - Search Results

Publication Date

In 2025	2
Since 2024	5
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	14
Since 2006 (last 20 years)	27

Descriptor

Comparative Analysis	28
Models	23
Computation	10
Simulation	9
Item Response Theory	8
Statistical Analysis	8
Evaluation Methods	7
Foreign Countries	6
Maximum Likelihood Statistics	6
Test Items	6
Monte Carlo Methods	5
Reaction Time	5
Scoring	5
Bayesian Statistics	4
Computer Assisted Testing	4
Computer Software	4
Correlation	4
Item Analysis	4
Regression (Statistics)	4
Statistical Bias	4
Bias	3
Equations (Mathematics)	3
Longitudinal Studies	3
Markov Processes	3
Sample Size	3
More ▼

Source

Journal of Educational and…

Publication Type

Journal Articles	28
Reports - Research	19
Reports - Evaluative	6
Reports - Descriptive	3

Education Level

Secondary Education	4
Elementary Secondary Education	2
Higher Education	2
Elementary Education	1
Grade 1	1
High Schools	1
Postsecondary Education	1

Audience

Location

Netherlands	3
Finland	2
Sweden	2
United Kingdom (England)	2
United States	2
Australia	1
Austria	1
Azerbaijan	1
Belgium	1
Canada	1
China (Shanghai)	1
Cyprus	1
Czech Republic	1
Denmark	1
Estonia	1
France	1
Germany	1
Greece	1
Indonesia	1
Ireland	1
Italy	1
Japan	1
Liechtenstein	1
Massachusetts	1
Montenegro	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
Program for International…	2
Center for Epidemiologic…	1
Law School Admission Test	1
National Longitudinal Study…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 28 results Save | Export

Bayesian Diagnostic Classification Models for a Partially Known Q-Matrix

Peer reviewed

Direct link

Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025

This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…

Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Direct link

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models

Peer reviewed

Direct link

Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025

The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…

Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies

Mixed-Effects Location Scale Models for Joint Modeling School Value-Added Effects on the Mean and Variance of Student Achievement

Peer reviewed

Direct link

George Leckie; Richard Parker; Harvey Goldstein; Kate Tilling – Journal of Educational and Behavioral Statistics, 2024

School value-added models are widely applied to study, monitor, and hold schools to account for school differences in student learning. The traditional model is a mixed-effects linear regression of student current achievement on student prior achievement, background characteristics, and a school random intercept effect. The latter is referred to…

Descriptors: Academic Achievement, Value Added Models, Accountability, Institutional Characteristics

On Longitudinal Item Response Theory Models: A Didactic

Peer reviewed
PDF on ERIC

Download full text

Direct link

Wang, Chun; Nydick, Steven W. – Journal of Educational and Behavioral Statistics, 2020

Recent work on measuring growth with categorical outcome variables has combined the item response theory (IRT) measurement model with the latent growth curve model and extended the assessment of growth to multidimensional IRT models and higher order IRT models. However, there is a lack of synthetic studies that clearly evaluate the strength and…

Descriptors: Item Response Theory, Longitudinal Studies, Comparative Analysis, Models

Estimation of Expected Fisher Information for IRT Models

Peer reviewed

Direct link

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019

In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principal, these procedures may be carried out using either the expected information or the observed information. However, in…

Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences

Developments in Psychometric Population Models for Technology-Based Large-Scale Assessments: An Overview of Challenges and Opportunities

Peer reviewed

Direct link

von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019

International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…

Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory

Peer reviewed

Direct link

Flynt, Abby; Dean, Nema – Journal of Educational and Behavioral Statistics, 2016

Cluster analysis is a set of statistical methods for discovering new group/class structure when exploring data sets. This article reviews the following popular libraries/commands in the R software language for applying different types of cluster analysis: from the stats library, the kmeans, and hclust functions; the mclust library; the poLCA…

Descriptors: Multivariate Analysis, Computer Software, Comparative Analysis, Programming Languages

Normal Theory Two-Stage ML Estimator When Data Are Missing at the Item Level

Peer reviewed

Direct link

Savalei, Victoria; Rhemtulla, Mijke – Journal of Educational and Behavioral Statistics, 2017

In many modeling contexts, the variables in the model are linear composites of the raw items measured for each participant; for instance, regression and path analysis models rely on scale scores, and structural equation models often use parcels as indicators of latent constructs. Currently, no analytic estimation method exists to appropriately…

Descriptors: Computation, Statistical Analysis, Test Items, Maximum Likelihood Statistics

A Strategy for Replacing Sum Scoring

Peer reviewed

Direct link

Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017

This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…

Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics

Posterior Predictive Checks for Conditional Independence between Response Time and Accuracy

Peer reviewed

Direct link

Bolsinova, Maria; Tijmstra, Jesper – Journal of Educational and Behavioral Statistics, 2016

Conditional independence (CI) between response time and response accuracy is a fundamental assumption of many joint models for time and accuracy used in educational measurement. In this study, posterior predictive checks (PPCs) are proposed for testing this assumption. These PPCs are based on three discrepancy measures reflecting different…

Descriptors: Reaction Time, Accuracy, Statistical Analysis, Robustness (Statistics)

Grade of Membership Response Time Model for Detecting Guessing Behaviors

Peer reviewed

Direct link

Pokropek, Artur – Journal of Educational and Behavioral Statistics, 2016

A response model that is able to detect guessing behaviors and produce unbiased estimates in low-stake conditions using timing information is proposed. The model is a special case of the grade of membership model in which responses are modeled as partial members of a class that is affected by motivation and a class that responds only according to…

Descriptors: Reaction Time, Models, Guessing (Tests), Computation

Testing for Aberrant Behavior in Response Time Modeling

Peer reviewed

Direct link

Marianti, Sukaesi; Fox, Jean-Paul; Avetisyan, Marianna; Veldkamp, Bernard P.; Tijmstra, Jesper – Journal of Educational and Behavioral Statistics, 2014

Many standardized tests are now administered via computer rather than paper-and-pencil format. In a computer-based testing environment, it is possible to record not only the test taker's response to each question (item) but also the amount of time spent by the test taker in considering and answering each item. Response times (RTs) provide…

Descriptors: Reaction Time, Response Style (Tests), Computer Assisted Testing, Bayesian Statistics

Using Data-Dependent Priors to Mitigate Small Sample Bias in Latent Growth Models: A Discussion and Illustration Using M"plus"

Peer reviewed

Direct link

McNeish, Daniel M. – Journal of Educational and Behavioral Statistics, 2016

Mixed-effects models (MEMs) and latent growth models (LGMs) are often considered interchangeable save the discipline-specific nomenclature. Software implementations of these models, however, are not interchangeable, particularly with small sample sizes. Restricted maximum likelihood estimation that mitigates small sample bias in MEMs has not been…

Descriptors: Models, Statistical Analysis, Hierarchical Linear Modeling, Sample Size

Previous Page | Next Page »

Pages: 1 | 2

Tijmstra, Jesper	2
Veldkamp, Bernard P.	2
Allan S. Cohen	1
Ariel, Adelaide	1
Aseltine, Robert H., Jr.	1
Avetisyan, Marianna	1
Berger, Martijn P. F.	1
Bolsinova, Maria	1
Bonnet, Gerard	1
Botella, Juan	1
Buchholz, Janine	1
Chen, Haiwen	1
Cho, Sun-Joo	1
Cohen, Allan S.	1
Dean, Nema	1
Debeer, Dries	1
Draper, David	1
Flynt, Abby	1
Fox, Jean-Paul	1
George Leckie	1
Goldstein, Harvey	1
Haberman, Shelby J.	1
Harel, Ofer	1
Hartig, Johannes	1
Harvey Goldstein	1
More ▼