Showing all 11 results
Peer reviewed
Shu, Tian; Luo, Guanzhong; Luo, Zhaosheng; Yu, Xiaofeng; Guo, Xiaojun; Li, Yujun – Journal of Educational and Behavioral Statistics, 2023
Cognitive diagnosis models (CDMs) are the statistical framework for cognitive diagnostic assessment in education and psychology. They generally assume that subjects' latent attributes are dichotomous--mastery or nonmastery, which seems quite deterministic. As an alternative to dichotomous attribute mastery, attention is drawn to the use of a…
Descriptors: Cognitive Measurement, Models, Diagnostic Tests, Accuracy
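For readers unfamiliar with the dichotomous setup this abstract contrasts against, the sketch below shows how a standard DINA-type cognitive diagnosis model maps a 0/1 attribute-mastery vector to item response probabilities. The Q-matrix, slip, and guess values are invented for illustration and are not taken from the article.

```python
import numpy as np

Q = np.array([[1, 0], [0, 1], [1, 1]])   # rows: items; columns: required attributes
slip = np.array([0.10, 0.15, 0.20])      # hypothetical slip parameters
guess = np.array([0.20, 0.20, 0.10])     # hypothetical guess parameters

def p_correct(alpha):
    """alpha: 0/1 mastery vector; P(correct) per item under the DINA rule."""
    eta = np.all(Q <= alpha, axis=1)     # True iff every required attribute is mastered
    return np.where(eta, 1 - slip, guess)

print(p_correct(np.array([1, 0])))       # masters attribute 1 only -> [0.9, 0.2, 0.1]
print(p_correct(np.array([1, 1])))       # masters both attributes  -> [0.9, 0.85, 0.8]
```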
Peer reviewed
Martinková, Patrícia; Bartoš, František; Brabec, Marek – Journal of Educational and Behavioral Statistics, 2023
Inter-rater reliability (IRR), which is a prerequisite of high-quality ratings and assessments, may be affected by contextual variables, such as the rater's or ratee's gender, major, or experience. Identification of such heterogeneity sources in IRR is important for the implementation of policies with the potential to decrease measurement error…
Descriptors: Interrater Reliability, Bayesian Statistics, Statistical Inference, Hierarchical Linear Modeling
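The abstract above concerns heterogeneity in inter-rater reliability (IRR) across contexts. As a purely illustrative sketch (the article itself uses Bayesian hierarchical models, not the frequentist calculation below), this code computes a one-way random-effects ICC separately for two hypothetical contextual groups whose raters differ in noise, which is one concrete way such heterogeneity would surface.

```python
import numpy as np

def icc_oneway(ratings):
    """ratings: (n_subjects, k_raters) array; one-way random-effects ICC(1)."""
    n, k = ratings.shape
    grand = ratings.mean()
    ms_between = k * np.sum((ratings.mean(axis=1) - grand) ** 2) / (n - 1)
    ms_within = np.sum((ratings - ratings.mean(axis=1, keepdims=True)) ** 2) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)

rng = np.random.default_rng(0)
# Two hypothetical contexts with different rater error variances -> different IRR.
group_a = rng.normal(0, 1, (50, 1)) + rng.normal(0, 0.5, (50, 3))   # low rater noise
group_b = rng.normal(0, 1, (50, 1)) + rng.normal(0, 1.5, (50, 3))   # high rater noise
print("ICC, context A:", round(icc_oneway(group_a), 2))
print("ICC, context B:", round(icc_oneway(group_b), 2))
```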
Peer reviewed
van der Linden, Wim J.; Ren, Hao – Journal of Educational and Behavioral Statistics, 2020
The Bayesian way of accounting for the effects of error in the ability and item parameters in adaptive testing is through the joint posterior distribution of all parameters. An optimized Markov chain Monte Carlo algorithm for adaptive testing is presented, which samples this distribution in real time to score the examinee's ability and optimally…
Descriptors: Bayesian Statistics, Adaptive Testing, Error of Measurement, Markov Processes
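A minimal sketch of the general idea, not the authors' optimized algorithm: a toy Metropolis sampler draws from the ability posterior under a 2PL model given the responses so far, and the next item is chosen to maximize Fisher information averaged over the posterior draws. Item parameters, the prior, and all tuning constants are invented.

```python
import numpy as np

rng = np.random.default_rng(1)
a = rng.uniform(0.8, 2.0, 100)          # hypothetical item discriminations
b = rng.normal(0.0, 1.0, 100)           # hypothetical item difficulties

def p_correct(theta, a_i, b_i):
    return 1.0 / (1.0 + np.exp(-a_i * (theta - b_i)))

def log_post(theta, items, resp):
    p = p_correct(theta, a[items], b[items])
    loglik = np.sum(resp * np.log(p) + (1 - resp) * np.log(1 - p))
    return loglik - 0.5 * theta ** 2            # standard normal prior on ability

def sample_theta(items, resp, n_draws=2000, step=0.5):
    theta, draws = 0.0, []
    lp = log_post(theta, items, resp)
    for _ in range(n_draws):
        prop = theta + rng.normal(0, step)
        lp_prop = log_post(prop, items, resp)
        if np.log(rng.uniform()) < lp_prop - lp:
            theta, lp = prop, lp_prop
        draws.append(theta)
    return np.array(draws[n_draws // 2:])       # drop burn-in

def next_item(draws, administered):
    info = np.zeros(len(a))
    for i in range(len(a)):
        if i in administered:
            continue
        p = p_correct(draws, a[i], b[i])
        info[i] = np.mean(a[i] ** 2 * p * (1 - p))   # posterior-averaged Fisher information
    return int(np.argmax(info))

administered, responses = [5, 42], [1, 0]       # hypothetical items already given
draws = sample_theta(np.array(administered), np.array(responses))
print("posterior mean ability :", round(draws.mean(), 2))
print("next item to administer:", next_item(draws, set(administered)))
```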
Peer reviewed
Longford, Nicholas Tibor – Journal of Educational and Behavioral Statistics, 2016
We address the problem of selecting the best of a set of units based on a criterion variable, when its value is recorded for every unit subject to estimation, measurement, or another source of error. The solution is constructed in a decision-theoretical framework, incorporating the consequences (ramifications) of the various kinds of error that…
Descriptors: Decision Making, Classification, Guidelines, Undergraduate Students
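To make the selection problem concrete (this is not Longford's specific decision rule), the sketch below simulates from the error distribution of each unit's estimate, computes the probability that each unit is truly best, and attaches an invented loss to selecting a unit that is not in fact best.

```python
import numpy as np

rng = np.random.default_rng(2)
est = np.array([0.52, 0.48, 0.45])     # hypothetical estimated criterion values
se  = np.array([0.05, 0.02, 0.03])     # hypothetical standard errors

draws = rng.normal(est, se, size=(20_000, 3))          # plausible true values
p_best = np.bincount(draws.argmax(axis=1), minlength=3) / len(draws)

loss_wrong_pick = 1.0                  # invented cost of selecting a non-best unit
expected_loss = loss_wrong_pick * (1 - p_best)
print("P(unit is truly best)          :", p_best.round(3))
print("expected loss of picking a unit:", expected_loss.round(3))
```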
Peer reviewed
Liang, Longjuan; Browne, Michael W. – Journal of Educational and Behavioral Statistics, 2015
If standard two-parameter item response functions are employed in the analysis of a test with some newly constructed items, it can be expected that, for some items, the item response function (IRF) will not fit the data well. This lack of fit can also occur when standard IRFs are fitted to personality or psychopathology items. When investigating…
Descriptors: Item Response Theory, Statistical Analysis, Goodness of Fit, Bayesian Statistics
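One simple, illustrative way to see the kind of lack of fit the abstract describes (this check is not the article's method): compare a fitted 2PL curve's predicted proportions correct with observed proportions in ability groups, using simulated data whose true response function includes guessing. The "fitted" parameters below are invented.

```python
import numpy as np

rng = np.random.default_rng(3)
theta = rng.normal(0, 1, 5000)
true_p = 0.2 + 0.8 / (1 + np.exp(-1.5 * (theta - 0.5)))      # true IRF has guessing
resp = rng.binomial(1, true_p)

a_hat, b_hat = 1.2, 0.3                                      # hypothetical 2PL estimates
bins = np.digitize(theta, np.linspace(-2, 2, 9))
for g in np.unique(bins):
    mask = bins == g
    obs = resp[mask].mean()
    pred = (1 / (1 + np.exp(-a_hat * (theta[mask] - b_hat)))).mean()
    print(f"ability group {g}: observed {obs:.2f}   2PL-predicted {pred:.2f}")
```

At the low end of the ability scale the observed proportions sit near the guessing floor while the 2PL curve heads toward zero, which is the sort of systematic discrepancy the abstract refers to.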
Peer reviewed
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015
Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the χ² statistic, both of which have been used for tests…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics
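As background for the person-fit statistics named in the abstract, the sketch below computes the classical l_z statistic for dichotomous items under a 2PL model; the article's contribution is the extension to mixed-format tests, which this does not show. Item parameters, the ability value, and the response pattern are invented.

```python
import numpy as np

def lz_statistic(resp, theta, a, b):
    """Standardized log-likelihood person-fit statistic for 0/1 responses under a 2PL model."""
    p = 1 / (1 + np.exp(-a * (theta - b)))
    l0 = np.sum(resp * np.log(p) + (1 - resp) * np.log(1 - p))        # observed log-likelihood
    e  = np.sum(p * np.log(p) + (1 - p) * np.log(1 - p))              # its expectation
    v  = np.sum(p * (1 - p) * np.log(p / (1 - p)) ** 2)               # its variance
    return (l0 - e) / np.sqrt(v)

a = np.array([1.0, 1.4, 0.8, 1.2, 1.6])
b = np.array([-1.0, -0.3, 0.0, 0.6, 1.2])
resp = np.array([1, 1, 1, 0, 0])          # a plausible pattern for theta = 0.2
print("lz:", round(lz_statistic(resp, 0.2, a, b), 2))   # large negative values flag misfit
```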
Peer reviewed
Lockwood, J. R.; McCaffrey, Daniel F. – Journal of Educational and Behavioral Statistics, 2014
A common strategy for estimating treatment effects in observational studies using individual student-level data is analysis of covariance (ANCOVA) or hierarchical variants of it, in which outcomes (often standardized test scores) are regressed on pretreatment test scores, other student characteristics, and treatment group indicators. Measurement…
Descriptors: Error of Measurement, Scores, Statistical Analysis, Computation
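A hedged simulation of the problem the abstract raises, not the article's estimator: when the pretest covariate is measured with error and treatment assignment is related to true ability, the naive ANCOVA treatment coefficient is biased; a method-of-moments errors-in-variables correction that subtracts the known error variance from the normal equations removes most of the bias. The reliability, effect sizes, and selection rule below are all invented.

```python
import numpy as np

rng = np.random.default_rng(4)
n = 20_000
true_pre = rng.normal(0, 1, n)
treat = (true_pre + rng.normal(0, 1, n) > 0).astype(float)      # selection on true ability
post = 0.8 * true_pre + 0.0 * treat + rng.normal(0, 0.5, n)     # true treatment effect is 0

reliability = 0.7
sigma2_e = (1 - reliability) / reliability         # error variance when Var(true pretest) = 1
obs_pre = true_pre + rng.normal(0, np.sqrt(sigma2_e), n)

X = np.column_stack([np.ones(n), obs_pre, treat])
naive = np.linalg.lstsq(X, post, rcond=None)[0]
print("naive ANCOVA treatment effect :", round(naive[2], 3))    # biased away from 0

# Method-of-moments errors-in-variables fix: remove the measurement error variance
# from the pretest's diagonal entry of X'X before solving the normal equations.
XtX = X.T @ X
XtX[1, 1] -= n * sigma2_e
corrected = np.linalg.solve(XtX, X.T @ post)
print("EIV-corrected treatment effect:", round(corrected[2], 3))
```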
Peer reviewed
Han, Bing; Dalal, Siddhartha R.; McCaffrey, Daniel F. – Journal of Educational and Behavioral Statistics, 2012
There is widespread interest in using various statistical inference tools as a part of the evaluations for individual teachers and schools. Evaluation systems typically involve classifying hundreds or even thousands of teachers or schools according to their estimated performance. Many current evaluations are largely based on individual estimates…
Descriptors: Statistical Inference, Error of Measurement, Classification, Statistical Analysis
Peer reviewed
Cao, Jing; Stokes, S. Lynne; Zhang, Song – Journal of Educational and Behavioral Statistics, 2010
We develop a Bayesian hierarchical model for the analysis of ordinal data from multirater ranking studies. The model for a rater's score includes four latent factors: one is a latent item trait determining the true order of items and the other three are the rater's performance characteristics, including bias, discrimination, and measurement error…
Descriptors: Bayesian Statistics, Data Analysis, Bias, Measurement
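A generative sketch of the four-factor structure the abstract describes (a latent item trait plus each rater's bias, discrimination, and measurement error), not the article's Bayesian estimation procedure. All parameter values are invented.

```python
import numpy as np

rng = np.random.default_rng(5)
n_items, n_raters = 30, 5
trait = rng.normal(0, 1, n_items)                 # latent item trait -> true ordering
bias = rng.normal(0, 0.3, n_raters)               # rater bias
disc = rng.lognormal(0, 0.3, n_raters)            # rater discrimination
err_sd = rng.uniform(0.2, 0.8, n_raters)          # rater-specific measurement error

scores = (bias[None, :]
          + disc[None, :] * trait[:, None]
          + rng.normal(0.0, err_sd, (n_items, n_raters)))

print("true item order      :", trait.argsort())
print("rater 0's item order :", scores[:, 0].argsort())   # distorted by bias and error
```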
Peer reviewed
Boyd, Donald; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – Journal of Educational and Behavioral Statistics, 2013
Test-based accountability as well as value-added assessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for…
Descriptors: Accountability, Educational Research, Educational Testing, Error of Measurement
Peer reviewed
Shin, Yongyun; Raudenbush, Stephen W. – Journal of Educational and Behavioral Statistics, 2010
In organizational studies involving multiple levels, the association between a covariate and an outcome often differs at different levels of aggregation, giving rise to widespread interest in "contextual effects models." Such models partition the regression into within- and between-cluster components. The conventional approach uses each…
Descriptors: Academic Achievement, National Surveys, Computation, Inferences
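The sketch below implements the conventional approach the abstract alludes to: regress the outcome on the observed cluster mean and the within-cluster deviation, and read the contextual effect off the difference between the two slopes. The simulation also illustrates why this is problematic, since the observed cluster means are error-prone stand-ins for the true means. Data and coefficients are invented.

```python
import numpy as np

rng = np.random.default_rng(6)
n_clusters, n_per = 200, 8
cid = np.repeat(np.arange(n_clusters), n_per)
mu = rng.normal(0, 1, n_clusters)                     # true cluster means
x = mu[cid] + rng.normal(0, 1, n_clusters * n_per)    # individual-level covariate
y = 0.4 * (x - mu[cid]) + 0.9 * mu[cid] + rng.normal(0, 1, len(x))   # contextual effect = 0.5

xbar = np.array([x[cid == j].mean() for j in range(n_clusters)])[cid]  # observed cluster means
X = np.column_stack([np.ones(len(x)), xbar, x - xbar])
beta = np.linalg.lstsq(X, y, rcond=None)[0]
print("between-cluster slope:", round(beta[1], 2))    # pulled toward the within slope
print("within-cluster slope :", round(beta[2], 2))
print("contextual effect    :", round(beta[1] - beta[2], 2))   # true value is 0.5
```

Because each observed cluster mean carries sampling error, the between-cluster slope is attenuated toward the within-cluster slope, and the estimated contextual effect comes out below its true value of 0.5.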