ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	12

Descriptor

Bayesian Statistics	20
Statistical Analysis	8
Models	7
Goodness of Fit	6
Item Response Theory	6
Probability	6
Cheating	5
Test Items	5
Computation	4
Educational Assessment	4
Markov Processes	4
Scores	4
Test Bias	4
Deception	3
Identification	3
Prediction	3
Psychometrics	3
Achievement Gains	2
Comparative Analysis	2
Constructed Response	2
Correlation	2
Data	2
Diagnostic Tests	2
Epistemology	2
Estimation (Mathematics)	2
More ▼

Source

Journal of Educational and…	6
ETS Research Report Series	3
Grantee Submission	3
Applied Psychological…	2
Educational Testing Service	1
Educational and Psychological…	1
Journal of Educational…	1
Measurement:…	1

Author

Sinharay, Sandip	20
Johnson, Matthew S.	8
Dorans, Neil J.	3
Blew, Edwin O.	2
Grant, Mary C.	2
Williamson, David M.	2
Almond, Russell	1
Almond, Russell G.	1
Bejar, Isaac I.	1
Johnson, Matthew	1
Knorr, Colleen M.	1
Stern, Hal S.	1
Yan, Duanli	1
More ▼

Publication Type

Journal Articles	14
Reports - Research	13
Reports - Descriptive	3
Reports - Evaluative	3
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

High Schools	2
Middle Schools	2
Higher Education	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Pre Professional Skills Tests	2
Graduate Record Examinations	1

What Works Clearinghouse Rating

Showing 1 to 15 of 20 results Save | Export

The Use of the Posterior Probability in Score Differencing

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip; Johnson, Matthew S. – Journal of Educational and Behavioral Statistics, 2021

Score differencing is one of the six categories of statistical methods used to detect test fraud (Wollack & Schoenig, 2018) and involves the testing of the null hypothesis that the performance of an examinee is similar over two item sets versus the alternative hypothesis that the performance is better on one of the item sets. We suggest, to…

Descriptors: Probability, Bayesian Statistics, Cheating, Statistical Analysis

The Use of the Posterior Probability in Score Differencing

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip; Johnson, Matthew S. – Grantee Submission, 2021

Score differencing is one of six categories of statistical methods used to detect test fraud (Wollack & Schoenig, 2018) and involves the testing of the null hypothesis that the performance of an examinee is similar over two item sets versus the alternative hypothesis that the performance is better on one of the item sets. We suggest, to…

Descriptors: Probability, Bayesian Statistics, Cheating, Statistical Analysis

Detecting Test Fraud Using Bayes Factors

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip; Johnson, Matthew S. – Grantee Submission, 2019

According to Wollack and Schoenig (2018), score differencing is one of six types of statistical methods used to detect test fraud. In this paper, we suggested the use of Bayes factors (e.g., Kass & Raftery, 1995) for score differencing. A simulation study shows that the suggested approach performs slightly better than an existing frequentist…

Descriptors: Cheating, Deception, Statistical Analysis, Bayesian Statistics

Application of Bayesian Methods for Detecting Fraudulent Behavior on Tests

Peer reviewed

Direct link

Sinharay, Sandip – Measurement: Interdisciplinary Research and Perspectives, 2018

Producers and consumers of test scores are increasingly concerned about fraudulent behavior before and during the test. There exist several statistical or psychometric methods for detecting fraudulent behavior on tests. This paper provides a review of the Bayesian approaches among them. Four hitherto-unpublished real data examples are provided to…

Descriptors: Ethics, Cheating, Student Behavior, Bayesian Statistics

Application of Bayesian Methods for Detecting Fraudulent Behavior on Tests

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip – Grantee Submission, 2018

Descriptors: Ethics, Cheating, Student Behavior, Bayesian Statistics

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

Two Simple Approaches to Overcome a Problem with the Mantel-Haenszel Statistic: Comments on Wang, Bradlow, Wainer, and Muller (2008)

Peer reviewed

Direct link

Sinharay, Sandip; Dorans, Neil J. – Journal of Educational and Behavioral Statistics, 2010

The Mantel-Haenszel (MH) procedure (Mantel and Haenszel) is a popular method for estimating and testing a common two-factor association parameter in a 2 x 2 x K table. Holland and Holland and Thayer described how to use the procedure to detect differential item functioning (DIF) for tests with dichotomously scored items. Wang, Bradlow, Wainer, and…

Descriptors: Test Bias, Statistical Analysis, Computation, Bayesian Statistics

Using Past Data to Enhance Small Sample DIF Estimation: A Bayesian Approach

Peer reviewed

Direct link

Sinharay, Sandip; Dorans, Neil J.; Grant, Mary C.; Blew, Edwin O. – Journal of Educational and Behavioral Statistics, 2009

Test administrators often face the challenge of detecting differential item functioning (DIF) with samples of size smaller than that recommended by experts. A Bayesian approach can incorporate, in the form of a prior distribution, existing information on the inference problem at hand, which yields more stable estimation, especially for small…

Descriptors: Test Bias, Computation, Bayesian Statistics, Data

Calibration of Polytomous Item Families Using Bayesian Hierarchical Modeling

Peer reviewed

Direct link

Johnson, Matthew S.; Sinharay, Sandip – Applied Psychological Measurement, 2005

For complex educational assessments, there is an increasing use of item families, which are groups of related items. Calibration or scoring in an assessment involving item families requires models that can take into account the dependence structure inherent among the items that belong to the same item family. This article extends earlier works in…

Descriptors: National Competency Tests, Markov Processes, Bayesian Statistics

Assessing Fit of Cognitive Diagnostic Models: A Case Study

Peer reviewed

Direct link

Sinharay, Sandip; Almond, Russell G. – Educational and Psychological Measurement, 2007

A cognitive diagnostic model uses information from educational experts to describe the relationships between item performances and posited proficiencies. When the cognitive relationships can be described using a fully Bayesian model, Bayesian model checking procedures become available. Checking models tied to cognitive theory of the domains…

Descriptors: Epistemology, Clinical Diagnosis, Job Training, Item Response Theory

Posterior Predictive Assessment of Item Response Theory Models

Peer reviewed

Direct link

Sinharay, Sandip; Johnson, Matthew S.; Stern, Hal S. – Applied Psychological Measurement, 2006

Model checking in item response theory (IRT) is an underdeveloped area. There is no universally accepted tool for checking IRT models. The posterior predictive model-checking method is a popular Bayesian model-checking tool because it has intuitive appeal, is simple to apply, has a strong theoretical basis, and can provide graphical or numerical…

Descriptors: Predictive Measurement, Item Response Theory, Bayesian Statistics, Models

Assessing Fit of Unidimensional Item Response Theory Models Using a Bayesian Approach

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational Measurement, 2005

Even though Bayesian estimation has recently become quite popular in item response theory (IRT), there is a lack of works on model checking from a Bayesian perspective. This paper applies the posterior predictive model checking (PPMC) method (Guttman, 1967; Rubin, 1984), a popular Bayesian model checking tool, to a number of real applications of…

Descriptors: Measurement Techniques, Item Response Theory, Bayesian Statistics, Models

Calibrating Item Families and Summarizing the Results Using Family Expected Response Functions

Peer reviewed

Direct link

Sinharay, Sandip; Johnson, Matthew S.; Williamson, David M. – Journal of Educational and Behavioral Statistics, 2003

Item families, which are groups of related items, are becoming increasingly popular in complex educational assessments. For example, in automatic item generation (AIG) systems, a test may consist of multiple items generated from each of a number of item models. Item calibration or scoring for such an assessment requires fitting models that can…

Descriptors: Test Items, Markov Processes, Educational Testing, Probability

Using Past Data to Enhance Small-Sample DIF Estimation: A Bayesian Approach. Research Report. ETS RR-06-09

Peer reviewed
PDF on ERIC

Download full text

Sinharay, Sandip; Dorans, Neil J.; Grant, Mary C.; Blew, Edwin O.; Knorr, Colleen M. – ETS Research Report Series, 2006

The application of the Mantel-Haenszel test statistic (and other popular DIF-detection methods) to determine DIF requires large samples, but test administrators often need to detect DIF with small samples. There is no universally agreed upon statistical approach for performing DIF analysis with small samples; hence there is substantial scope of…

Descriptors: Test Bias, Computation, Sample Size, Bayesian Statistics

Calibration of Automatically Generated Items Using Bayesian Hierarchical Modeling.

Download full text

Johnson, Matthew S.; Sinharay, Sandip – 2003

For complex educational assessments, there is an increasing use of "item families," which are groups of related items. However, calibration or scoring for such an assessment requires fitting models that take into account the dependence structure inherent among the items that belong to the same item family. C. Glas and W. van der Linden…

Descriptors: Bayesian Statistics, Constructed Response, Educational Assessment, Estimation (Mathematics)

Previous Page | Next Page »

Pages: 1 | 2