Showing all 10 results
Sinharay, Sandip; Johnson, Matthew S. – Journal of Educational and Behavioral Statistics, 2021
Score differencing is one of the six categories of statistical methods used to detect test fraud (Wollack & Schoenig, 2018) and involves the testing of the null hypothesis that the performance of an examinee is similar over two item sets versus the alternative hypothesis that the performance is better on one of the item sets. We suggest, to…
Descriptors: Probability, Bayesian Statistics, Cheating, Statistical Analysis
Sinharay, Sandip; Johnson, Matthew S. – Grantee Submission, 2021
Score differencing is one of six categories of statistical methods used to detect test fraud (Wollack & Schoenig, 2018) and involves the testing of the null hypothesis that the performance of an examinee is similar over two item sets versus the alternative hypothesis that the performance is better on one of the item sets. We suggest, to…
Descriptors: Probability, Bayesian Statistics, Cheating, Statistical Analysis
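As a minimal illustration of the score-differencing idea described in the two entries above, the sketch below tests whether a hypothetical examinee performs better on one item set than on another, using a one-sided two-proportion z-test. The Bayesian tests the authors propose are not reproduced here; the choice of test and the item scores are assumptions made purely for illustration.

```python
# Minimal sketch of the score-differencing idea: test whether an examinee's
# performance on one item set (e.g., a flagged subset) is better than on the
# rest of the test.  A simple two-proportion z-test is used for illustration;
# the papers above develop Bayesian hypothesis tests instead.
# All item scores below are hypothetical.
from math import sqrt, erfc

def score_differencing_z(scores_set1, scores_set2):
    """One-sided z-test of H0: equal proportion correct on the two item sets
    vs H1: higher proportion correct on set 2."""
    n1, n2 = len(scores_set1), len(scores_set2)
    p1, p2 = sum(scores_set1) / n1, sum(scores_set2) / n2
    p_pool = (sum(scores_set1) + sum(scores_set2)) / (n1 + n2)
    se = sqrt(p_pool * (1 - p_pool) * (1 / n1 + 1 / n2))
    z = (p2 - p1) / se
    p_value = 0.5 * erfc(z / sqrt(2))   # upper-tail normal probability
    return z, p_value

# Hypothetical examinee: 40% correct on the operational items,
# 90% correct on the potentially compromised subset.
operational = [1, 0, 0, 1, 0, 1, 0, 0, 1, 0] * 2
flagged = [1, 1, 1, 0, 1, 1, 1, 1, 1, 1]
z, p = score_differencing_z(operational, flagged)
print(f"z = {z:.2f}, one-sided p = {p:.4f}")
```

A large z with a small one-sided p-value flags the kind of performance difference that score-differencing methods are designed to detect.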
Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020
One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…
Descriptors: Reliability, Probability, Skill Development, Classification
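As context for the "vector of posterior means of the skill mastery indicators" mentioned above, the sketch below computes that reported score from posterior draws of binary mastery indicators. The draws are simulated for a single hypothetical examinee rather than taken from a fitted diagnostic classification model, and the reliability measures discussed in the article are not implemented.

```python
# Illustration of the reported score discussed above: the posterior mean of
# each binary skill-mastery indicator, computed from posterior draws.  The
# draws are simulated for one hypothetical examinee; in practice they would
# come from a fitted diagnostic classification model (DCM).
import numpy as np

rng = np.random.default_rng(seed=1)
n_draws, n_skills = 2000, 4

# Hypothetical marginal posterior mastery probabilities for one examinee.
true_posterior = np.array([0.95, 0.60, 0.15, 0.80])

# Simulated posterior draws of the 0/1 mastery indicators (draws x skills).
draws = rng.binomial(1, true_posterior, size=(n_draws, n_skills))

# The reported score: the vector of posterior means of the indicators.
posterior_means = draws.mean(axis=0)
print("Posterior mastery probabilities:", np.round(posterior_means, 3))
```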
Sinharay, Sandip – Educational and Psychological Measurement, 2022
Administrative problems such as computer malfunction and power outages occasionally lead to missing item scores and hence to incomplete data on mastery tests such as the AP and U.S. Medical Licensing examinations. Investigators are often interested in estimating the passing probabilities of examinees with incomplete data on such mastery tests.…
Descriptors: Mastery Tests, Computer Assisted Testing, Probability, Test Wiseness
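One way to make the estimation problem above concrete: if the examinee's ability estimate is treated as fixed, the missing responses can be simulated under an item response model and the probability of passing approximated by Monte Carlo. The sketch below assumes Rasch item parameters, a hypothetical ability estimate, and a hypothetical cut score; it is not the specific approach studied in the article.

```python
# Sketch of one way to estimate an examinee's probability of passing when
# some item scores are missing: hold the ability estimate fixed, simulate
# the missing responses under a Rasch model, and take the Monte Carlo
# proportion of simulated total scores at or above the cut score.
# The ability estimate, item difficulties, and cut score are hypothetical.
import numpy as np

rng = np.random.default_rng(seed=2)

def prob_passing(theta, observed_total, missing_difficulties, cut, n_sim=10_000):
    """Monte Carlo estimate of P(total score >= cut) given missing items."""
    p_correct = 1 / (1 + np.exp(-(theta - missing_difficulties)))   # Rasch IRF
    simulated = rng.binomial(1, p_correct, size=(n_sim, len(missing_difficulties)))
    totals = observed_total + simulated.sum(axis=1)
    return (totals >= cut).mean()

theta_hat = 0.4                                     # ability from the observed items
missing_b = np.array([-0.5, 0.0, 0.3, 1.1, 1.5])    # difficulties of the missing items
print(prob_passing(theta_hat, observed_total=22, missing_difficulties=missing_b, cut=25))
```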
Sinharay, Sandip – Measurement: Interdisciplinary Research and Perspectives, 2018
Producers and consumers of test scores are increasingly concerned about fraudulent behavior before and during the test. There exist several statistical or psychometric methods for detecting fraudulent behavior on tests. This paper provides a review of the Bayesian approaches among them. Four hitherto-unpublished real data examples are provided to…
Descriptors: Ethics, Cheating, Student Behavior, Bayesian Statistics
Sinharay, Sandip – Grantee Submission, 2018
Producers and consumers of test scores are increasingly concerned about fraudulent behavior before and during the test. There exist several statistical or psychometric methods for detecting fraudulent behavior on tests. This paper provides a review of the Bayesian approaches among them. Four hitherto-unpublished real data examples are provided to…
Descriptors: Ethics, Cheating, Student Behavior, Bayesian Statistics
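To give a flavor of the Bayesian perspective reviewed in the two entries above, the sketch below computes the posterior probability that an examinee's true success rate on a set of flagged items exceeds the rate on the remaining items, using Beta(1, 1) priors and Monte Carlo draws from the Beta posteriors. This generic illustration is not one of the specific methods reviewed in the paper, and the counts are hypothetical.

```python
# Minimal sketch of a Bayesian flavor of fraud detection: the posterior
# probability that an examinee's success rate on flagged items exceeds the
# rate on the other items, under independent Beta(1, 1) priors.
# Counts are hypothetical; this is not a method from the paper above.
import numpy as np

rng = np.random.default_rng(seed=6)

def posterior_prob_higher(k_flagged, n_flagged, k_other, n_other, n_draws=100_000):
    """P(rate on flagged items > rate on other items | data), Beta(1,1) priors."""
    draws_flagged = rng.beta(1 + k_flagged, 1 + n_flagged - k_flagged, n_draws)
    draws_other = rng.beta(1 + k_other, 1 + n_other - k_other, n_draws)
    return np.mean(draws_flagged > draws_other)

# Hypothetical examinee: 14/15 correct on flagged items, 20/45 on the rest.
print(posterior_prob_higher(14, 15, 20, 45))
```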
Sinharay, Sandip; Wan, Ping; Whitaker, Mike; Kim, Dong-In; Zhang, Litong; Choi, Seung W. – Journal of Educational Measurement, 2014
With an increase in the number of online tests, interruptions during testing due to unexpected technical issues seem unavoidable. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. There is a lack of research on this…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Regression (Statistics)
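A simple regression-based check in the spirit of the approaches the article examines is sketched below: fit a regression of post-interruption section scores on pre-interruption scores using non-interrupted examinees, then inspect the standardized residuals of the interrupted examinees. All data are simulated, and the article's exact methodology is not reproduced.

```python
# Sketch of a regression-based check of interruption impact: regress the
# post-interruption section score on the pre-interruption section score for
# non-interrupted examinees, then flag interrupted examinees with unusually
# negative standardized residuals.  All data are simulated and hypothetical.
import numpy as np

rng = np.random.default_rng(seed=3)

# Simulated non-interrupted group.
pre = rng.normal(50, 10, size=500)
post = 0.8 * pre + rng.normal(0, 5, size=500)

# Fit post ~ pre by ordinary least squares.
slope, intercept = np.polyfit(pre, post, deg=1)
resid_sd = np.std(post - (intercept + slope * pre), ddof=2)

# Interrupted examinees: hypothetical observed scores.
pre_int = np.array([55.0, 42.0, 61.0])
post_int = np.array([38.0, 35.0, 51.0])
z_resid = (post_int - (intercept + slope * pre_int)) / resid_sd
print("Standardized residuals:", np.round(z_resid, 2))  # large negative values suggest impact
```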
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015
Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…
Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics
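For reference, the sketch below computes the lz person-fit statistic for dichotomous items under a two-parameter logistic model; the article's extension to mixed-format tests is not reproduced. The item parameters, ability, and responses are hypothetical.

```python
# Sketch of the lz person-fit statistic for dichotomous items under a 2PL
# model (standardized log-likelihood statistic).  The article above extends
# person-fit statistics of this kind to mixed-format tests; that extension
# is not attempted here.  Item parameters, ability, and responses are
# hypothetical.
import numpy as np

def lz_statistic(responses, theta, a, b):
    """Standardized log-likelihood person-fit statistic lz."""
    p = 1 / (1 + np.exp(-a * (theta - b)))          # 2PL response probabilities
    loglik = np.sum(responses * np.log(p) + (1 - responses) * np.log(1 - p))
    expected = np.sum(p * np.log(p) + (1 - p) * np.log(1 - p))
    variance = np.sum(p * (1 - p) * np.log(p / (1 - p)) ** 2)
    return (loglik - expected) / np.sqrt(variance)

a = np.array([1.0, 1.2, 0.8, 1.5, 1.1, 0.9])
b = np.array([-1.0, -0.5, 0.0, 0.5, 1.0, 1.5])
responses = np.array([1, 1, 0, 1, 0, 0])
print(round(lz_statistic(responses, theta=0.2, a=a, b=b), 3))  # large negative => misfit
```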
Sinharay, Sandip; Holland, Paul W. – Journal of Educational Measurement, 2010
The nonequivalent groups with anchor test (NEAT) design involves missing data that are missing by design. Three equating methods that can be used with a NEAT design are the frequency estimation equipercentile equating method, the chain equipercentile equating method, and the item-response-theory observed-score-equating method. We suggest an…
Descriptors: Equated Scores, Item Response Theory, Comparative Analysis, Evaluation
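Of the three equating methods named above, chained equipercentile equating is the simplest to sketch: form X is equated to the anchor in the population that took X, and the resulting anchor scores are equated to form Y in the population that took Y. The sketch below uses unsmoothed empirical distributions and simulated data, so it is illustrative only and does not reproduce the article's comparison.

```python
# Sketch of chained equipercentile equating under a NEAT design: X -> anchor A
# in the population that took X, then A -> Y in the population that took Y.
# Presmoothing and the other two methods named above are omitted; all score
# distributions are simulated and hypothetical.
import numpy as np

rng = np.random.default_rng(seed=4)

def equipercentile_map(scores, from_dist, to_dist):
    """Equipercentile conversion: match percentile ranks across distributions."""
    ranks = [100 * np.mean(from_dist <= s) for s in np.atleast_1d(scores)]
    return np.percentile(to_dist, ranks)

# Simulated NEAT data: population 1 takes form X and anchor A,
# population 2 takes form Y and anchor A.
x_pop1 = rng.normal(30, 6, 2000)
a_pop1 = rng.normal(15, 3, 2000)
y_pop2 = rng.normal(32, 6, 2000)
a_pop2 = rng.normal(16, 3, 2000)

x_values = np.array([20.0, 30.0, 40.0])
a_equiv = equipercentile_map(x_values, x_pop1, a_pop1)   # X -> A in population 1
y_equiv = equipercentile_map(a_equiv, a_pop2, y_pop2)    # A -> Y in population 2
print("X scores", x_values, "equate to Y scores", np.round(y_equiv, 1))
```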
Sinharay, Sandip; Johnson, Matthew S.; Williamson, David M. – Journal of Educational and Behavioral Statistics, 2003
Item families, which are groups of related items, are becoming increasingly popular in complex educational assessments. For example, in automatic item generation (AIG) systems, a test may consist of multiple items generated from each of a number of item models. Item calibration or scoring for such an assessment requires fitting models that can…
Descriptors: Test Items, Markov Processes, Educational Testing, Probability
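The hierarchical structure that motivates such models can be illustrated as follows: items generated from the same item model (family) have difficulties that vary around a family-level difficulty. The simulation and the naive moment-based recovery below are assumptions made for illustration only; the article itself fits Bayesian hierarchical models for item families, which the sketch does not attempt.

```python
# Sketch of the hierarchical structure used when calibrating item families:
# sibling items from one item model share a family-level difficulty, with
# item-level variation around it.  The simulation and the rough recovery
# below are illustrative only; they are not the article's estimation method.
import numpy as np

rng = np.random.default_rng(seed=5)

n_families, items_per_family, n_examinees = 5, 8, 1000
family_mean_b = rng.normal(0.0, 1.0, n_families)              # family-level difficulties
item_b = rng.normal(family_mean_b[:, None], 0.3,               # sibling items vary
                    size=(n_families, items_per_family))        # around the family mean

theta = rng.normal(0.0, 1.0, n_examinees)                       # examinee abilities

# Rasch response probabilities and simulated 0/1 responses,
# shape (examinees, families, items within family).
p = 1 / (1 + np.exp(-(theta[:, None, None] - item_b[None, :, :])))
responses = rng.binomial(1, p)

# Rough recovery of each family's mean difficulty from item proportions correct:
# the negative logit of the proportion correct tracks the family difficulty,
# attenuated toward zero because abilities vary across examinees.
prop_correct = responses.mean(axis=0)                           # (families, items)
est_family_b = -np.log(prop_correct / (1 - prop_correct)).mean(axis=1)
print(np.round(np.c_[family_mean_b, est_family_b], 2))
```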