ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	13
Since 2006 (last 20 years)	40

Descriptor

Scores	41
Item Response Theory	13
Comparative Analysis	12
Statistical Analysis	12
Correlation	10
Prediction	9
Regression (Statistics)	9
Test Items	9
Test Theory	8
Computer Assisted Testing	7
Factor Analysis	7
English (Second Language)	6
Language Tests	6
Licensing Examinations…	6
Psychometrics	6
Reliability	6
Data Analysis	5
Educational Testing	5
Goodness of Fit	5
Second Language Learning	5
Simulation	5
Tests	5
Bayesian Statistics	4
Cheating	4
Models	4
More ▼

Source

Journal of Educational…	9
Educational Measurement:…	7
ETS Research Report Series	6
Educational Testing Service	3
Grantee Submission	3
Journal of Educational and…	3
Applied Measurement in…	2
Language Testing	2
Measurement:…	2
Educational and Psychological…	1
International Journal of…	1
Multivariate Behavioral…	1
Psychometrika	1
More ▼

Author

Sinharay, Sandip	41
Haberman, Shelby J.	10
Puhan, Gautam	5
Haberman, Shelby	4
Attali, Yigal	2
Choi, Seung W.	2
Feng, Ying	2
Johnson, Matthew S.	2
Kim, Dong-In	2
Larkin, Kevin	2
Lee, Yi-Hsuan	2
Powers, Donald E.	2
Saldivia, Luis	2
Sawaki, Yasuyo	2
Simpson, Annabelle	2
Wan, Ping	2
Weng, Vincent	2
Boughton, Keith	1
Deane, Paul	1
Eckerly, Carol	1
Ginuta, Anthony	1
Giunta, Anthony	1
Gorney, Kylie	1
Guo, Hongwen	1
Holland, Paul W.	1
More ▼

Publication Type

Journal Articles	35
Reports - Research	24
Reports - Evaluative	11
Reports - Descriptive	4
Opinion Papers	2
Tests/Questionnaires	2
Information Analyses	1

Education Level

High Schools	3
Higher Education	2
Postsecondary Education	2
Secondary Education	2
Elementary Education	1
Elementary Secondary Education	1

Audience

Location

Chile	2
Colombia	2
Ecuador	2

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	3
Indiana Statewide Testing for…	2
Graduate Record Examinations	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 41 results Save | Export

Measuring the Uncertainty of Imputed Scores

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational Measurement, 2023

Technical difficulties and other unforeseen events occasionally lead to incomplete data on educational tests, which necessitates the reporting of imputed scores to some examinees. While there exist several approaches for reporting imputed scores, there is a lack of any guidance on the reporting of the uncertainty of imputed scores. In this paper,…

Descriptors: Evaluation Methods, Scores, Standardized Tests, Simulation

Reporting Pass-Fail Decisions to Examinees with Incomplete Data: A Commentary on Feinberg (2021)

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022

Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing examination. Feinberg compared four approaches for reporting pass-fail decisions to the examinees with incomplete data on credentialing…

Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items

Using Item Scores and Distractors to Detect Item Compromise and Preknowledge

Peer reviewed

Direct link

Gorney, Kylie; Wollack, James A.; Sinharay, Sandip; Eckerly, Carol – Journal of Educational and Behavioral Statistics, 2023

Any time examinees have had access to items and/or answers prior to taking a test, the fairness of the test and validity of test score interpretations are threatened. Therefore, there is a high demand for procedures to detect both compromised items (CI) and examinees with preknowledge (EWP). In this article, we develop a procedure that uses item…

Descriptors: Scores, Test Validity, Test Items, Prior Learning

Score Reporting for Examinees with Incomplete Data on Large-Scale Educational Assessments

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2021

Technical difficulties occasionally lead to missing item scores and hence to incomplete data on computerized tests. It is not straightforward to report scores to the examinees whose data are incomplete due to technical difficulties. Such reporting essentially involves imputation of missing scores. In this paper, a simulation study based on data…

Descriptors: Data Analysis, Scores, Educational Assessment, Educational Testing

A New Interpretation of Augmented Subscores and Their Added Value in Terms of Parallel Forms

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational Measurement, 2018

The value-added method of Haberman is arguably one of the most popular methods to evaluate the quality of subscores. The method is based on the classical test theory and deems a subscore to be of added value if the subscore predicts the corresponding true subscore better than does the total score. Sinharay provided an interpretation of the added…

Descriptors: Scores, Value Added Models, Raw Scores, Item Response Theory

Detecting Test Fraud Using Bayes Factors

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip; Johnson, Matthew S. – Grantee Submission, 2019

According to Wollack and Schoenig (2018), score differencing is one of six types of statistical methods used to detect test fraud. In this paper, we suggested the use of Bayes factors (e.g., Kass & Raftery, 1995) for score differencing. A simulation study shows that the suggested approach performs slightly better than an existing frequentist…

Descriptors: Cheating, Deception, Statistical Analysis, Bayesian Statistics

Digital Module 07: Subscores--Evaluation and Reporting https://ncme.elevate.commpartners.com

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2019

Test score users often demand the reporting of subscores due to their potential diagnostic, remedial, and instructional benefits. Therefore, there is substantial pressure on testing programs to report subscores. However, professional standards require that subscores have to satisfy minimum quality standards before they can be reported. In this…

Descriptors: Testing, Scores, Item Response Theory, Evaluation Methods

Prediction of Essay Scores from Writing Process and Product Features Using Data Mining Methods

Peer reviewed

Direct link

Sinharay, Sandip; Zhang, Mo; Deane, Paul – Applied Measurement in Education, 2019

Analysis of keystroke logging data is of increasing interest, as evident from a substantial amount of recent research on the topic. Some of the research on keystroke logging data has focused on the prediction of essay scores from keystroke logging features, but linear regression is the only prediction method that has been used in this research.…

Descriptors: Scores, Prediction, Writing Processes, Data Analysis

Analysis of Added Value of Subscores with Respect to Classification

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational Measurement, 2014

Brennan noted that users of test scores often want (indeed, demand) that subscores be reported, along with total test scores, for diagnostic purposes. Haberman suggested a method based on classical test theory (CTT) to determine if subscores have added value over the total score. One way to interpret the method is that a subscore has added value…

Descriptors: Scores, Test Theory, Classification, Cutting Scores

Application of Bayesian Methods for Detecting Fraudulent Behavior on Tests

Peer reviewed

Direct link

Sinharay, Sandip – Measurement: Interdisciplinary Research and Perspectives, 2018

Producers and consumers of test scores are increasingly concerned about fraudulent behavior before and during the test. There exist several statistical or psychometric methods for detecting fraudulent behavior on tests. This paper provides a review of the Bayesian approaches among them. Four hitherto-unpublished real data examples are provided to…

Descriptors: Ethics, Cheating, Student Behavior, Bayesian Statistics

Application of Bayesian Methods for Detecting Fraudulent Behavior on Tests

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip – Grantee Submission, 2018

Descriptors: Ethics, Cheating, Student Behavior, Bayesian Statistics

The Use of Item Scores and Response Times to Detect Examinees Who May Have Benefited from Item Preknowledge

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip; Johnson, Matthew S. – Grantee Submission, 2019

According to Wollack and Schoenig (2018), benefitting from item preknowledge is one of the three broad types of test fraud that occur in educational assessments. We use tools from constrained statistical inference to suggest a new statistic that is based on item scores and response times and can be used to detect the examinees who may have…

Descriptors: Scores, Test Items, Reaction Time, Cheating

Too Simple to Be Useful: A Comment on Feinberg and Wainer (2014)

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby; Boughton, Keith – Educational Measurement: Issues and Practice, 2015

Feinberg and Wainer (2014) provided a simple equation to approximate/predict a subscore's value. The purpose of this note is to point out that their equation is often inaccurate in that it does not always predict a subscore's value correctly. Therefore, the utility of their simple equation is not clear.

Descriptors: Equations (Mathematics), Scores, Prediction, Accuracy

How to Compare Parametric and Nonparametric Person-Fit Statistics Using Real Data

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational Measurement, 2017

Person-fit assessment (PFA) is concerned with uncovering atypical test performance as reflected in the pattern of scores on individual items on a test. Existing person-fit statistics (PFSs) include both parametric and nonparametric statistics. Comparison of PFSs has been a popular research topic in PFA, but almost all comparisons have employed…

Descriptors: Goodness of Fit, Testing, Test Items, Scores

A Note on Assessing the Added Value of Subscores

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2014

Brennan (Brennan, R. L., 2012) noted that users of test scores often want (indeed, demand) that subscores be reported, along with total test scores, for diagnostic purposes. Haberman (Haberman, S. J., 2008) suggested a method based on classical test theory (CTT) to determine if subscores have added value over the total score. According to this…

Descriptors: Scores, Test Theory, Test Interpretation

Previous Page | Next Page »

Pages: 1 | 2 | 3