Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 17 |
Descriptor
Regression (Statistics) | 18 |
Item Response Theory | 9 |
Scores | 9 |
Comparative Analysis | 5 |
Computer Assisted Testing | 5 |
Models | 5 |
National Competency Tests | 5 |
Statistical Analysis | 5 |
Data Analysis | 4 |
Essays | 4 |
Prediction | 4 |
More ▼ |
Source
ETS Research Report Series | 6 |
Journal of Educational and… | 4 |
Educational Measurement:… | 2 |
Educational Testing Service | 2 |
Journal of Educational… | 2 |
Applied Measurement in… | 1 |
Large-scale Assessments in… | 1 |
Author
Sinharay, Sandip | 18 |
Haberman, Shelby J. | 4 |
von Davier, Matthias | 4 |
Attali, Yigal | 2 |
Choi, Seung W. | 2 |
Kim, Dong-In | 2 |
Wan, Ping | 2 |
Deane, Paul | 1 |
Guo, Hongwen | 1 |
Guo, Zhumei | 1 |
Holland, Paul | 1 |
More ▼ |
Publication Type
Journal Articles | 16 |
Reports - Research | 14 |
Reports - Descriptive | 2 |
Reports - Evaluative | 2 |
Tests/Questionnaires | 1 |
Education Level
Grade 8 | 3 |
Elementary Education | 2 |
Grade 4 | 2 |
Junior High Schools | 2 |
Middle Schools | 2 |
Secondary Education | 2 |
Elementary Secondary Education | 1 |
Grade 12 | 1 |
High Schools | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
More ▼ |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 4 |
Indiana Statewide Testing for… | 2 |
Graduate Record Examinations | 1 |
Test of English as a Foreign… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2022
Takers of educational tests often receive proficiency levels instead of or in addition to scaled scores. For example, proficiency levels are reported for the Advanced Placement (AP®) and U.S. Medical Licensing examinations. Technical difficulties and other unforeseen events occasionally lead to missing item scores and hence to incomplete data on…
Descriptors: Computation, Data Analysis, Educational Testing, Accuracy
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2021
Technical difficulties occasionally lead to missing item scores and hence to incomplete data on computerized tests. It is not straightforward to report scores to the examinees whose data are incomplete due to technical difficulties. Such reporting essentially involves imputation of missing scores. In this paper, a simulation study based on data…
Descriptors: Data Analysis, Scores, Educational Assessment, Educational Testing
Sinharay, Sandip; Zhang, Mo; Deane, Paul – Applied Measurement in Education, 2019
Analysis of keystroke logging data is of increasing interest, as evident from a substantial amount of recent research on the topic. Some of the research on keystroke logging data has focused on the prediction of essay scores from keystroke logging features, but linear regression is the only prediction method that has been used in this research.…
Descriptors: Scores, Prediction, Writing Processes, Data Analysis
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2016
Data mining methods for classification and regression are becoming increasingly popular in various scientific fields. However, these methods have not been explored much in educational measurement. This module first provides a review, which should be accessible to a wide audience in education measurement, of some of these methods. The module then…
Descriptors: Data Collection, Information Retrieval, Classification, Regression (Statistics)
Sinharay, Sandip; Wan, Ping; Choi, Seung W.; Kim, Dong-In – Journal of Educational Measurement, 2015
With an increase in the number of online tests, the number of interruptions during testing due to unexpected technical issues seems to be on the rise. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. Researchers such as…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Statistical Analysis
Sinharay, Sandip; Wan, Ping; Whitaker, Mike; Kim, Dong-In; Zhang, Litong; Choi, Seung W. – Journal of Educational Measurement, 2014
With an increase in the number of online tests, interruptions during testing due to unexpected technical issues seem unavoidable. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. There is a lack of research on this…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Regression (Statistics)
van Rijn, Peter W.; Sinharay, Sandip; Haberman, Shelby J.; Johnson, Matthew S. – Large-scale Assessments in Education, 2016
Latent regression models are used for score-reporting purposes in large-scale educational survey assessments such as the National Assessment of Educational Progress (NAEP) and Trends in International Mathematics and Science Study (TIMSS). One component of these models is based on item response theory. While there exists some research on assessment…
Descriptors: Goodness of Fit, Item Response Theory, Regression (Statistics), National Competency Tests
Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015
The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score.The four trait scores are word choice, grammatical…
Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)
Haberman, Shelby J.; Sinharay, Sandip; Lee, Yi-Hsuan – Educational Testing Service, 2011
Providing information to test takers and test score users about the abilities of test takers at different score levels has been a persistent problem in educational and psychological measurement (Carroll, 1993). Scale anchoring (Beaton & Allen, 1992), a technique that describes what students at different points on a score scale know and can do,…
Descriptors: Statistical Analysis, Scores, Regression (Statistics), Item Response Theory
Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015
The "e-rater"® automated essay scoring system is used operationally in the scoring of the argument and issue tasks that form the Analytical Writing measure of the "GRE"® General Test. For each of these tasks, this study explored the value added of reporting 4 trait scores for each of these 2 tasks over the total e-rater score.…
Descriptors: Scores, Computer Assisted Testing, Computer Software, Grammar
Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011
Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…
Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement
Haberman, Shelby J.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2010
Most automated essay scoring programs use a linear regression model to predict an essay score from several essay features. This article applied a cumulative logit model instead of the linear regression model to automated essay scoring. Comparison of the performances of the linear regression model and the cumulative logit model was performed on a…
Descriptors: Scoring, Regression (Statistics), Essays, Computer Software
Sinharay, Sandip; Guo, Zhumei; von Davier, Matthias; Veldkamp, Bernard P. – ETS Research Report Series, 2009
The reporting methods used in large-scale educational assessments such as the National Assessment of Educational Progress (NAEP) rely on a "latent regression model". There is a lack of research on the assessment of fit of latent regression models. This paper suggests a simulation-based model-fit technique to assess the fit of such…
Descriptors: Regression (Statistics), Models, Goodness of Fit, National Competency Tests
von Davier, Matthias; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2010
This article presents an application of a stochastic approximation expectation maximization (EM) algorithm using a Metropolis-Hastings (MH) sampler to estimate the parameters of an item response latent regression model. Latent regression item response models are extensions of item response theory (IRT) to a latent variable model with covariates…
Descriptors: Item Response Theory, Statistical Analysis, Regression (Statistics), Models
Haberman, Shelby J.; Sinharay, Sandip – ETS Research Report Series, 2008
Sample-size requirements were considered for automated essay scoring in cases in which the automated essay score estimates the score provided by a human rater. Analysis considered both cases in which an essay prompt is examined in isolation and those in which a family of essay prompts is studied. In typical cases in which content analysis is not…
Descriptors: Sample Size, Scoring, Essays, Automation
Previous Page | Next Page »
Pages: 1 | 2