ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	17

Descriptor

Regression (Statistics)	18
Item Response Theory	9
Scores	9
Comparative Analysis	5
Computer Assisted Testing	5
Models	5
National Competency Tests	5
Statistical Analysis	5
Data Analysis	4
Essays	4
Prediction	4
Scoring	4
Achievement Tests	3
Classification	3
Computation	3
Correlation	3
Goodness of Fit	3
Grade 8	3
Mathematics Tests	3
Simulation	3
Writing Tests	3
Computer Software	2
Construct Validity	2
Difficulty Level	2
Educational Assessment	2

Source

ETS Research Report Series	6
Journal of Educational and…	4
Educational Measurement:…	2
Educational Testing Service	2
Journal of Educational…	2
Applied Measurement in…	1
Large-scale Assessments in…	1

Author

Sinharay, Sandip	18
Haberman, Shelby J.	4
von Davier, Matthias	4
Attali, Yigal	2
Choi, Seung W.	2
Kim, Dong-In	2
Wan, Ping	2
Deane, Paul	1
Guo, Hongwen	1
Guo, Zhumei	1
Holland, Paul	1
Johnson, Matthew S.	1
Lee, Yi-Hsuan	1
Veldkamp, Bernard P.	1
Whitaker, Mike	1
Zhang, Litong	1
Zhang, Mo	1
van Rijn, Peter W.	1

Publication Type

Journal Articles	16
Reports - Research	14
Reports - Descriptive	2
Reports - Evaluative	2
Tests/Questionnaires	1

Education Level

Grade 8	3
Elementary Education	2
Grade 4	2
Junior High Schools	2
Middle Schools	2
Secondary Education	2
Elementary Secondary Education	1
Grade 12	1
High Schools	1
Higher Education	1
Postsecondary Education	1

Assessments and Surveys

National Assessment of…	4
Indiana Statewide Testing for…	2
Graduate Record Examinations	1
Test of English as a Foreign…	1
Trends in International…	1

Showing 1 to 15 of 18 results

Reporting Proficiency Levels for Examinees with Incomplete Data

Peer reviewed

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2022

Takers of educational tests often receive proficiency levels instead of or in addition to scaled scores. For example, proficiency levels are reported for the Advanced Placement (AP®) and U.S. Medical Licensing examinations. Technical difficulties and other unforeseen events occasionally lead to missing item scores and hence to incomplete data on…

Descriptors: Computation, Data Analysis, Educational Testing, Accuracy

Score Reporting for Examinees with Incomplete Data on Large-Scale Educational Assessments

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2021

Technical difficulties occasionally lead to missing item scores and hence to incomplete data on computerized tests. It is not straightforward to report scores to the examinees whose data are incomplete due to technical difficulties. Such reporting essentially involves imputation of missing scores. In this paper, a simulation study based on data…

Descriptors: Data Analysis, Scores, Educational Assessment, Educational Testing

Prediction of Essay Scores from Writing Process and Product Features Using Data Mining Methods

Peer reviewed

Sinharay, Sandip; Zhang, Mo; Deane, Paul – Applied Measurement in Education, 2019

Analysis of keystroke logging data is of increasing interest, as evident from a substantial amount of recent research on the topic. Some of the research on keystroke logging data has focused on the prediction of essay scores from keystroke logging features, but linear regression is the only prediction method that has been used in this research.…

Descriptors: Scores, Prediction, Writing Processes, Data Analysis

An NCME Instructional Module on Data Mining Methods for Classification and Regression

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2016

Data mining methods for classification and regression are becoming increasingly popular in various scientific fields. However, these methods have not been explored much in educational measurement. This module first provides a review, which should be accessible to a wide audience in education measurement, of some of these methods. The module then…

Descriptors: Data Collection, Information Retrieval, Classification, Regression (Statistics)

Assessing Individual-Level Impact of Interruptions during Online Testing

Peer reviewed

Sinharay, Sandip; Wan, Ping; Choi, Seung W.; Kim, Dong-In – Journal of Educational Measurement, 2015

With an increase in the number of online tests, the number of interruptions during testing due to unexpected technical issues seems to be on the rise. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. Researchers such as…

Descriptors: Computer Assisted Testing, Testing Problems, Scores, Statistical Analysis

Determining the Overall Impact of Interruptions during Online Testing

Peer reviewed

Sinharay, Sandip; Wan, Ping; Whitaker, Mike; Kim, Dong-In; Zhang, Litong; Choi, Seung W. – Journal of Educational Measurement, 2014

With an increase in the number of online tests, interruptions during testing due to unexpected technical issues seem unavoidable. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. There is a lack of research on this…

Descriptors: Computer Assisted Testing, Testing Problems, Scores, Regression (Statistics)

Assessment of Fit of Item Response Theory Models Used in Large-Scale Educational Survey Assessments

Peer reviewed

van Rijn, Peter W.; Sinharay, Sandip; Haberman, Shelby J.; Johnson, Matthew S. – Large-scale Assessments in Education, 2016

Latent regression models are used for score-reporting purposes in large-scale educational survey assessments such as the National Assessment of Educational Progress (NAEP) and Trends in International Mathematics and Science Study (TIMSS). One component of these models is based on item response theory. While there exists some research on assessment…

Descriptors: Goodness of Fit, Item Response Theory, Regression (Statistics), National Competency Tests

Automated Trait Scores for "TOEFL"® Writing Tasks. Research Report. ETS RR-15-14

Peer reviewed

Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015

The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score. The four trait scores are word choice, grammatical…

Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)

Statistical Procedures to Evaluate Quality of Scale Anchoring. Research Report. ETS RR-11-02

Haberman, Shelby J.; Sinharay, Sandip; Lee, Yi-Hsuan – Educational Testing Service, 2011

Providing information to test takers and test score users about the abilities of test takers at different score levels has been a persistent problem in educational and psychological measurement (Carroll, 1993). Scale anchoring (Beaton & Allen, 1992), a technique that describes what students at different points on a score scale know and can do,…

Descriptors: Statistical Analysis, Scores, Regression (Statistics), Item Response Theory

Automated Trait Scores for "GRE"® Writing Tasks. Research Report. ETS RR-15-15

Peer reviewed

Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015

The "e-rater"® automated essay scoring system is used operationally in the scoring of the argument and issue tasks that form the Analytical Writing measure of the "GRE"® General Test. This study explored the value added of reporting 4 trait scores for each of these 2 tasks over the total e-rater score.…

Descriptors: Scores, Computer Assisted Testing, Computer Software, Grammar

Nonparametric Item Response Curve Estimation with Correction for Measurement Error

Peer reviewed

Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011

Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…

Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement

The Application of the Cumulative Logistic Regression Model to Automated Essay Scoring

Peer reviewed

Haberman, Shelby J.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2010

Most automated essay scoring programs use a linear regression model to predict an essay score from several essay features. This article applied a cumulative logit model instead of the linear regression model to automated essay scoring. Comparison of the performances of the linear regression model and the cumulative logit model was performed on a…

Descriptors: Scoring, Regression (Statistics), Essays, Computer Software

Assessing Fit of Latent Regression Models. Research Report. ETS RR-09-50

Peer reviewed

Sinharay, Sandip; Guo, Zhumei; von Davier, Matthias; Veldkamp, Bernard P. – ETS Research Report Series, 2009

The reporting methods used in large-scale educational assessments such as the National Assessment of Educational Progress (NAEP) rely on a "latent regression model". There is a lack of research on the assessment of fit of latent regression models. This paper suggests a simulation-based model-fit technique to assess the fit of such…

Descriptors: Regression (Statistics), Models, Goodness of Fit, National Competency Tests

Stochastic Approximation Methods for Latent Regression Item Response Models

Peer reviewed

von Davier, Matthias; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2010

This article presents an application of a stochastic approximation expectation maximization (EM) algorithm using a Metropolis-Hastings (MH) sampler to estimate the parameters of an item response latent regression model. Latent regression item response models are extensions of item response theory (IRT) to a latent variable model with covariates…

Descriptors: Item Response Theory, Statistical Analysis, Regression (Statistics), Models

Sample-Size Requirements for Automated Essay Scoring. Research Report. ETS RR-08-32

Peer reviewed

Haberman, Shelby J.; Sinharay, Sandip – ETS Research Report Series, 2008

Sample-size requirements were considered for automated essay scoring in cases in which the automated essay score estimates the score provided by a human rater. Analysis considered both cases in which an essay prompt is examined in isolation and those in which a family of essay prompts is studied. In typical cases in which content analysis is not…

Descriptors: Sample Size, Scoring, Essays, Automation
