Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 6 |
Descriptor
Evaluation Methods | 10 |
Simulation | 10 |
Statistical Distributions | 10 |
Bayesian Statistics | 3 |
Computation | 3 |
Testing Problems | 3 |
Cheating | 2 |
Correlation | 2 |
Deception | 2 |
Equations (Mathematics) | 2 |
Error Patterns | 2 |
More ▼ |
Source
Journal of Educational and… | 3 |
Grantee Submission | 2 |
Psychometrika | 2 |
National Center for Education… | 1 |
ProQuest LLC | 1 |
Author
Sinharay, Sandip | 2 |
Bloxom, Bruce | 1 |
Davey, T. C. | 1 |
Deke, John | 1 |
Dongho Shin | 1 |
Feinberg, Richard A. | 1 |
Finucane, Mariel | 1 |
Gregson, Robert A. M. | 1 |
Kistner, Emily O. | 1 |
Muller, Keith E. | 1 |
Oshima, T. C. | 1 |
More ▼ |
Publication Type
Journal Articles | 5 |
Reports - Evaluative | 4 |
Reports - Research | 3 |
Dissertations/Theses -… | 2 |
Guides - Non-Classroom | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Audience
Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
Armed Services Vocational… | 1 |
National Assessment of… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating

Dongho Shin – Grantee Submission, 2024
We consider Bayesian estimation of a hierarchical linear model (HLM) from small sample sizes. The continuous response Y and covariates C are partially observed and assumed missing at random. With C having linear effects, the HLM may be efficiently estimated by available methods. When C includes cluster-level covariates having interactive or other…
Descriptors: Bayesian Statistics, Computation, Hierarchical Linear Modeling, Data Analysis
Deke, John; Finucane, Mariel; Thal, Daniel – National Center for Education Evaluation and Regional Assistance, 2022
BASIE is a framework for interpreting impact estimates from evaluations. It is an alternative to null hypothesis significance testing. This guide walks researchers through the key steps of applying BASIE, including selecting prior evidence, reporting impact estimates, interpreting impact estimates, and conducting sensitivity analyses. The guide…
Descriptors: Bayesian Statistics, Educational Research, Data Interpretation, Hypothesis Testing
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2018
Wollack, Cohen, and Eckerly suggested the "erasure detection index" (EDI) to detect fraudulent erasures for individual examinees. Wollack and Eckerly extended the EDI to detect fraudulent erasures at the group level. The EDI at the group level was found to be slightly conservative. This article suggests two modifications of the EDI for…
Descriptors: Deception, Identification, Testing Problems, Cheating
Sinharay, Sandip – Grantee Submission, 2017
Wollack, Cohen, and Eckerly (2015) suggested the "erasure detection index" (EDI) to detect fraudulent erasures for individual examinees. Wollack and Eckerly (2017) extended the EDI to detect fraudulent erasures at the group level. The EDI at the group level was found to be slightly conservative. This paper suggests two modifications of…
Descriptors: Deception, Identification, Testing Problems, Cheating
Si, Yajuan; Reiter, Jerome P. – Journal of Educational and Behavioral Statistics, 2013
In many surveys, the data comprise a large number of categorical variables that suffer from item nonresponse. Standard methods for multiple imputation, like log-linear models or sequential regression imputation, can fail to capture complex dependencies and can be difficult to implement effectively in high dimensions. We present a fully Bayesian,…
Descriptors: Nonparametric Statistics, Bayesian Statistics, Measurement, Evaluation Methods
Feinberg, Richard A. – ProQuest LLC, 2012
Subscores, also known as domain scores, diagnostic scores, or trait scores, can help determine test-takers' relative strengths and weaknesses and appropriately focus remediation. However, subscores often have poor psychometric properties, particularly reliability and distinctiveness (Folske, Gessaroli, & Swanson, 1999; Monaghan, 2006;…
Descriptors: Simulation, Tests, Testing, Scores
Oshima, T. C.; Davey, T. C. – 1994
This paper evaluated multidimensional linking procedures with which multidimensional test data from two separate calibrations were put on a common scale. Data were simulated with known ability distributions varying on two factors which made linking necessary: mean vector differences and variance-covariance (v-c) matrix differences. After the…
Descriptors: Ability, Estimation (Mathematics), Evaluation Methods, Matrices

Bloxom, Bruce; And Others – Journal of Educational and Behavioral Statistics, 1995
Develops and evaluates the linkage of the Armed Services Vocational Aptitude Battery to the mathematics scale of the National Assessment of Educational Progress. The accuracy of the proficiency distribution estimated from the projection was close to the accuracy of the distribution estimated from the large scale assessment. (SLD)
Descriptors: Educational Assessment, Estimation (Mathematics), Evaluation Methods, Mathematics Tests

Gregson, Robert A. M. – Psychometrika, 1994
The derivation of the variance of similarity judgments is made from the 3-D process in nonlinear psychophysics. The idea of separability of dimensions in metric space theories of similarity is replaced by one parameter that represents the degree of a form of interdimensional cross-sampling. (SLD)
Descriptors: Decision Making, Equations (Mathematics), Evaluation Methods, Models
Kistner, Emily O.; Muller, Keith E. – Psychometrika, 2004
Intraclass correlation and Cronbach's alpha are widely used to describe reliability of tests and measurements. Even with Gaussian data, exact distributions are known only for compound symmetric covariance (equal variances and equal correlations). Recently, large sample Gaussian approximations were derived for the distribution functions. New exact…
Descriptors: Correlation, Test Reliability, Test Results, Probability