Leite, Walter L.; Aydin, Burak; Gurel, Sungur – Journal of Experimental Education, 2019
This Monte Carlo simulation study compares methods to estimate the effects of programs with multiple versions when assignment of individuals to program version is not random. These methods use generalized propensity scores, which are predicted probabilities of receiving a particular level of the treatment conditional on covariates, to remove…
Descriptors: Probability, Weighted Scores, Monte Carlo Methods, Statistical Bias
Ganzfried, Sam; Yusuf, Farzana – Education Sciences, 2018
A problem faced by many instructors is that of designing exams that accurately assess the abilities of the students. Typically, these exams are prepared several days in advance, and generic question scores are used based on rough approximation of the question difficulty and length. For example, for a recent class taught by the author, there were…
Descriptors: Weighted Scores, Test Construction, Student Evaluation, Multiple Choice Tests
Qian, Jiahe; Jiang, Yanming; von Davier, Alina A. – ETS Research Report Series, 2013
Several factors could cause variability in item response theory (IRT) linking and equating procedures, such as the variability across examinee samples and/or test items, seasonality, regional differences, native language diversity, gender, and other demographic variables. Hence, the following question arises: Is it possible to select optimal…
Descriptors: Item Response Theory, Test Items, Sampling, True Scores
Schuster, Christof – Educational and Psychological Measurement, 2004
This article presents a formula for weighted kappa in terms of rater means, rater variances, and the rater covariance that is particularly helpful in emphasizing that weighted kappa is an absolute agreement measure in the sense that it is sensitive to differences in raters' marginal distributions. Specifically, rater mean differences will decrease…
Descriptors: Computation, Rating Scales, Interrater Reliability, Statistical Analysis
Qian, Jiahe – ETS Research Report Series, 2006
Weighting and variance estimation are two statistical issues involved in survey data analysis for large-scale assessment programs such as the Higher Education Information and Communication Technology (ICT) Literacy Assessment. Because survey data are always acquired by probability sampling, to draw unbiased or almost unbiased inferences for the…
Descriptors: Weighted Scores, Sampling, Statistical Analysis, Higher Education