Crompvoets, Elise A. V.; Béguin, Anton A.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2020
Pairwise comparison is becoming increasingly popular as a holistic measurement method in education. Unfortunately, many comparisons are required for reliable measurement. To reduce the number of required comparisons, we developed an adaptive selection algorithm (ASA) that selects the most informative comparisons while taking the uncertainty of the…
Descriptors: Comparative Analysis, Statistical Analysis, Mathematics, Measurement
Steiner, Peter M.; Cook, Thomas D.; Shadish, William R. – Journal of Educational and Behavioral Statistics, 2011
The effect of unreliability of measurement on propensity score (PS) adjusted treatment effects has not been previously studied. The authors report on a study simulating different degrees of unreliability in the multiple covariates that were used to estimate the PS. The simulation uses the same data as two prior studies. Shadish, Clark, and Steiner…
Descriptors: Statistical Bias, Reliability, Measurement, Scores
Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011
Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…
Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement
Cao, Jing; Stokes, S. Lynne; Zhang, Song – Journal of Educational and Behavioral Statistics, 2010
We develop a Bayesian hierarchical model for the analysis of ordinal data from multirater ranking studies. The model for a rater's score includes four latent factors: one is a latent item trait determining the true order of items and the other three are the rater's performance characteristics, including bias, discrimination, and measurement error…
Descriptors: Bayesian Statistics, Data Analysis, Bias, Measurement
Li, Deping; Oranje, Andreas; Jiang, Yanlin – Journal of Educational and Behavioral Statistics, 2009
To find population proficiency distributions, a two-level hierarchical linear model may be applied to large-scale survey assessments such as the National Assessment of Educational Progress (NAEP). The model and parameter estimation are developed and a simulation was carried out to evaluate parameter recovery. Subsequently, both a hierarchical and…
Descriptors: Computation, National Competency Tests, Measurement, Regression (Statistics)