Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023
This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…
Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
Cheema, Jehanzeb R. – Review of Educational Research, 2014
Missing data are a common occurrence in survey-based research studies in education, and the way missing values are handled can significantly affect the results of analyses based on such data. Despite known problems with performance of some missing data handling methods, such as mean imputation, many researchers in education continue to use those…
Descriptors: Educational Research, Data, Data Collection, Data Processing
French, Brian F.; Maller, Susan J. – Educational and Psychological Measurement, 2007
Two unresolved implementation issues with logistic regression (LR) for differential item functioning (DIF) detection include ability purification and effect size use. Purification is suggested to control inaccuracies in DIF detection as a result of DIF items in the ability estimate. Additionally, effect size use may be beneficial in controlling…
Descriptors: Effect Size, Test Bias, Guidelines, Error of Measurement
Ferrao, Maria – Assessment & Evaluation in Higher Education, 2010
The Bologna Declaration brought reforms into higher education that imply changes in teaching methods, didactic materials and textbooks, infrastructures and laboratories, etc. Statistics and mathematics are disciplines that traditionally have the worst success rates, particularly in non-mathematics core curricula courses. This research project,…
Descriptors: Foreign Countries, Computer Assisted Testing, Educational Technology, Educational Assessment
Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2006
Assuming errors of measurement are distributed binomially, this article reviews various procedures for constructing an interval for an individual's true number-correct score; presents two general interval estimation procedures for an individual's true scale score (i.e., normal approximation and endpoints conversion methods); compares various…
Descriptors: Probability, Intervals, Guidelines, Computer Simulation
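As a rough illustration of the normal approximation idea named in the abstract above, the following minimal sketch builds a Wald-type interval for a true number-correct score under a binomial error model. It is not taken from the article: the function name `true_score_interval`, its parameters, and the plug-in variance estimate x(n - x)/n are assumptions made for illustration, and the procedures the article actually reviews may differ.

```python
import math


def true_score_interval(x, n, z=1.96):
    """Wald-type normal-approximation interval for a true number-correct score.

    x: observed number-correct score (hypothetical input)
    n: number of dichotomously scored items (hypothetical input)
    z: standard normal quantile; 1.96 gives a nominal 95% interval
    """
    if n <= 0 or not 0 <= x <= n:
        raise ValueError("require n > 0 and 0 <= x <= n")
    p_hat = x / n
    # Binomial standard deviation of the number-correct score,
    # evaluated at the observed proportion correct (plug-in estimate).
    se = math.sqrt(n * p_hat * (1 - p_hat))
    # Clip the endpoints to the possible score range [0, n].
    lower = max(0.0, x - z * se)
    upper = min(float(n), x + z * se)
    return lower, upper


lower, upper = true_score_interval(30, 40)
print(f"{lower:.2f} to {upper:.2f}")
```

Note that the plug-in standard error is zero at x = 0 or x = n, so the interval collapses to a single point there; this is a known weakness of the simple normal approximation.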
Niemi, David; Wang, Jia; Wang, Haiwen; Vallone, Julia; Griffin, Noelle – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2007
There are usually many testing activities going on in a school, with different tests serving different purposes, so organization and planning are key to creating an efficient system for assessing the most important educational objectives. In the ideal case, an assessment system will be able to inform on student learning, instruction and…
Descriptors: School Administration, Educational Objectives, Administration, Public Schools