Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 3 |
Descriptor
Evaluation Methods | 6 |
Models | 6 |
True Scores | 6 |
Mathematical Models | 2 |
Measurement Techniques | 2 |
Prediction | 2 |
Rating Scales | 2 |
Academic Standards | 1 |
Classroom Environment | 1 |
Comparative Analysis | 1 |
Computation | 1 |
More ▼ |
Author
Cowell, Ryan | 1 |
Drewes, Donald W. | 1 |
Hooper, Jay | 1 |
Longford, Nicholas T. | 1 |
Miller, Angela D. | 1 |
Murdock, Tamera B. | 1 |
Pommerich, Mary | 1 |
Roudabush, Glenn E. | 1 |
Publication Type
Reports - Research | 4 |
Journal Articles | 3 |
Reports - Evaluative | 2 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
North Carolina End of Course… | 1 |
What Works Clearinghouse Rating
Hooper, Jay; Cowell, Ryan – Educational Assessment, 2014
There has been much research and discussion on the principles of standards-based grading, and there is a growing consensus of best practice. Even so, the actual process of implementing standards-based grading at a school or district level can be a significant challenge. There are very practical questions that remain unclear, such as how the grades…
Descriptors: True Scores, Grading, Academic Standards, Computation
Drewes, Donald W. – Psychological Methods, 2009
A unifying theory of subject-centered scalability is offered that is grounded in structural true score modeling, is conceptually distinct from internal consistency and homogeneity as determined by item correlations, and is empirically confirmable. Scalability holds when item true scores are perfectly correlated but differ in their individual scale…
Descriptors: Rating Scales, Factor Analysis, True Scores, Mathematical Models
Miller, Angela D.; Murdock, Tamera B. – Contemporary Educational Psychology, 2007
Measures of classroom climate such as classroom goal structures are often assessed through students' perceptions; the aggregated means within classrooms are then sometimes labeled as "classroom characteristics." The validity of these constructs is limited by the reliability of the measure at both the student and classroom level; yet, few studies…
Descriptors: True Scores, Teacher Characteristics, Classroom Environment, Student Attitudes
Longford, Nicholas T. – 1993
A model-based approach to rater reliability for essays read by multiple readers is presented. Variation of rater severity (between-rater variation) and rater inconsistency (within-rater variation) is considered in the presence of between-examinee variation. An additive variance component model is posited and the method of moments for its…
Descriptors: Educational Diagnosis, Error of Measurement, Essays, Estimation (Mathematics)
Roudabush, Glenn E. – 1974
In this paper, several models for the psychometric nature of criterion-referenced tests are presented and results derived with implications for test construction, reliability and validity measures, and educational decision making. Both dichotomous and continuous underlying abilities to perform are considered. Illustrative data fitting both cases…
Descriptors: Criterion Referenced Tests, Decision Making, Evaluation Methods, Measurement Techniques
Pommerich, Mary – 1995
When tests contain few items, observed score may not be an accurate reflection of true score, and the Mantel Haenszel (MH) statistic may perform poorly in detecting differential item functioning. Applications of the MH procedure in such situations require an alternate strategy; one such strategy is to include background variables in the matching…
Descriptors: Criteria, Evaluation Methods, Grade 3, Identification