ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	3

Descriptor

Evaluation Methods	6
Models	6
True Scores	6
Mathematical Models	2
Measurement Techniques	2
Prediction	2
Rating Scales	2
Academic Standards	1
Classroom Environment	1
Comparative Analysis	1
Computation	1
Correlation	1
Criteria	1
Criterion Referenced Tests	1
Decision Making	1
Educational Diagnosis	1
Error of Measurement	1
Essays	1
Estimation (Mathematics)	1
Factor Analysis	1
Goal Orientation	1
Grade 3	1
Grading	1
Identification	1
Interrater Reliability	1
More ▼

Source

Contemporary Educational…	1
Educational Assessment	1
Psychological Methods	1

Author

Cowell, Ryan	1
Drewes, Donald W.	1
Hooper, Jay	1
Longford, Nicholas T.	1
Miller, Angela D.	1
Murdock, Tamera B.	1
Pommerich, Mary	1
Roudabush, Glenn E.	1

Publication Type

Reports - Research	4
Journal Articles	3
Reports - Evaluative	2
Speeches/Meeting Papers	1

Education Level

Higher Education

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

North Carolina End of Course…

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Standards-Based Grading: History Adjusted True Score

Peer reviewed

Direct link

Hooper, Jay; Cowell, Ryan – Educational Assessment, 2014

There has been much research and discussion on the principles of standards-based grading, and there is a growing consensus of best practice. Even so, the actual process of implementing standards-based grading at a school or district level can be a significant challenge. There are very practical questions that remain unclear, such as how the grades…

Descriptors: True Scores, Grading, Academic Standards, Computation

Subject-Centered Scalability: The Sine Qua Non of Summated Ratings

Peer reviewed

Direct link

Drewes, Donald W. – Psychological Methods, 2009

A unifying theory of subject-centered scalability is offered that is grounded in structural true score modeling, is conceptually distinct from internal consistency and homogeneity as determined by item correlations, and is empirically confirmable. Scalability holds when item true scores are perfectly correlated but differ in their individual scale…

Descriptors: Rating Scales, Factor Analysis, True Scores, Mathematical Models

Modeling Latent True Scores to Determine the Utility of Aggregate Student Perceptions as Classroom Indicators in HLM: The Case of Classroom Goal Structures

Peer reviewed

Direct link

Miller, Angela D.; Murdock, Tamera B. – Contemporary Educational Psychology, 2007

Measures of classroom climate such as classroom goal structures are often assessed through students' perceptions; the aggregated means within classrooms are then sometimes labeled as "classroom characteristics." The validity of these constructs is limited by the reliability of the measure at both the student and classroom level; yet, few studies…

Descriptors: True Scores, Teacher Characteristics, Classroom Environment, Student Attitudes

Reliability of Essay Rating and Score Adjustment. Program Statistics Research Technical Report No. 93-36.

Download full text

Longford, Nicholas T. – 1993

A model-based approach to rater reliability for essays read by multiple readers is presented. Variation of rater severity (between-rater variation) and rater inconsistency (within-rater variation) is considered in the presence of between-examinee variation. An additive variance component model is posited and the method of moments for its…

Descriptors: Educational Diagnosis, Error of Measurement, Essays, Estimation (Mathematics)

Models for a Beginning Theory of Criterion-Referenced Tests.

Download full text

Roudabush, Glenn E. – 1974

In this paper, several models for the psychometric nature of criterion-referenced tests are presented and results derived with implications for test construction, reliability and validity measures, and educational decision making. Both dichotomous and continuous underlying abilities to perform are considered. Illustrative data fitting both cases…

Descriptors: Criterion Referenced Tests, Decision Making, Evaluation Methods, Measurement Techniques

Demonstrating the Utility of a Multilevel Model in the Assessment of Differential Item Functioning.

Download full text

Pommerich, Mary – 1995

When tests contain few items, observed score may not be an accurate reflection of true score, and the Mantel Haenszel (MH) statistic may perform poorly in detecting differential item functioning. Applications of the MH procedure in such situations require an alternate strategy; one such strategy is to include background variables in the matching…

Descriptors: Criteria, Evaluation Methods, Grade 3, Identification