NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 7 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Pollio, Marty; Hochbein, Craig – Teachers College Record, 2015
Background/Context: From two decades of research on the grading practices of teachers in secondary schools, researchers discovered that teachers evaluated students on numerous factors that do not validly assess a student's achievement level in a specific content area. These consistent findings suggested that traditional grading practices evolved…
Descriptors: Standardized Tests, Academic Standards, Grading, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Yu, Eunjyu – Research & Teaching in Developmental Education, 2014
In a study designed to analyze faculty and student perceptions of the value of digital writing in the first year composition classroom, 21 first-year college students and a nationwide sample of 50 college composition teachers participated in conceptualizing digital multimodal composition and defining the benchmarks for first-year college digital…
Descriptors: Developmental Programs, Freshman Composition, Electronic Publishing, Benchmarking
Peer reviewed Peer reviewed
Direct linkDirect link
Kreiner, Svend – Applied Psychological Measurement, 2011
To rule out the need for a two-parameter item response theory (IRT) model during item analysis by Rasch models, it is important to check the Rasch model's assumption that all items have the same item discrimination. Biserial and polyserial correlation coefficients measuring the association between items and restscores are often used in an informal…
Descriptors: Item Analysis, Correlation, Item Response Theory, Models
Mazer, Irene R. – 1981
The need to determine eligibility for a program for intellectually gifted students resulted in combining deviation scores on achievement, aptitude, ability and motivation measures into a matrix score. These matrix scores and the students' success in the program were determined for present participants. Students were classified as successful or…
Descriptors: Eligibility, Evaluation Methods, Gifted, Scores
Cole, Nancy S. – 1982
The advantages and disadvantages of grade equivalent (GE) scores are explored, including appropriate uses for GE type scores and how to bring current GE scales closer to the type of information educators appear to desire. Although GE scores are not an equal interval scale, not comparable across school subjects, and do not indicate the grade level…
Descriptors: Academic Achievement, Elementary Secondary Education, Evaluation Methods, Formative Evaluation