Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 5 |
Descriptor
Evaluation Methods | 7 |
Scores | 7 |
Scoring Formulas | 7 |
Academic Standards | 2 |
Computation | 2 |
Correlation | 2 |
Evaluation Criteria | 2 |
Item Response Theory | 2 |
Models | 2 |
Scoring Rubrics | 2 |
Test Construction | 2 |
More ▼ |
Source
Applied Psychological… | 1 |
Educational Assessment | 1 |
Measurement and Evaluation in… | 1 |
Research & Teaching in… | 1 |
Teachers College Record | 1 |
Author
Bardhoshi, Gerta | 1 |
Cole, Nancy S. | 1 |
Erford, Bradley T. | 1 |
Hochbein, Craig | 1 |
Kreiner, Svend | 1 |
Lee, Minji K. | 1 |
Mazer, Irene R. | 1 |
Melican, Gerald J. | 1 |
Pollio, Marty | 1 |
Sweeney, Kevin | 1 |
Yu, Eunjyu | 1 |
More ▼ |
Publication Type
Journal Articles | 5 |
Reports - Descriptive | 3 |
Reports - Research | 3 |
Speeches/Meeting Papers | 2 |
Numerical/Quantitative Data | 1 |
Opinion Papers | 1 |
Reports - Evaluative | 1 |
Education Level
Adult Education | 1 |
Grade 11 | 1 |
High Schools | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Audience
Location
Denmark | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Pollio, Marty; Hochbein, Craig – Teachers College Record, 2015
Background/Context: From two decades of research on the grading practices of teachers in secondary schools, researchers discovered that teachers evaluated students on numerous factors that do not validly assess a student's achievement level in a specific content area. These consistent findings suggested that traditional grading practices evolved…
Descriptors: Standardized Tests, Academic Standards, Grading, Scores
Yu, Eunjyu – Research & Teaching in Developmental Education, 2014
In a study designed to analyze faculty and student perceptions of the value of digital writing in the first year composition classroom, 21 first-year college students and a nationwide sample of 50 college composition teachers participated in conceptualizing digital multimodal composition and defining the benchmarks for first-year college digital…
Descriptors: Developmental Programs, Freshman Composition, Electronic Publishing, Benchmarking
Kreiner, Svend – Applied Psychological Measurement, 2011
To rule out the need for a two-parameter item response theory (IRT) model during item analysis by Rasch models, it is important to check the Rasch model's assumption that all items have the same item discrimination. Biserial and polyserial correlation coefficients measuring the association between items and restscores are often used in an informal…
Descriptors: Item Analysis, Correlation, Item Response Theory, Models
Mazer, Irene R. – 1981
The need to determine eligibility for a program for intellectually gifted students resulted in combining deviation scores on achievement, aptitude, ability and motivation measures into a matrix score. These matrix scores and the students' success in the program were determined for present participants. Students were classified as successful or…
Descriptors: Eligibility, Evaluation Methods, Gifted, Scores
Cole, Nancy S. – 1982
The advantages and disadvantages of grade equivalent (GE) scores are explored, including appropriate uses for GE type scores and how to bring current GE scales closer to the type of information educators appear to desire. Although GE scores are not an equal interval scale, not comparable across school subjects, and do not indicate the grade level…
Descriptors: Academic Achievement, Elementary Secondary Education, Evaluation Methods, Formative Evaluation