Publication Date
In 2025 | 0 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 63 |
Since 2006 (last 20 years) | 138 |
Descriptor
Scoring Formulas | 582 |
Test Reliability | 146 |
Multiple Choice Tests | 120 |
Test Validity | 105 |
Guessing (Tests) | 100 |
Scoring | 91 |
Higher Education | 89 |
Evaluation Methods | 77 |
Test Interpretation | 76 |
Test Construction | 74 |
Statistical Analysis | 68 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 12 |
Practitioners | 10 |
Community | 5 |
Parents | 5 |
Teachers | 3 |
Policymakers | 2 |
Location
Florida | 7 |
United Kingdom | 6 |
United Kingdom (England) | 6 |
Australia | 5 |
Canada | 5 |
United States | 5 |
Georgia | 3 |
New York | 3 |
North Carolina | 3 |
Turkey | 3 |
California | 2 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 3 |
No Child Left Behind Act 2001 | 3 |
Education for All Handicapped… | 1 |
Individuals with Disabilities… | 1 |
Serrano v Priest | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Partnership for Assessment of Readiness for College and Careers, 2015
The 2014-2015 administrations of the PARCC assessment included two separate test administration windows: the Performance-Based Assessment (PBA) and the End-of-Year (EOY), both of which were administered in paper-based and computer-based formats. The first window was for administration of the PBA, and the second window was for the administration of…
Descriptors: Mathematics Tests, Scoring Formulas, Scoring Rubrics, Performance Based Assessment
Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018
Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…
Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests
Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
Gierl, Mark J.; Bulut, Okan; Guo, Qi; Zhang, Xinxin – Review of Educational Research, 2017
Multiple-choice testing is considered one of the most effective and enduring forms of educational assessment that remains in practice today. This study presents a comprehensive review of the literature on multiple-choice testing in education focused, specifically, on the development, analysis, and use of the incorrect options, which are also…
Descriptors: Multiple Choice Tests, Difficulty Level, Accuracy, Error Patterns
DeSanto, Dan; Nichols, Aaron – College & Research Libraries, 2017
This article presents the results of a faculty survey conducted at the University of Vermont during academic year 2014-2015. The survey asked faculty about: familiarity with scholarly metrics, metric-seeking habits, help-seeking habits, and the role of metrics in their department's tenure and promotion process. The survey also gathered faculty…
Descriptors: College Faculty, Teacher Surveys, Knowledge Level, Use Studies
Peterson, Claudette M.; Peterson, Tim O. – Journal of Management Education, 2016
As professors, we each have our own approach to grading which allows us to assess learning and provide useful feedback to our students, yet is not too onerous. This article explains one approach we have used that differs from standard grading scales we often hear about from our colleagues. Rather than being based on 100 points or 100% over the…
Descriptors: Grading, Student Evaluation, Evaluation Criteria, Evaluation Methods
Wise, Steven L.; Kingsbury, G. Gage – Journal of Educational Measurement, 2016
This study examined the utility of response time-based analyses in understanding the behavior of unmotivated test takers. For the data from an adaptive achievement test, patterns of observed rapid-guessing behavior and item response accuracy were compared to the behavior expected under several types of models that have been proposed to represent…
Descriptors: Achievement Tests, Student Motivation, Test Wiseness, Adaptive Testing
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Wheeler, Amber D. – ProQuest LLC, 2017
The purpose of this study is to explore the perceptions of parents and teachers regarding the success of a standards-based grading initiative in meeting its goals. Furthermore, findings from this study will be used to inform decisions made in future grade level implementations. Standards-based grading meets all criteria for a problem of practice.…
Descriptors: Grading, Academic Standards, Models, Success
Guo, Shenyang; Fraser, Mark W. – SAGE Publications Ltd (CA), 2014
Fully updated to reflect the most recent changes in the field, the Second Edition of "Propensity Score Analysis" provides an accessible, systematic review of the origins, history, and statistical foundations of propensity score analysis, illustrating how it can be used for solving evaluation and causal-inference problems. With a strong…
Descriptors: Probability, Scores, Statistical Analysis, Causal Models
Martin, Jeremy P. – Change: The Magazine of Higher Learning, 2015
Rankings are a powerful force in higher education, swaying the enrollment decisions of prospective students and affecting the opinions of parents, board members, and policymakers. In the words of one provost, "The rankings matter to our university because they matter to people who matter to us." Rankings are also a business--one that is…
Descriptors: Higher Education, Achievement Rating, Institutional Characteristics, Reputation
Severo, Milton; Gaio, A. Rita; Povo, Ana; Silva-Pereira, Fernanda; Ferreira, Maria Amélia – Anatomical Sciences Education, 2015
In theory the formula scoring methods increase the reliability of multiple-choice tests in comparison with number-right scoring. This study aimed to evaluate the impact of the formula scoring method in clinical anatomy multiple-choice examinations, and to compare it with that from the number-right scoring method, hoping to achieve an…
Descriptors: Anatomy, Multiple Choice Tests, Scoring, Decision Making
Handley, Fiona J. L.; Read, Ann – Perspectives: Policy and Practice in Higher Education, 2017
In 2011, Southampton Solent University, a post-1992 university in southern England, introduced a new marking scheme with the aims of changing marking practice to achieve greater transparency and consistency in marking, and to ensure that the full range of marks was being awarded to students. This paper discusses the strategic background to the…
Descriptors: Case Studies, Grading, Strategic Planning, Evaluation Methods
Xie, Jianping – English Language Teaching, 2017
The ultimate communicative purpose of literature reviews is to convince the reader of the worthiness of the writer's research, which is realized stage by stage and evaluation plays an important role in achieving this end. However, concerns about evaluation demonstration in novice academic writers' literature reviews have been repeatedly voiced in…
Descriptors: Literature Reviews, Masters Theses, English (Second Language), College Second Language Programs
A Generalizable Framework for Multi-Scale Auditing of Digital Learning Provision in Higher Education
Ross, Samuel R. P-J.; Volz, Veronica; Lancaster, Matthew K.; Divan, Aysha – Online Learning, 2018
It is increasingly important that higher education institutions be able to audit and evaluate the scope and efficacy of their digital learning resources across various scales. To date there has been little effort to address this need for a validated, appropriate, and simple-to-execute method that will facilitate such an audit, whether it be at the…
Descriptors: Higher Education, Audits (Verification), Electronic Learning, Educational Resources