Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 37 |
Since 2006 (last 20 years) | 80 |
Audience
Researchers | 8 |
Practitioners | 1 |
Location
Australia | 3 |
Turkey | 3 |
United Kingdom | 3 |
United Kingdom (England) | 3 |
China | 2 |
India | 2 |
Malaysia | 2 |
Thailand | 2 |
Bosnia and Herzegovina… | 1 |
California | 1 |
Canada | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 3 |
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Steven Holtzman; Jonathan Steinberg; Jonathan Weeks; Christopher Robertson; Jessica Findley; David Klieger – ETS Research Report Series, 2024
At a time when institutions of higher education are exploring alternatives to traditional admissions testing, institutions are also seeking to better support students and prepare them for academic success. Under such an engaged model, one may seek to measure not just the accumulated knowledge and skills that students would bring to a new academic…
Descriptors: Law Schools, College Applicants, Legal Education (Professions), College Entrance Examinations
Kacprzyk, Joanna; Parsons, Martin; Maguire, Patricia B.; Stewart, Gavin S. – Irish Educational Studies, 2019
The optimum assessment structure measures student knowledge accurately and without bias. In this study, the performance of the first-year undergraduate science students from the University College Dublin was evaluated to test the gender equality of the assessment structure in place. Results of male and female students taking three life science…
Descriptors: Science Tests, Gender Bias, College Freshmen, Foreign Countries
Wyse, Adam E. – Practical Assessment, Research & Evaluation, 2018
One common modification to the Angoff standard-setting method is to have panelists round their ratings to the nearest 0.05 or 0.10 instead of 0.01. Several reasons have been offered as to why it may make sense to have panelists round their ratings to the nearest 0.05 or 0.10. In this article, we examine one reason that has been suggested, which is…
Descriptors: Interrater Reliability, Evaluation Criteria, Scoring Formulas, Achievement Rating
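As an illustration of the rounding modification described in the abstract above, the following is a minimal sketch, not the article's own procedure. It assumes the usual Angoff setup in which each panelist rates each item with a probability and the cut score is the sum over items of the mean rating. The function names, the integer-hundredths representation, and the sample ratings are all hypothetical.

```python
def round_to_step(rating_hundredths, step):
    """Round an integer rating (in hundredths) to the nearest multiple of step.

    step=1 keeps 0.01 precision, step=5 rounds to 0.05, step=10 rounds to 0.10.
    Ties (possible only for even steps) round up.
    """
    return (rating_hundredths + step // 2) // step * step


def angoff_cut_score(ratings, step=1):
    """Sum over items of the mean panelist rating, after optional rounding.

    ratings: one list per panelist, each holding item ratings in hundredths
    (for example 62 means a rating of 0.62). Hypothetical input format.
    """
    n_panelists = len(ratings)
    total = sum(round_to_step(r, step) for panel in ratings for r in panel)
    return total / (100 * n_panelists)


# Two hypothetical panelists rating three items.
ratings = [[62, 48, 91], [58, 53, 87]]

unrounded = angoff_cut_score(ratings, step=1)
rounded_05 = angoff_cut_score(ratings, step=5)
rounded_10 = angoff_cut_score(ratings, step=10)
print(unrounded, rounded_05, rounded_10)
```

Working in integer hundredths avoids floating-point surprises when rounding to 0.05 or 0.10. Comparing the three results shows how much a given set of ratings shifts the cut score under each rounding rule.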
Yun, Young Ho; Kim, Yaeji; Sim, Jin A.; Choi, Soo Hyuk; Lim, Cheolil; Kang, Joon-ho – Journal of School Health, 2018
Background: The objective of this study was to develop the School Health Score Card (SHSC) and validate its psychometric properties. Methods: The development of the SHSC questionnaire included 3 phases: item generation, construction of domains and items, and field testing with validation. To assess the instrument's reliability and validity, we…
Descriptors: School Health Services, Psychometrics, Test Construction, Test Validity
Morgan, Grant B.; Moore, Courtney A.; Floyd, Harlee S. – Journal of Psychoeducational Assessment, 2018
Although content validity--how well each item of an instrument represents the construct being measured--is foundational in the development of an instrument, statistical validity is also important to the decisions that are made based on the instrument. The primary purpose of this study is to demonstrate how simulation studies can be used to assist…
Descriptors: Simulation, Decision Making, Test Construction, Validity
Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018
The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…
Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators
Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018
Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…
Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests
Gierl, Mark J.; Bulut, Okan; Guo, Qi; Zhang, Xinxin – Review of Educational Research, 2017
Multiple-choice testing is considered one of the most effective and enduring forms of educational assessment that remains in practice today. This study presents a comprehensive review of the literature on multiple-choice testing in education focused, specifically, on the development, analysis, and use of the incorrect options, which are also…
Descriptors: Multiple Choice Tests, Difficulty Level, Accuracy, Error Patterns
DeSanto, Dan; Nichols, Aaron – College & Research Libraries, 2017
This article presents the results of a faculty survey conducted at the University of Vermont during academic year 2014-2015. The survey asked faculty about: familiarity with scholarly metrics, metric-seeking habits, help-seeking habits, and the role of metrics in their department's tenure and promotion process. The survey also gathered faculty…
Descriptors: College Faculty, Teacher Surveys, Knowledge Level, Use Studies
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Severo, Milton; Gaio, A. Rita; Povo, Ana; Silva-Pereira, Fernanda; Ferreira, Maria Amélia – Anatomical Sciences Education, 2015
In theory the formula scoring methods increase the reliability of multiple-choice tests in comparison with number-right scoring. This study aimed to evaluate the impact of the formula scoring method in clinical anatomy multiple-choice examinations, and to compare it with that from the number-right scoring method, hoping to achieve an…
Descriptors: Anatomy, Multiple Choice Tests, Scoring, Decision Making
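For readers unfamiliar with the two scoring methods compared in the abstract above, the sketch below contrasts number-right scoring with the classical correction-for-guessing formula, score = R - W/(k - 1), where R is the number right, W the number wrong, and k the number of options per item. Whether the study used exactly this formula is an assumption. The function names, answer key, and responses are hypothetical.

```python
from fractions import Fraction


def number_right_score(responses, key):
    """Count correct answers; omitted (None) and wrong answers earn nothing."""
    return sum(1 for r, k in zip(responses, key) if r is not None and r == k)


def formula_score(responses, key, n_options):
    """Classical formula score: right minus wrong / (n_options - 1).

    Omitted items (None) are neither rewarded nor penalized.
    """
    right = number_right_score(responses, key)
    wrong = sum(1 for r, k in zip(responses, key) if r is not None and r != k)
    return Fraction(right) - Fraction(wrong, n_options - 1)


# Hypothetical five-item test with five options per item.
key = ["A", "C", "B", "D", "A"]
responses = ["A", "C", "D", None, "B"]  # 2 right, 2 wrong, 1 omitted

print(number_right_score(responses, key))  # 2
print(formula_score(responses, key, 5))    # 3/2
```

Under formula scoring an examinee who guesses at random gains nothing in expectation, which is the basis of the theoretical reliability argument the abstract mentions.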
Handley, Fiona J. L.; Read, Ann – Perspectives: Policy and Practice in Higher Education, 2017
In 2011, Southampton Solent University, a post-1992 university in southern England, introduced a new marking scheme with the aims of changing marking practice to achieve greater transparency and consistency in marking, and to ensure that the full range of marks was being awarded to students. This paper discusses the strategic background to the…
Descriptors: Case Studies, Grading, Strategic Planning, Evaluation Methods
Xie, Jianping – English Language Teaching, 2017
The ultimate communicative purpose of literature reviews is to convince the reader of the worthiness of the writer's research, which is realized stage by stage and evaluation plays an important role in achieving this end. However, concerns about evaluation demonstration in novice academic writers' literature reviews have been repeatedly voiced in…
Descriptors: Literature Reviews, Masters Theses, English (Second Language), College Second Language Programs
A Generalizable Framework for Multi-Scale Auditing of Digital Learning Provision in Higher Education
Ross, Samuel R. P-J.; Volz, Veronica; Lancaster, Matthew K.; Divan, Aysha – Online Learning, 2018
It is increasingly important that higher education institutions be able to audit and evaluate the scope and efficacy of their digital learning resources across various scales. To date there has been little effort to address this need for a validated, appropriate, and simple-to-execute method that will facilitate such an audit, whether it be at the…
Descriptors: Higher Education, Audits (Verification), Electronic Learning, Educational Resources
Rozell, Timothy G.; Johnson, Jessica; Sexten, Andrea; Rhodes, Ashley E. – Journal of College Science Teaching, 2017
Students in a junior- and senior-level Anatomy and Physiology course have the opportunity to correct missed exam questions ("regrade") and earn up to half of the original points missed. The three objectives of this study were to determine if: (a) performance on the regrade assignment was correlated with scores on subsequent exams, (b)…
Descriptors: Physiology, Scores, Grades (Scholastic), Exit Examinations