ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	37
Since 2006 (last 20 years)	80

Descriptor

Scoring Formulas	274
Test Reliability	72
Multiple Choice Tests	60
Higher Education	54
Test Validity	53
Guessing (Tests)	50
Scoring	43
Test Items	41
Statistical Analysis	37
Test Construction	35
Comparative Analysis	32
Evaluation Methods	32
Item Analysis	32
Testing Problems	32
Scores	31
Test Interpretation	29
Foreign Countries	26
Achievement Tests	24
Response Style (Tests)	24
Cutting Scores	23
Correlation	22
Mathematical Models	21
Psychometrics	21
Difficulty Level	20
Evaluation Criteria	20
More ▼

Publication Type

Reports - Research	274
Journal Articles	162
Speeches/Meeting Papers	44
Information Analyses	10
Tests/Questionnaires	8
Numerical/Quantitative Data	5
Reports - Evaluative	4
Opinion Papers	2
Collected Works - General	1
Collected Works - Serials	1
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
Reports - Descriptive	1
More ▼

Education Level

Higher Education	42
Postsecondary Education	33
Secondary Education	10
Elementary Education	7
High Schools	7
Elementary Secondary Education	4
Junior High Schools	4
Middle Schools	4
Grade 7	3
Adult Education	2
Early Childhood Education	2
Grade 11	2
Grade 12	1
Grade 2	1
Grade 3	1
Grade 8	1
Primary Education	1
More ▼

Audience

Researchers	8
Practitioners	1

Location

Australia	3
Turkey	3
United Kingdom	3
United Kingdom (England)	3
China	2
India	2
Malaysia	2
Thailand	2
Bosnia and Herzegovina…	1
California	1
Canada	1
Czech Republic	1
District of Columbia	1
Germany	1
Ireland (Dublin)	1
Japan	1
Kansas	1
Minnesota	1
Nevada (Las Vegas)	1
New York	1
New York (New York)	1
Oklahoma	1
Russia	1
United Kingdom (Bristol)	1
United Kingdom (Scotland)	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	3
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 274 results Save | Export

Validity, Reliability, and Fairness Evidence for the JD-Next Exam. Research Report. ETS RR-24-04

Peer reviewed
PDF on ERIC

Download full text

Steven Holtzman; Jonathan Steinberg; Jonathan Weeks; Christopher Robertson; Jessica Findley; David Klieger – ETS Research Report Series, 2024

At a time when institutions of higher education are exploring alternatives to traditional admissions testing, institutions are also seeking to better support students and prepare them for academic success. Under such an engaged model, one may seek to measure not just the accumulated knowledge and skills that students would bring to a new academic…

Descriptors: Law Schools, College Applicants, Legal Education (Professions), College Entrance Examinations

Examining Gender Effects in Different Types of Undergraduate Science Assessment

Peer reviewed

Direct link

Kacprzyk, Joanna; Parsons, Martin; Maguire, Patricia B.; Stewart, Gavin S. – Irish Educational Studies, 2019

The optimum assessment structure measures student knowledge accurately and without bias. In this study, the performance of the first-year undergraduate science students from the University College Dublin was evaluated to test the gender equality of the assessment structure in place. Results of male and female students taking three life science…

Descriptors: Science Tests, Gender Bias, College Freshmen, Foreign Countries

Rounding in Angoff Ratings

Peer reviewed
PDF on ERIC

Download full text

Wyse, Adam E. – Practical Assessment, Research & Evaluation, 2018

One common modification to the Angoff standard-setting method is to have panelists round their ratings to the nearest 0.05 or 0.10 instead of 0.01. Several reasons have been offered as to why it may make sense to have panelists round their ratings to the nearest 0.05 or 0.10. In this article, we examine one reason that has been suggested, which is…

Descriptors: Interrater Reliability, Evaluation Criteria, Scoring Formulas, Achievement Rating

Development and Validity Testing of the School Health Score Card

Peer reviewed

Direct link

Yun, Young Ho; Kim, Yaeji; Sim, Jin A.; Choi, Soo Hyuk; Lim, Cheolil; Kang, Joon-ho – Journal of School Health, 2018

Background: The objective of this study was to develop the School Health Score Card (SHSC) and validate its psychometric properties. Methods: The development of the SHSC questionnaire included 3 phases: item generation, construction of domains and items, and field testing with validation. To assess the instrument's reliability and validity, we…

Descriptors: School Health Services, Psychometrics, Test Construction, Test Validity

On Using Simulations to Inform Decision Making during Instrument Development

Peer reviewed

Direct link

Morgan, Grant B.; Moore, Courtney A.; Floyd, Harlee S. – Journal of Psychoeducational Assessment, 2018

Although content validity--how well each item of an instrument represents the construct being measured--is foundational in the development of an instrument, statistical validity is also important to the decisions that are made based on the instrument. The primary purpose of this study is to demonstrate how simulation studies can be used to assist…

Descriptors: Simulation, Decision Making, Test Construction, Validity

Appraising the Scoring Performance of Automated Essay Scoring Systems--Some Additional Considerations: Which Essays? Which Human Raters? Which Scores?

Peer reviewed

Direct link

Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018

The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…

Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators

Multiple True-False Items: A Comparison of Scoring Algorithms

Peer reviewed

Direct link

Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018

Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…

Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests

Developing, Analyzing, and Using Distractors for Multiple-Choice Tests in Education: A Comprehensive Review

Peer reviewed

Direct link

Gierl, Mark J.; Bulut, Okan; Guo, Qi; Zhang, Xinxin – Review of Educational Research, 2017

Multiple-choice testing is considered one of the most effective and enduring forms of educational assessment that remains in practice today. This study presents a comprehensive review of the literature on multiple-choice testing in education focused, specifically, on the development, analysis, and use of the incorrect options, which are also…

Descriptors: Multiple Choice Tests, Difficulty Level, Accuracy, Error Patterns

Scholarly Metrics Baseline: A Survey of Faculty Knowledge, Use, and Opinion about Scholarly Metrics

Peer reviewed

Direct link

DeSanto, Dan; Nichols, Aaron – College & Research Libraries, 2017

This article presents the results of a faculty survey conducted at the University of Vermont during academic year 2014-2015. The survey asked faculty about: familiarity with scholarly metrics, metric-seeking habits, help-seeking habits, and the role of metrics in their department's tenure and promotion process. The survey also gathered faculty…

Descriptors: College Faculty, Teacher Surveys, Knowledge Level, Use Studies

Test Assembly Implications for Providing Reliable and Valid Subscores

Peer reviewed

Direct link

Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017

This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…

Descriptors: Scores, Test Construction, Test Reliability, Test Validity

Evidence-Based Decision about Test Scoring Rules in Clinical Anatomy Multiple-Choice Examinations

Peer reviewed

Direct link

Severo, Milton; Gaio, A. Rita; Povo, Ana; Silva-Pereira, Fernanda; Ferreira, Maria Amélia – Anatomical Sciences Education, 2015

In theory the formula scoring methods increase the reliability of multiple-choice tests in comparison with number-right scoring. This study aimed to evaluate the impact of the formula scoring method in clinical anatomy multiple-choice examinations, and to compare it with that from the number-right scoring method, hoping to achieve an…

Descriptors: Anatomy, Multiple Choice Tests, Scoring, Decision Making

Developing Assessment Policy and Evaluating Practice: A Case Study of the Introduction of A New Marking Scheme

Peer reviewed

Direct link

Handley, Fiona J. L.; Read, Ann – Perspectives: Policy and Practice in Higher Education, 2017

In 2011, Southampton Solent University, a post-1992 university in southern England, introduced a new marking scheme with the aims of changing marking practice to achieve greater transparency and consistency in marking, and to ensure that the full range of marks was being awarded to students. This paper discusses the strategic background to the…

Descriptors: Case Studies, Grading, Strategic Planning, Evaluation Methods

Evaluation in Moves: An Integrated Analysis of Chinese MA Thesis Literature Reviews

Peer reviewed
PDF on ERIC

Download full text

Xie, Jianping – English Language Teaching, 2017

The ultimate communicative purpose of literature reviews is to convince the reader of the worthiness of the writer's research, which is realized stage by stage and evaluation plays an important role in achieving this end. However, concerns about evaluation demonstration in novice academic writers' literature reviews have been repeatedly voiced in…

Descriptors: Literature Reviews, Masters Theses, English (Second Language), College Second Language Programs

A Generalizable Framework for Multi-Scale Auditing of Digital Learning Provision in Higher Education

Peer reviewed
PDF on ERIC

Download full text

Ross, Samuel R. P-J.; Volz, Veronica; Lancaster, Matthew K.; Divan, Aysha – Online Learning, 2018

It is increasingly important that higher education institutions be able to audit and evaluate the scope and efficacy of their digital learning resources across various scales. To date there has been little effort to address this need for a validated, appropriate, and simple-to-execute method that will facilitate such an audit, whether it be at the…

Descriptors: Higher Education, Audits (Verification), Electronic Learning, Educational Resources

Research and Teaching: Correcting Missed Exam Questions as a Learning Tool in a Physiology Course

Peer reviewed

Direct link

Rozell, Timothy G.; Johnson, Jessica; Sexten, Andrea; Rhodes, Ashley E. – Journal of College Science Teaching, 2017

Students in a junior- and senior-level Anatomy and Physiology course have the opportunity to correct missed exam questions ("regrade") and earn up to half of the original points missed. The three objectives of this study were to determine if: (a) performance on the regrade assignment was correlated with scores on subsequent exams, (b)…

Descriptors: Physiology, Scores, Grades (Scholastic), Exit Examinations

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 19

Educational and Psychological…	12
Journal of Educational…	8
ETS Research Report Series	7
Journal of Experimental…	7
Perceptual and Motor Skills	6
Applied Psychological…	4
Journal of School Psychology	4
English Language Teaching	3
International Education…	3
Journal of Clinical Psychology	3
Advances in Health Sciences…	2
American Journal of Mental…	2
Anatomical Sciences Education	2
Applied Measurement in…	2
British Journal of…	2
College & Research Libraries	2
Educational Assessment	2
Evaluation and the Health…	2
Journal of Computer-Based…	2
Journal of Educational…	2
Learning Disability Quarterly	2
Online Submission	2
Psychology in the Schools	2
Psychometrika	2
Research Quarterly	2
More ▼

Weiss, David J.	9
Frary, Robert B.	7
Wilcox, Rand R.	6
Plake, Barbara S.	4
Angoff, William H.	3
Cross, Lawrence H.	3
Huynh, Huynh	3
Livingston, Samuel A.	3
Schrader, William B.	3
Albanese, Mark A.	2
Bliss, Leonard B.	2
Bruno, James E.	2
Clarke, Stephen R.	2
Donlon, Thomas F.	2
Dorans, Neil J.	2
Hocevar, Dennis	2
Hutchinson, T. P.	2
Kingston, Neal M.	2
Larkin, Kevin C.	2
Lowry, Stephen R.	2
Melican, Gerald J.	2
Powell, J. C.	2
Stricker, Lawrence J.	2
Vale, C. David	2
More ▼

Wechsler Intelligence Scale…	7
SAT (College Admission Test)	5
Graduate Record Examinations	4
Test of English as a Foreign…	3
Bender Gestalt Test	2
California Achievement Tests	2
College Board Achievement…	2
Comprehensive Tests of Basic…	2
Graduate Management Admission…	2
Group Embedded Figures Test	2
Iowa Tests of Basic Skills	2
Matching Familiar Figures Test	2
Adaptive Behavior Scale	1
Advanced Placement…	1
Armed Services Vocational…	1
Bender Visual Motor Gestalt…	1
British Ability Scales	1
California Psychological…	1
Childrens Manifest Anxiety…	1
College Level Examination…	1
Goodenough Harris Drawing Test	1
International English…	1
Kaufman Test of Educational…	1
Metropolitan Achievement Tests	1
Minnesota Multiphasic…	1
More ▼