Showing 1 to 15 of 23 results
Peer reviewed
Sophie Litschwartz – Society for Research on Educational Effectiveness, 2021
Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who analyze these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regents test score…
Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas
Peer reviewed
Yun, Young Ho; Kim, Yaeji; Sim, Jin A.; Choi, Soo Hyuk; Lim, Cheolil; Kang, Joon-ho – Journal of School Health, 2018
Background: The objective of this study was to develop the School Health Score Card (SHSC) and validate its psychometric properties. Methods: The development of the SHSC questionnaire included 3 phases: item generation, construction of domains and items, and field testing with validation. To assess the instrument's reliability and validity, we…
Descriptors: School Health Services, Psychometrics, Test Construction, Test Validity
Peer reviewed
Severo, Milton; Gaio, A. Rita; Povo, Ana; Silva-Pereira, Fernanda; Ferreira, Maria Amélia – Anatomical Sciences Education, 2015
In theory the formula scoring methods increase the reliability of multiple-choice tests in comparison with number-right scoring. This study aimed to evaluate the impact of the formula scoring method in clinical anatomy multiple-choice examinations, and to compare it with that from the number-right scoring method, hoping to achieve an…
Descriptors: Anatomy, Multiple Choice Tests, Scoring, Decision Making
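The formula-scoring correction contrasted with number-right scoring in the entry above can be sketched as follows (a minimal illustration of the classic correction for guessing, not the exact procedure used in the study; the function names are mine):

```python
def formula_score(num_right: int, num_wrong: int, options_per_item: int) -> float:
    """Classic correction for guessing: R - W/(k - 1), where omitted
    items count as neither right nor wrong."""
    return num_right - num_wrong / (options_per_item - 1)

def number_right_score(num_right: int) -> int:
    """The uncorrected alternative: simply count correct answers."""
    return num_right

# Example: 60 right, 20 wrong, 20 omitted on a 100-item, 5-option test.
print(formula_score(60, 20, 5))   # 55.0
print(number_right_score(60))     # 60
```

Under this rule, an examinee who guesses blindly among all k options gains nothing in expectation, which is the usual argument for its higher reliability.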
Peer reviewed
Ahmed, Ayesha; Pollitt, Alastair – Assessment in Education: Principles, Policy & Practice, 2011
At the heart of most assessments lies a set of questions, and those who write them must achieve "two" things. Not only must they ensure that each question elicits the kind of performance that shows how "good" pupils are at the subject, but they must also ensure that each mark scheme gives more marks to those who are…
Descriptors: Academic Achievement, Classification, Educational Quality, Quality Assurance
Livingston, Samuel A.; Kastrinos, William – 1982
Leo Nedelsky developed a method for determining absolute grading standards for multiple choice tests. His method required a group of judges to examine each test question and eliminate those responses which the lowest D- student should be able to reject as incorrect. The correct answer probabilities remaining were used in computing an expected test…
Descriptors: Cutting Scores, Judges, Multiple Choice Tests, Real Estate
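Once Nedelsky's judges have recorded, for each item, how many options a borderline examinee cannot eliminate, the passing standard is the expected score of an examinee who guesses at random among the surviving options. A hedged sketch (the function name is mine):

```python
def nedelsky_cutoff(remaining_options_per_item):
    """Expected score of a borderline examinee who guesses at random
    among the options the judges did not eliminate for each item."""
    return sum(1.0 / r for r in remaining_options_per_item)

# Example: on a 4-item test the judges leave 2, 2, 3, and 4 plausible
# options, so the expected borderline score is 1/2 + 1/2 + 1/3 + 1/4.
cutoff = nedelsky_cutoff([2, 2, 3, 4])
print(round(cutoff, 3))  # 1.583
```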
Frary, Robert B. – 1980
Ordinal response modes for multiple choice tests are those under which the examinee marks one or more choices in an effort to identify the correct choice, or to include it in a proper subset of the choices. Two ordinal response modes, answer-until-correct and Coombs' elimination of choices which examinees identify as wrong, were analyzed for scoring…
Descriptors: Guessing (Tests), Multiple Choice Tests, Responses, Scoring
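One simple linear rule for the answer-until-correct mode described above awards full credit for a first-try success and one point less for each additional attempt (an illustrative assumption for concreteness, not necessarily the scoring Frary analyzed):

```python
def answer_until_correct_score(attempts: int, num_options: int) -> int:
    """Award num_options - attempts points: k - 1 for a first-try
    success, down to 0 when every option had to be tried."""
    return num_options - attempts

# Example: a 4-option item answered correctly on the second attempt.
print(answer_until_correct_score(2, 4))  # 2
```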
Peer reviewed
McGarvey, Bill; And Others – Applied Psychological Measurement, 1977
The most consistently used scoring system for the rod-and-frame task has been the total number of degrees in error from the true vertical. Since a logical case can be made for at least four alternative scoring systems, a thorough comparison of all five systems was performed. (Author/CTM)
Descriptors: Analysis of Variance, Cognitive Style, Cognitive Tests, Elementary Education
Peer reviewed
Diamond, James J. – Journal of Educational Measurement, 1975
Investigates the reliability and validity of scores yielded from a new scoring formula. (Author/DEP)
Descriptors: Guessing (Tests), Multiple Choice Tests, Objective Tests, Scoring
Peer reviewed
Poizner, Sharon B.; And Others – Applied Psychological Measurement, 1978
Binary, probability, and ordinal scoring procedures for multiple-choice items were examined. In two situations, it was found that both the probability and ordinal scoring systems were more reliable than the binary scoring method. (Author/CTM)
Descriptors: Confidence Testing, Guessing (Tests), Higher Education, Multiple Choice Tests
Peer reviewed
Birenbaum, Menucha; Tatsuoka, Kikumi K. – Journal of Educational Measurement, 1983
The outcomes of two scoring methods (one based on an error analysis and the second on a conventional method) on free-response tests, compared in terms of reliability and dimensionality, indicate that the conventional method is inferior in both respects. (Author/PN)
Descriptors: Achievement Tests, Algorithms, Data, Junior High Schools
Hambleton, Ronald K.; Novick, Melvin R. – 1972
In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. Since criterion-referenced testing is viewed from a decision-theoretic point of view, approaches to reliability and validity…
Descriptors: Criterion Referenced Tests, Measurement Instruments, Measurement Techniques, Scaling
Wallace, Gaylen R. – 1988
The Rosenberg Self-Esteem Inventory (RSE) is a 10-item scale purporting to measure self-esteem using self-acceptance and self-worth statements. This analysis covers concerns about the degree to which the RSE items represent a particular content universe, the RSE's applicability, factor analytic methods used, and the RSE's reliability and validity.…
Descriptors: Adults, College Students, High School Students, High Schools
Sabers, Darrell L.; White, Gordon W. – 1971
A procedure for scoring multiple-choice tests by assigning different weights to every option of a test item is investigated. The weighting method used was based on that proposed by Davis, which involves taking the upper and lower 27% of a sample, according to some criterion measure, and using the percentages of these groups marking an item option…
Descriptors: Computer Oriented Programs, Item Analysis, Measurement Techniques, Multiple Choice Tests
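A simplified version of the upper/lower-group option weighting described above assigns each option a weight proportional to the difference in endorsement rates between the two 27% criterion groups, so options favored by high scorers receive positive weight (a sketch only; Davis's published weights come from tabled values, and the function below is my own illustration):

```python
def option_weights(upper_props, lower_props):
    """Weight each option by (proportion of upper-27% group choosing
    it) minus (proportion of lower-27% group choosing it)."""
    return [u - l for u, l in zip(upper_props, lower_props)]

# Example: a 4-option item whose first option is the keyed answer.
weights = option_weights([0.70, 0.10, 0.10, 0.10],
                         [0.30, 0.30, 0.20, 0.20])
print([round(w, 2) for w in weights])  # [0.4, -0.2, -0.1, -0.1]
```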
Barter, Alice K.; And Others – 1980
A follow-up study of two instruments for evaluating college writing was conducted. The experimental scale (E Scale) was developed in 1976 and revised for this study. The control scale (C Scale) was described in the literature in 1977. Ten English majors graded ten essays from diagnostic entrance exams. Both the E Scale and the C Scale were used,…
Descriptors: College Entrance Examinations, Comparative Testing, Essay Tests, Evaluation Criteria
Larkin, Kevin C.; Weiss, David J. – 1975
A 15-stage pyramidal test and a 40-item two-stage test were constructed and administered by computer to 111 college undergraduates. The two-stage test was found to utilize a smaller proportion of its potential score range than the pyramidal test. Score distributions for both tests were positively skewed but not significantly different from the…
Descriptors: Ability, Aptitude Tests, Comparative Analysis, Computer Programs