NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 256 to 270 of 582 results Save | Export
Peer reviewed Peer reviewed
Garcia-Perez, Miguel A.; Frary, Robert B. – Applied Psychological Measurement, 1989
Simulation techniques were used to generate conventional test responses and track the proportion of alternatives examinees could classify independently before and after taking the test. Finite-state scores were compared with these actual values and with number-correct and formula scores. Finite-state scores proved useful. (TJH)
Descriptors: Comparative Analysis, Computer Simulation, Guessing (Tests), Mathematical Models
Hester, Yvette – 1993
Some of the different approaches to standard setting are discussed. Brief comments and references are offered concerning strategies that rely primarily on the use of expert judgment. Controversy surrounds methods that use expert judges, as well as those using test groups to set scores empirically. A minimax procedure developed by H. Huynh, an…
Descriptors: Academic Standards, Classification, Cutting Scores, Evaluation Methods
Peer reviewed Peer reviewed
Glanz, Peter K.; Brown, R. S. – Physics Teacher, 1976
States that final exams can best motivate students if the exams are counted substantially toward the final course grade. Proposes a weighting system in which a performance on the final which exceeds the student's average would be weighed more heavily than a poor performance. (CP)
Descriptors: College Science, Evaluation Methods, Higher Education, Motivation
Peer reviewed Peer reviewed
Andrews, Hans A. – Journal of Vocational Behavior, 1975
This study was designed to test and expand Holland's vocational development theory by utilizing more than a single high point code in classification of personality patterns of jobs. A more "refined" and/or "subtle" difference was shown in the personality-job relationships when two high point codes were used. (Author)
Descriptors: Career Choice, Career Development, Decision Making, Personality
Georgia State Dept. of Education, Atlanta. – 1988
Intended to familiarize persons with the scoring standards and criteria used for the 8th grade version of the Georgia Criterion-Referenced Writing Test, this scoring manual is in eight sections: (1) an introduction to the scoring dimensions and scale points; (2) definitions of the four scale points; (3) definitions of scoring dimensions and…
Descriptors: Grade 8, Junior High Schools, Scoring, Scoring Formulas
Livingston, Samuel A. – 1981
The standard error of measurement (SEM) is a measure of the inconsistency in the scores of a particular group of test-takers. It is largest for test-takers with scores ranging in the 50 percent correct bracket; with nearly perfect scores, it is smaller. On tests used to make pass/fail decisions, the test-takers' scores tend to cluster in the range…
Descriptors: Error of Measurement, Estimation (Mathematics), Mathematical Formulas, Pass Fail Grading
Smith, Lawrence L. – 1975
The six tables presented in this paper were prepared in the belief that more educators would use the Harris-Jacobson Readability Formulas if the operations employed in those formulas to determine the estimated readability level of materials could be completed more quickly and with fewer calculations. Instructions for using the tables are included.…
Descriptors: Elementary Secondary Education, Measurement Techniques, Readability Formulas, Reading Instruction
Frary, Robert B. – 1980
Ordinal response modes for multiple choice tests are those under which the examinee marks one or more choices in an effort to identify the correct choice, or include it in a proper subset of the choices. Two ordinal response modes: answer-until-correct, and Coomb's elimination of choices which examinees identify as wrong, were analyzed for scoring…
Descriptors: Guessing (Tests), Multiple Choice Tests, Responses, Scoring
Kane, Michael T.; Moloney, James M. – 1974
Gilman and Ferry have shown that when the student's score on a multiple choice test is the total number of responses necessary to get all items correct, substantial increases in reliability can occur. In contrast, similar procedures giving partial credit on multiple choice items have resulted in relatively small gains in reliability. The analysis…
Descriptors: Feedback, Guessing (Tests), Multiple Choice Tests, Response Style (Tests)
Doppelt, Jerome E. – Test Service Bulletin, 1956
The standard error of measurement as a means for estimating the margin of error that should be allowed for in test scores is discussed. The true score measures the performance that is characteristic of the person tested; the variations, plus and minus, around the true score describe a characteristic of the test. When the standard deviation is used…
Descriptors: Bulletins, Error of Measurement, Measurement Techniques, Reliability
Williams, Cynthia L. – 1973
Each test in the Divergent Production battery requires the examinee to produce a response. Since these responses must be evaluated, the factor of rater judgment influences the reliability of scores. The problem of scoring reliability is one which pervades the literature on creativity research, where either low estimates or no estimates have been…
Descriptors: Analysis of Variance, Evaluation Criteria, Examiners, Scoring Formulas
Peer reviewed Peer reviewed
Frary, Robert B.; And Others – Journal of Experimental Education, 1977
To date a theoretical basis has not been developed for determining changes in reliability when score points from random guessing are eliminated and those from non-randon guessing are retained. This paper presents a derivation of an expression for the reliability coefficient which displays the effect of deleting score components due to random…
Descriptors: Data Analysis, Guessing (Tests), Multiple Choice Tests, Scoring Formulas
Peer reviewed Peer reviewed
Orlofsky, Jacob L.; And Others – Journal of Personality Assessment, 1977
A median split and a difference/median split method were used to classify college students into masculine, feminine, androgynous and undifferentiated sex role orientations using the Bem Sex Role Inventory. The difference/ median split procedure was more successful in discriminating between sex role groups and in predicting sex role ideology. (EVH)
Descriptors: Androgyny, Attitudes, College Students, Feminism
Peer reviewed Peer reviewed
Wilcox, Rand R.; Harris, Chester W. – Journal of Educational Measurement, 1977
Emrick's proposed method for determining a mastery level cut-off score is questioned. Emrick's method is shown to be useful only in limited situations. (JKS)
Descriptors: Correlation, Cutting Scores, Mastery Tests, Mathematical Models
Peer reviewed Peer reviewed
Abu-Sayf, F. K. – Educational and Psychological Measurement, 1977
A new formula for the correction for chance success due to guessing was advanced and investigated, the mathematical solution of which has the property of equating the scores of fast and slow examinees of equal ability. (Author/JKS)
Descriptors: Academic Ability, Equated Scores, Guessing (Tests), Scoring Formulas
Pages: 1  |  ...  |  14  |  15  |  16  |  17  |  18  |  19  |  20  |  21  |  22  |  ...  |  39