Peer reviewed. Galbraith, Michael W.; Cohen, Norman H. – Michigan Community College Journal: Research & Practice, 1997
Describes the Principles of Adult Mentoring Scale, a self-assessment instrument for mentors of adult learners. Discusses the development and underlying principles of the scale, and provides information on scoring and interpreting results. The scale, instructions for scoring, and a description of six behavioral functions of the mentor role are…
Descriptors: Adult Education, Adult Students, Interpersonal Competence, Mentors
In-Basket Assessment by Fully Objective Methods: Development and Evaluation of a Self-Report System.
Peer reviewed. Hakstian, A. Ralph; Scratchley, Linda S. – Educational and Psychological Measurement, 1997
The feasibility and efficacy of using self-report response methods with an In-Basket exercise were evaluated in two studies involving 258 managers and 55 college students, respectively. Results suggest that high face validity of the In-Basket exercise can be combined with the scoring ease and objectivity of self-reports. (SLD)
Descriptors: Administrators, College Students, Evaluation Methods, Higher Education
Peer reviewed. Moon, Tonya R.; Hughes, Kevin R. – Educational Measurement: Issues and Practice, 2002
Examined a scoring anomaly that became apparent in a state-mandated writing assessment. Results for 3,660 essays by sixth graders show that using a spiral model for training raters and scoring papers results in higher mean ratings than does using a sequential model for training and scoring. Findings demonstrate the importance of making decisions…
Descriptors: Elementary School Students, Essay Tests, Intermediate Grades, Scoring
Peer reviewed. Hamilton, J. S.; McLone, R. R. – Studies in Educational Evaluation, 1989
Influences on the educational validity of examinations are reviewed. Changes occurring in approaches to standard setting are traced. A view of reliability is presented, with emphasis on assessment of project work, which often involves individual investigation and design by students. A consistency index formula for grading standards is presented.…
Descriptors: Cutting Scores, Educational Assessment, Elementary Secondary Education, Standard Setting (Scoring)
Peer reviewed. Thissen, David; And Others – Journal of Educational Measurement, 1989
An approach to scoring reading comprehension based on the concept of the testlet is described, using models developed for items in multiple categories. The model is illustrated using data from 3,866 examinees. Application of testlet scoring to multiple category models developed for individual items is discussed. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Response Theory, Mathematical Models
Peer reviewed. Norcini, John J.; And Others – Evaluation and the Health Professions, 1990
Aggregate scoring was applied to a recertifying examination for medical professionals to generate an answer key and allow comparison of peer examinees. Results for 1,927 candidates for recertification indicate considerable agreement between the traditional answer key and the aggregate answer key. (TJH)
Descriptors: Answer Keys, Criterion Referenced Tests, Error of Measurement, Generalizability Theory
Peer reviewed. Cahan, Sorel; Cohen, Nora – Educational and Psychological Measurement, 1990
A solution is offered to problems associated with the inequality in the manipulability of probabilities of classification errors of masters versus nonmasters, based on competency test results. Eschewing the typical arbitrary establishment of observed-score standards below 100 percent, the solution incorporates a self-correction of wrong answers.…
Descriptors: Classification, Error of Measurement, Mastery Tests, Minimum Competency Testing
Peer reviewed. Woodrow, Janice E. J. – Computers and Education, 1989
Describes the design and operation of four software packages, or macros, written in the programming language of Microsoft's EXCEL for use on the Macintosh computer for data manipulation and presentation in educational research. Reordering tabulated data, reversing the scoring of tabulated data, and creating tables and graphs are explained.…
Descriptors: Computer Graphics, Computer Software, Data Processing, Educational Research
Peer reviewed. Lunz, Mary E.; And Others – Applied Measurement in Education, 1990
An extension of the Rasch model is used to obtain objective measurements for examinations graded by judges. The model calibrates elements of each facet of the examination on a common log-linear scale. Real examination data illustrate the way correcting for judge severity improves fairness of examinee measures. (SLD)
Descriptors: Certification, Difficulty Level, Interrater Reliability, Judges
Peer reviewed. Lund, Thorleif – Scandinavian Journal of Educational Research, 1995
Four general criteria are proposed for the choice of a metrical solution for a causal effect: (1) compatibility with the effect; (2) ease of communication; (3) lack of measurement error bias; and (4) stability across subjects and situations. These criteria are illustrated for randomized and nonrandomized designs. (SLD)
Descriptors: Causal Models, Communication (Thought Transfer), Criteria, Error of Measurement
Peer reviewed. Thissen, David; And Others – Applied Psychological Measurement, 1995
Methods are described, based on item response theory, that provide scaled scores, or estimates of trait level, for each summed score for rated responses or for combinations of rated responses and multiple-choice items. These useful methods avoid problems associated with response-pattern scoring. (SLD)
Descriptors: Constructed Response, Estimation (Mathematics), Item Response Theory, Multiple Choice Tests
Peer reviewed. Guion, Robert M. – Educational Measurement: Issues and Practice, 1995
This commentary discusses three essential themes in performance assessment and its scoring. First, scores should mean something. Second, performance scores should permit fair and meaningful comparisons. Third, validity-reducing errors should be minimal. Increased attention to performance assessment may overcome these problems. (SLD)
Descriptors: Educational Assessment, Performance Based Assessment, Scores, Scoring
Peer reviewed. Lunz, Mary E.; And Others – Educational and Psychological Measurement, 1994
In a study involving eight judges, analysis with the FACETS model provides evidence that judges grade differently, whether or not scores correlate well. This outcome suggests that adjustments for differences among judges should be made before student measures are estimated to produce reproducible decisions. (SLD)
Descriptors: Correlation, Decision Making, Evaluation Methods, Evaluators
Peer reviewed. Wessinger, Nancy Peoples – Quest, 1994
Discusses the meaning of scoring in children's games. Fourth graders suggest that feeling good in the gym results from scoring or helping to win, even when nobody keeps score. The article explores the meaning of scoring using the text, "Meaning in Movement, Sport and Physical Education," noting implications for teachers. (SM)
Descriptors: Achievement, Childrens Games, Competition, Elementary Education
Peer reviewed. Simpson, Robert G.; Halpin, Gerald – Educational and Psychological Measurement, 1995
The Passage Comprehension Test of the Woodcock-Johnson Psycho-Educational Battery--Revised was administered to 77 elementary and middle school students and scored according to ceiling criteria in the manual. Relaxing the ceiling criteria reduced the number of items needed to establish a ceiling without negatively affecting psychometric properties.…
Descriptors: Criteria, Elementary Education, Elementary School Students, Middle School Students


