ACT, Inc., 2008
This handbook is intended to help high school and college counselors effectively use and interpret ACT results. It contains four main sections: Section 1, "Components of the ACT," which contains general information about the ACT tests, probably will be of interest both to high school counselors and to college advisors and other staff. …
Descriptors: High Schools, Testing Programs, College Entrance Examinations, School Counselors
Stone, Gregory Ethan; Beltyukova, Svetlana; Fox, Christine M. – International Journal of Testing, 2008
Judge-mediated examinations are defined as those for which expert evaluation (using rubrics) is required to determine correctness, completeness, and reasonability of test-taker responses. The use of multifaceted Rasch modeling has led to improvements in the reliability of scoring such examinations. The establishment of criterion-referenced…
Descriptors: Interrater Reliability, High Stakes Tests, Standard Setting, Minimum Competencies
National Academy of Sciences - National Research Council, Washington, DC. Mathematical Sciences Education Board. – 1993
The call for educational reform has caused mathematics curriculum, instruction, and assessment to come under both professional and public scrutiny. The three elements contesting for leadership in assessment reform are the conventional testing agencies, reformers led by the National Council of Teachers of Mathematics (NCTM), and political agencies…
Descriptors: Elementary School Mathematics, Evaluation Criteria, Evaluation Methods, Grade 4
Bruno, James E.; Opp, Ronald D. – 1985
The admissible probability measurement (APM) format was used to score a criterion-referenced language arts test administered in an inner-city junior high school. Its 30 items covered capitalization, punctuation, parts of speech, and sentence analysis. With APM, students indicate their confidence in their answer choice, and guessing is heavily…
Descriptors: Confidence Testing, Criterion Referenced Tests, Educational Testing, Equivalency Tests
Golub-Smith, Marna; And Others – 1993
The Test of Written English (TWE), administered with certain designated examinations of the Test of English as a Foreign Language (TOEFL), consists of a single essay prompt to which examinees have 30 minutes to respond. Questions have been raised about the comparability of different TWE prompts. This study was designed to elicit essays for prompts…
Descriptors: Charts, Comparative Analysis, English (Second Language), Essay Tests
Kump, Ann – 1992
Directions are given for scoring typing tests taken on a typewriter or on a computer using special software. The speed score (gross words per minute) is obtained by determining the total number of strokes typed, and dividing by 25. The accuracy score is obtained by comparing the examinee's test paper to the appropriate scoring key and counting the…
Descriptors: Computer Assisted Testing, Employment Qualifications, Guidelines, Job Applicants
Ferrara, F. Felicia – 1995
Cut scores, quartile ranking, sample size, and overall classification scheme were studied as personnel selection procedures in two samples. The first was 120 simulated observations of employee scores based on actual selection procedures for applicants for administrative assistant positions. The other sample was composed of test results for 73…
Descriptors: Classification, Cutting Scores, Job Applicants, Personnel Selection
Lunz, Mary E.; O'Neill, Thomas R. – 1997
This retrospective longitudinal study was designed to show grading leniency patterns of judges within and across clinical examination administrations. Data from 17 different administrations of the histology examination of the American Society of Clinical Pathologists over 10 years were studied. Over the 10 years there were 4,683 candidates and 57…
Descriptors: Higher Education, Interrater Reliability, Item Response Theory, Judges
Krippendorff, Klaus – 1992
When one wants to set data reliability standards for a class of scientific inquiries or when one needs to compare and select among many different kinds of data with reliabilities that are crucial to a particular research undertaking, then one needs a single reliability coefficient that is adaptable to all or most situations. Work toward this goal…
Descriptors: Definitions, Equations (Mathematics), Mathematical Models, Reliability
Bontempo, Brian D.; Marks, Casimer M.; Karabatsos, George – 1998
Using meta-analysis, this research takes a look at studies included in a meta-analysis by R. Jaeger (1989) that compared the cut score set by one standard setting method with that set by another. This meta-analysis looks beyond Jaeger's studies to select 10 from the research literature. Each compared at least two types of standard setting method.…
Descriptors: Comparative Analysis, Cutting Scores, Effect Size, Meta Analysis
Horgan, Dianne D.; Barnett, Loretta – 1991
Seventy-four college students participated in a peer review assignment. Subjects were asked to write a draft of a three-page paper, distribute copies to three peer reviewers, revise their papers using the resulting feedback from each of the three peer reviewers, and then prepare and submit a final paper. Reviews were scored for the quality and…
Descriptors: College Students, Feedback, Higher Education, Instructional Effectiveness
Thayer, Jerome D. – 1991
Combining student scores to form subtotals and finally a total score to determine a grade is discussed. The composite score reached by combining measures or subtotals is only valid when the scores are combined so that the actual weight of each measure or subtotal in the total score is the same as the intended weight. Three types of variability…
Descriptors: Academic Achievement, Elementary Secondary Education, Grading, Mathematical Models
Peer reviewed: Reilly, Richard R. – Educational and Psychological Measurement, 1975
Because previous reports have suggested that the lowered validity of tests scored with empirical option weights might be explained by a capitalization of the keying procedures on omitting tendencies, a procedure was devised to key options empirically with a "correction-for-guessing" constraint. (Author)
Descriptors: Achievement Tests, Graduate Study, Guessing (Tests), Scoring Formulas
Peer reviewed: Watkins, Julia M.; Watkins, Dennis A. – Journal of Clinical Psychology, 1975
This study examined the Plenk scoring system more thoroughly to determine whether it could be used with older children and whether it could differentiate normal from emotionally disturbed subjects. (Author)
Descriptors: Data Collection, Emotional Disturbances, Handicapped Children, Research Methodology
Follman, John; Panther, Edward – Child Study Journal Monographs, 1974
Examines empirically the efficacy of utilizing Olympic diving and gymnastic scoring systems for grading graduate students' English compositions. Results indicated that such scoring rules do not produce ratings different in reliability or in level from conventional letter grades. (ED)
Descriptors: English Curriculum, Evaluation Methods, Grading, Graduate Students

