Publication Date
| In 2026 | 0 |
| Since 2025 | 186 |
| Since 2022 (last 5 years) | 1065 |
| Since 2017 (last 10 years) | 2887 |
| Since 2007 (last 20 years) | 6172 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Teachers | 480 |
| Practitioners | 358 |
| Researchers | 152 |
| Administrators | 122 |
| Policymakers | 51 |
| Students | 44 |
| Parents | 32 |
| Counselors | 25 |
| Community | 15 |
| Media Staff | 5 |
| Support Staff | 3 |
| More ▼ | |
Location
| Australia | 183 |
| Turkey | 157 |
| California | 133 |
| Canada | 124 |
| New York | 118 |
| United States | 112 |
| Florida | 107 |
| China | 103 |
| Texas | 72 |
| United Kingdom | 72 |
| Japan | 70 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 5 |
| Meets WWC Standards with or without Reservations | 11 |
| Does not meet standards | 8 |
Cizek, Gregory J.; Fitzgerald, Shawn M. – 1996
A group-process approach to standard setting was compared to an independent approach for a medical specialty certification examination. Both approaches used the Angoff (1971) standard-setting method. In the group-process method, reviewers discussed items and their ratings during the rating process; in the independent condition, reviewers provided…
Descriptors: Comparative Analysis, Cost Effectiveness, Group Dynamics, Judges
Taylor, Catherine S. – 1996
This study investigated the impact of task directions on the mathematical performance of high school students from six classes. Students analyzed data regarding school dropout by answering six short-answer questions and writing a letter discussing the trends and their predictions about school dropout. Tasks were scored using two methods: (1) trait…
Descriptors: Dropouts, High School Students, High Schools, Mathematics Tests
PDF pending restorationHanson, Bradley A.; And Others – 1994
This paper compares various methods of smoothed equipercentile equating and linear equating in the random groups equating design. Three presmoothing methods (based on the beta binomial model, four-parameter beta binomial model and a log-linear model) are compared to postsmoothing using cubic splines, linear equating and unsmoothed equipercentile…
Descriptors: Comparative Analysis, Equated Scores, Error of Measurement, Estimation (Mathematics)
Reckase, Mark D. – 1998
Standard setting is a fairly widespread activity in educational and psychological measurement, but there is no formal psychometric theory to guide the development of standard setting methodology. This paper presents a conceptual framework for such a psychometric theory and uses the conceptual framework to analyze a number of methods for setting…
Descriptors: Educational Assessment, Evaluation Methods, Judges, Measurement Techniques
Plasse, Lorraine A. – 1982
To determine the extent of influence that a reader's perspective as a member of a specific audience has on the assessment of student writing, a study examined the holistic judgment and the positive and negative comments made by four different types of writing evaluators on 40 different letters, each of which was written to one of four audience…
Descriptors: Audiences, Evaluation Criteria, Grade 12, High Schools
Dorans, Neil J.; Zeller, Karin – ETS Research Report Series, 2004
In the Spring 2003 issue of "Harvard Educational Review," Roy Freedle stated that the SAT® is both culturally and statistically biased, and he proposed a solution to ameliorate this bias. His claims, which garnered national attention, were based on serious errors in his analysis. We begin our analyses by assessing the psychometric…
Descriptors: Test Bias, Statistical Bias, Psychometrics, College Entrance Examinations
Wainer, Howard – 1985
Techniques derived from item response theory are useful for estimating the reliability of test classification above and below the cutting score. Test developers can construct a test whose information is peaked in the region of the cutting score; users can select a test which provides the most information in this region. The Cut-Score…
Descriptors: Cutting Scores, Item Analysis, Latent Trait Theory, Mastery Tests
Frary, Robert B.; And Others – 1985
Students in an introductory college course (n=275) responded to equivalent 20-item halves of a test under number-right and formula-scoring instructions. Formula scores of those who omitted items overaged about one point lower than their comparable (formula adjusted) scores on the test half administered under number-right instructions. In contrast,…
Descriptors: Guessing (Tests), Higher Education, Multiple Choice Tests, Questionnaires
PDF pending restorationKingston, Neal M. – 1985
This research investigated the effect on estimated lower asymptotes of the instructions to Graduate Record Examination (GRE) examinees about how the test would be scored. This effect was assessed for four different verbal item types (analogies, antonyms, sentence completion, and reading comprehension) using a two-way, unweighted means analysis of…
Descriptors: Analysis of Variance, College Entrance Examinations, Guessing (Tests), Higher Education
Arrasmith, Dean G.; Hambleton, Ronald K. – 1988
Specific steps for applying the Angoff method are described. In the Angoff method, judges are asked to estimate the probabilities of minimally competent candidates' answering multiple choice test items correctly. Initial information must be obtained for designing the standard-setting process, beginning with the purpose of the examination and any…
Descriptors: Certification, Credentials, Licensing Examinations (Professions), Minimum Competencies
Sigmon, Gary L.; Halpin, Gerald – 1984
Traditionally, judgmental standard setting methods have been used exclusively at the test item level. In this study, the Ebel and Angoff methods of standard setting were utilized to determine minimum competency standards on a list of 175 identified competency statements for vocational evaluators. The following research questions were addressed:…
Descriptors: Certification, College Faculty, Cutting Scores, Evaluation Methods
Green, Donald Ross; Yen, Wendy M. – 1983
The Comprehensive Tests of Basic Skills, Form U, is scored in two ways: number-correct and pattern. The latter makes use of the information about which particular items are answered correctly, giving more weight to the more discriminating items and making allowances for guessing. Critics have suggested that black students are penalized by pattern…
Descriptors: Basic Skills, Black Students, Elementary Education, Guessing (Tests)
PDF pending restorationSweitzer, H. Frederick; Weinstein, Gerald – 1985
Self-Knowledge Development Theory (SKDT) by Weinstein and A. Alschuler (1985) is a structural developmental theory positing four stages in the development of self-knowledge. The Experience Recall Test-2 (ERT2) is described, which is the most recent instrument developed for assessing the SKDT. Self-knowledge is defined as the ability to describe…
Descriptors: Classification, Developmental Stages, Group Testing, Individual Development
Swartz, Richard; And Others – 1985
In preparation for adding an essay test to the General Educational Development (GED) test, the GED Testing Service undertook a series of studies to establish (1) whether acceptable reading reliabilities were attainable in decentralized holistic scoring sessions often involving no more than a dozen papers; (2) whether essay readers in a variety of…
Descriptors: Essay Tests, High School Equivalency Programs, Scoring, Test Reliability
Education Commission of the States, Denver, CO. National Assessment of Educational Progress. – 1982
This publication contains some of the open-ended art exercises used by the National Assessment of Educational Progress in its 1978-79 assessment of the art ability of students ages nine through 17. The objective is to provide classroom teachers easy access to released and tested art assessment materials. The open-ended exercises required students…
Descriptors: Affective Measures, Art Appreciation, Art Education, Educational Assessment

Peer reviewed
