Publication Date
| In 2026 | 0 |
| Since 2025 | 186 |
| Since 2022 (last 5 years) | 1065 |
| Since 2017 (last 10 years) | 2887 |
| Since 2007 (last 20 years) | 6172 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Teachers | 480 |
| Practitioners | 358 |
| Researchers | 152 |
| Administrators | 122 |
| Policymakers | 51 |
| Students | 44 |
| Parents | 32 |
| Counselors | 25 |
| Community | 15 |
| Media Staff | 5 |
| Support Staff | 3 |
| More ▼ | |
Location
| Australia | 183 |
| Turkey | 157 |
| California | 133 |
| Canada | 124 |
| New York | 118 |
| United States | 112 |
| Florida | 107 |
| China | 103 |
| Texas | 72 |
| United Kingdom | 72 |
| Japan | 70 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 5 |
| Meets WWC Standards with or without Reservations | 11 |
| Does not meet standards | 8 |
Plake, Barbara S.; Hambleton, Ronald K. – 1998
This paper reports on a standard-setting method designed for complex performance assessments with multiple performance categories. The method studied, the Analytical Judgment Method, involves panelists' making analytical classification decisions for each of the test's components individually. It also allows for discussion and reconsideration of…
Descriptors: Classification, Data Analysis, Grade 8, Junior High School Students
PDF pending restorationAnderson, Paul S.; Hyers, Albert D. – 1991
Three descriptive statistics (difficulty, discrimination, and reliability) of multiple-choice (MC) test items were compared to those of a new (1980s) format of machine-scored questions. The new method, answer-bank multi-digit testing (MDT), uses alphabetized lists of up to 1,000 alternatives and approximates the completion style of assessment…
Descriptors: College Students, Comparative Testing, Computer Assisted Testing, Correlation
Tatsuoka, Kikumi K. – 1991
Constructed-response formats are desired for measuring complex and dynamic response processes that require the examinee to understand the structures of problems and micro-level cognitive tasks. These micro-level tasks and their organized structures are usually unobservable. This study shows that elementary graph theory is useful for organizing…
Descriptors: Adult Literacy, Cognitive Measurement, Cognitive Processes, Constructed Response
Berkay, Paul; And Others – 1994
Instructions for the administration of the Opinions about Deaf People Scale are given. This scale measures the beliefs of hearing adults about the capabilities of deaf adults. The somewhat ambiguous title of the instrument is designed to avoid leading respondents to respond in socially desirable ways. The instrument is based on misconceptions…
Descriptors: Adults, Attitude Measures, Attitudes, Beliefs
North, Brian – 1993
Theoretical issues underlying the development of scales of language proficiency are examined. First, a brief classification of scale types is presented, and problems identified in them and advantages they offer are outlined. Discussion then moves to the issues of describing and measuring language proficiency. In this section, behaviorally-based…
Descriptors: Classification, Communicative Competence (Languages), Evaluation Criteria, Item Response Theory
Chang, Lei; And Others – 1994
The present study examines the influence of judges' item-related knowledge on setting standards for competency tests. Seventeen judges from different professions took a 122-item teacher-certification test in economics while setting competency standards for the test using the Angoff procedure. Judges tended to set higher standards for items they…
Descriptors: Economics, Evaluators, Experience, Interrater Reliability
International Personnel Management Association, Washington, DC. – 1986
The International Association of Personnel Management Assessment Council (IPMAAC) is a section of the International Association of Personnel Management devoted to individuals involved in professional level public personnel assessment. Author-generated summaries/outlines of papers presented at the IPMAAC's 1986 conference are provided. The…
Descriptors: Assessment Centers (Personnel), Evaluation Methods, Job Analysis, Job Performance
Crews, William E., Jr. – 1991
As part of a study of teacher evaluation of student replies to open-ended questions, a second question--the best method of determining interrater reliability--was examined. The standard method, the Pearson Product-Moment correlation, overestimated the degree of match between researchers' and teachers' scoring of tests. The simpler percent…
Descriptors: Comparative Analysis, Elementary School Teachers, Evaluation Methods, Evaluators
Orange County Academic Decathalon Association, CA. – 1983
Orange County (California) students in grades 9 and 10 compete in an annually held series of 10 competitive events measuring academic strengths. These events include tests in grammar and literature, fine arts, mathematics, science, social science, study skills, and a super quiz--a team event held before a large audience. In addition, there are…
Descriptors: Academic Standards, Competitive Selection, Evaluation Criteria, Grade 10
Aghbar, Ali-Asghar – WATESOL Working Papers, 1983
An adaptation of an impressionistic scoring method developed for use on student placement in an English as a second language (ESL) program is described and a correlational study of its reliability is presented. The method was chosen because of its efficiency and apparent reliability. The original emphasis of organization, length, and content of…
Descriptors: English (Second Language), Evaluation Criteria, Grammar, Holistic Evaluation
Littlefield, John H.; And Others – 1983
Observational ratings of student clinical performance are influenced by factors other than the quality of the performance. Individual raters may be more stringent or lenient than their colleagues. In this medical school setting, multiple raters evaluated each student. To reduce the influence of "error" due to differences among raters, each rater…
Descriptors: Bias, Error of Measurement, Higher Education, Interrater Reliability
Allen, Russell H.; Kaufman, B. Darwin – 1983
This bulletin has three purposes: (1) to discuss the variety of factors which should be considered in the development of a local testing program, providing a basic understanding of what is necessary to implement and maintain a testing program; (2) to suggest several of the significant elements related to the development and maintenance of a…
Descriptors: Accountability, Elementary Secondary Education, Minimum Competency Testing, Program Implementation
Strouss, Sara J. – 1987
This paper describes the scoring of the upper levels of the Career Ladder system for teachers in the Tennessee Career Ladder Evaluation Systems. The instruments used to evaluate teachers include classroom observation, dialogue with the teacher, a peer questionnaire, student questionnaires, a professional skills test, professional development and…
Descriptors: Career Ladders, Elementary Secondary Education, Evaluation Criteria, Evaluation Methods
Lord, Frederic M. – 1980
The purpose of this book is to make it possible for measurement specialists to solve practical testing problems through the use of item response theory (IRT). The topics, organization, and presentation are those used in a 4-week seminar held each summer for the past several years. The material is organized to facilitate understanding; all related…
Descriptors: Adaptive Testing, Estimation (Mathematics), Evaluation Problems, Item Analysis
Crocker, Linda; Algina, James – 1986
This text was written to help the reader acquire a base of knowledge about classical psychometrics and to integrate new ideas into that framework of knowledge. The material is organized into five units: (1) introduction to measurement theory; (2) reliability; (3) validity; (4) item analysis in test development; and (5) test scoring and…
Descriptors: Item Analysis, Measurement Techniques, Psychometrics, Scoring


