Publication Date
| In 2026 | 2 |
| Since 2025 | 188 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2889 |
| Since 2007 (last 20 years) | 6174 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Teachers | 480 |
| Practitioners | 358 |
| Researchers | 152 |
| Administrators | 122 |
| Policymakers | 51 |
| Students | 44 |
| Parents | 32 |
| Counselors | 25 |
| Community | 15 |
| Media Staff | 5 |
| Support Staff | 3 |
| More ▼ | |
Location
| Australia | 183 |
| Turkey | 157 |
| California | 133 |
| Canada | 124 |
| New York | 118 |
| United States | 112 |
| Florida | 107 |
| China | 103 |
| Texas | 72 |
| United Kingdom | 72 |
| Japan | 70 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 5 |
| Meets WWC Standards with or without Reservations | 11 |
| Does not meet standards | 8 |
Wang, LihShing; Pan, Wei; Austin, James T. – 2003
Standard-setting research has yielded a rich array of more than 50 standard-setting procedures, but practitioners are likely to be confused about which to use. By synthesizing the accumulated research on standard setting and progress monitoring, this study developed a three-dimensional taxonomy for conceptualizing and operationalizing the various…
Descriptors: Accountability, Cutting Scores, Educational Research, Pass Fail Grading
Sultana, Qaisar – 2001
This study examined the reliability of scores assigned to the essays written by Kentucky students to meet the University Writing Requirement (UWR) at Eastern Kentucky University. Two sets of essays, 50 each, on the same prompt that had been read and scored in 1989 and 1997 by trained UWR scorers were read by 7 UWR scorers in 2000. A correlation…
Descriptors: College Students, Correlation, Essays, Higher Education
Jucovy, Linda – 2002
The material in this Technical Assistance Packet is intended to help mentoring programs monitor individual matches and develop a larger picture that provides a composite view of the strengths and shortcomings of all their matches. The packet contains the Youth Survey, a tool to monitor the quality of individual mentoring relationships, determine…
Descriptors: Evaluation Methods, Interpersonal Relationship, Mentors, Program Effectiveness
DeMauro, Gerald E. – 2003
An analysis was made of the cognitive processes that support the judgments made in standard setting activities. These processes were conceived as having two components: forming the domain needed to pass the test and identifying the criterion level of performance to pass the test. In fact, these processes are interactive, and were separated for the…
Descriptors: Cognitive Processes, Judges, Matrices, Performance Based Assessment
Almond, Russell; Steinberg, Linda; Mislevy, Robert – 2001
This paper describes a four-process model for the operation of a generic assessment: Activity Selection, Presentation, Response Processing (Evidence Identification), and Summary Scoring (Evidence Accumulation). It discusses the relationships between the functions and responsibilities of these processes and the objects in the Instructional…
Descriptors: Chinese, Evaluation Methods, Language Proficiency, Models
Papanastasiou, Elena C. – 2002
Due to the increased popularity of computerized adaptive testing (CAT), many admissions tests, as well as certification and licensure examinations, have been transformed from their paper-and-pencil versions to computerized adaptive versions. A major difference between paper-and-pencil tests and CAT, from an examinees point of view, is that in many…
Descriptors: Adaptive Testing, Cheating, Computer Assisted Testing, Review (Reexamination)
Fan, Xitao; Chen, Michael – 1999
It is erroneous to extend or generalize the inter-rater reliability coefficient estimated from only a (small) proportion of the sample to the rest of the sample data where only one rater is used for scoring, although such generalization is often made implicitly in practice. It is shown that if inter-rater reliability estimate from part of a sample…
Descriptors: Estimation (Mathematics), Generalizability Theory, Interrater Reliability, Sample Size
Marshall, James P.; Allen, Bradford D. – 2000
Many colleges and universities use a mathematics placement process to guide students to the appropriate entry-level mathematics course. The mathematics placement process presented here was developed over a four year period to make placement recommendations to Calculus I, Precalculus, and College Algebra. Placement recommendations are based on the…
Descriptors: Algebra, Calculus, Evaluation Methods, Higher Education
Patelis, Thanos – College Entrance Examination Board, 2000
Because different types of computerized tests exist and continue to emerge, the term "computer-based testing" does not encompass all of the various models that may exist. As a result, test delivery model (TDM) is used to describe the variety of methods that exist in delivering tests to examinees. The criterion that is used to distinguish…
Descriptors: Computer Assisted Testing, Adaptive Testing, Models, Delivery Systems
Peer reviewedWood, Robert – Review of Educational Research, 1973
A review of early and modern work concerned with response-contingent testing and a discussion of its applications, limitations, and future prospects is provided. (KM)
Descriptors: Educational Research, Individual Needs, Measurement Techniques, Scoring
Peer reviewedZimmerman, Donald W. – Educational and Psychological Measurement, 1972
Although a great deal of attention has been devoted over a period of years to the estimation of reliability from item statistics, there are still gaps in the mathematical derivation of the Kuder-Richardson results. The main purpose of this paper is to fill some of these gaps, using language consistent with modern probability theory. (Author)
Descriptors: Mathematical Applications, Probability, Scoring Formulas, Statistical Analysis
Peer reviewedWarren, Sue Allen; Brown, William G., Jr. – Psychology in the Schools, 1973
University instructors should provide more careful checks and feedback to students who are learning to do intelligence testing. It also is important for supervisors in service facilities to monitor tests that provide bases for crucial decisions about children. Intelligence test scores relate to many other variables. Improperly trained examiners…
Descriptors: Educational Research, Examiners, Intelligence Tests, Measurement
Peer reviewedEchternacht, Gary J. – Review of Educational Research, 1972
Descriptors: Educational Research, Educational Testing, Objective Tests, Psychological Testing
Entwistle, N. J.; Wilson, J. D. – Univ Quart, 1970
A questionnaire measuring four student personality types--stable introvert, unstable introvert, stable extrovert and unstable extrovert--along with the Eysenck Personality Inventory (Form A) were give to 72 graduate students at Aberdeen University and the results showed recognizable interaction between study methods, motivation and personality…
Descriptors: Academic Achievement, Behavior Theories, Higher Education, Motivation
Peer reviewedHipple, Theodore W. – English Journal, 1972
Suggestions to English teachers for evaluating students' work in English composition. (MB)
Descriptors: Evaluation Methods, Grading, Peer Groups, Scoring Formulas


