Publication Date
| In 2026 | 0 |
| Since 2025 | 10 |
| Since 2022 (last 5 years) | 40 |
| Since 2017 (last 10 years) | 104 |
| Since 2007 (last 20 years) | 912 |
Descriptor
Source
Author
| Thurlow, Martha | 22 |
| Popham, W. James | 17 |
| Baker, Eva L. | 14 |
| Shipman, Virginia C. | 13 |
| Sinharay, Sandip | 13 |
| Ebel, Robert L. | 12 |
| Haney, Walt | 11 |
| Herman, Joan L. | 10 |
| Mislevy, Robert J. | 10 |
| Hartley, Nancy K. | 8 |
| Koretz, Daniel | 8 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 291 |
| Teachers | 138 |
| Researchers | 79 |
| Administrators | 78 |
| Policymakers | 67 |
| Students | 20 |
| Parents | 19 |
| Counselors | 9 |
| Community | 6 |
| Media Staff | 1 |
| Support Staff | 1 |
| More ▼ | |
Location
| California | 102 |
| Canada | 82 |
| Florida | 54 |
| Australia | 52 |
| United Kingdom | 51 |
| United Kingdom (England) | 50 |
| United States | 49 |
| New York | 47 |
| Texas | 42 |
| United Kingdom (Great Britain) | 28 |
| New Jersey | 27 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Pool, Carolyn; Bracey, Gerald W. – Instructor, 1992
Authentic assessment encompasses methods of evaluating what students know and can do, acknowledging that children learn best when they are actively involved and receive feedback. Criteria for judging authentic assessments include determining their consequences and noting whether they are cost-effective, meaningful, fair, and generalizable, with…
Descriptors: Educational Testing, Elementary Education, Evaluation Methods, Holistic Evaluation
Peer reviewedSnowman, Jack – Mid-Western Educational Researcher, 1993
A review of five recent studies concludes that on multiple-choice tests, changing uncertain answers improves results; testing plus feedback produces more learning than additional study time; students learn and retain more when they are tested more often; and question and completion formats are equally acceptable for multiple-choice items. (KS)
Descriptors: Academic Achievement, Educational Testing, Elementary Secondary Education, Test Construction
Peer reviewedTerwilliger, James S. – Educational Researcher, 1998
Responds to letters to the editor regarding comments on semantics, psychometrics, and assessment reform regarding authentic assessment of student academic achievements. Suggests that psychometric concerns need to be addressed. (MMU)
Descriptors: Academic Achievement, Educational Testing, Elementary Secondary Education, Performance Based Assessment
Peer reviewedMcFate, Craig; Olmsted, John III – Journal of Chemical Education, 1999
Describes the development and evaluation of a placement test used as a major determinant of eligibility to enroll in first-semester general chemistry at California State University, Fullerton. Contains 17 references. (WRM)
Descriptors: Chemistry, Educational Testing, Higher Education, Prior Learning
Peer reviewedWang, Chih-yen – Performance Improvement, 2000
Provides guidelines that trainers can use to make fair judgments in grading subjective essay examinations. Highlights include establishing criteria for scoring; clarifying lesson objectives; dividing each question into smaller components with weighted points for each section; developing grading guidelines; and the increasing use of computers to…
Descriptors: Computer Uses in Education, Educational Testing, Essay Tests, Evaluation Criteria
Peer reviewedTraub, Ross E. – Educational Measurement: Issues and Practice, 1997
Classical test theory is founded on the proposition that measurement error, a random latent variable, is a component of the observed score random variable. This article traces the history of the development of classical test theory, beginning in the early 20th century. (SLD)
Descriptors: Educational History, Educational Testing, Error of Measurement, Psychometrics
Peer reviewedSireci, Stephen – Applied Psychological Measurement, 2000
This collection of essays by measurement specialists addresses a variety of important issues in educational and psychological testing. Although not all of the "rules" are new, many topics of contemporary interest are discussed, some in detail that exceeds what the psychologist and educator really need to know. (SLD)
Descriptors: Educational Testing, Measurement Techniques, Psychological Testing, Psychology
Peer reviewedInternational Journal of Testing, 2001
Contains guidelines that provide an international view of areas of consensus about what constitutes "good practice" in test use. Guidelines address key competencies, such as knowledge and skills, and issues of professional and ethical standards in testing, the rights of test takers, test administration and scoring, and other issues. (SLD)
Descriptors: Competence, Educational Practices, Educational Testing, Guidelines
Peer reviewedShalom, Stephen R. – Journal of Blacks in Higher Education, 1999
"America in Black and White: One Nation Indivisible: Race in Modern America," by Stephan and Abigail Thernstrom, has become important in the movement to abolish affirmative action in higher education. Suggests the book seriously misinterprets and misuses evidence to make its case, refuting specific results the book cites related to…
Descriptors: Affirmative Action, Black Students, College Admission, Educational Testing
Popham, James W. – Educational Leadership, 2006
Government agencies administer exams to appraise educators' effectiveness. However, most teachers and administrators are unfamiliar with how such large-scale tests are put together or polished. A profession's adequacy is being judged on the basis of tools that the profession's members don't understand. As such, educators need to have a dose of…
Descriptors: Teacher Effectiveness, Educational Testing, Evaluation Criteria, Test Validity
Popham, W. James – Educational Leadership, 2006
What people mean when they use the phrase "content standard" varies all over the lot. In some states, content standards are little more than category labels describing collections of curricular aims in particular content areas. If a state's content standards are too numerous, then teachers do not know where to aim their instructional efforts. This…
Descriptors: Academic Standards, State Standards, Curriculum Development, Accountability
Wise, Lauress L. – Educational Measurement: Issues and Practice, 2006
Uses and consequences of educational testing have increased dramatically in recent years. Professional standards to ensure fair treatment of all affected by test results are more important than ever, but standards for developing and using educational tests are only helpful if they are followed. Test developers and users each have a role to play in…
Descriptors: Educational Testing, Standards, Accountability, Cooperation
Peer reviewedPopham, W. James – Educational Leadership, 2004
The importance of educational accountability and assessment literacy is recognized as a long-term challenge to the educational system. The complexities in becoming an assessment literate and provide an opportunity to the educators to display their effective learning is discussed.
Descriptors: Accountability, Student Evaluation, Evaluation Methods, Educational Testing
Karantonis, Ana; Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2006
The Bookmark method for setting standards on educational tests is currently one of the most popular standard-setting methods. However, research to support the method is scarce. In this report, we review the published and unpublished literature on this method as well as some seminal work in the area of evaluating standard-setting studies. Our…
Descriptors: Academic Standards, Educational Testing, Literature Reviews, Validity
Wise, Vicki L.; Wise, Steven L.; Bhola, Dennison S. – Educational Assessment, 2006
Accountability for educational quality is a priority at all levels of education. Low-stakes testing is one way to measure the quality of education that students receive and make inferences about what students know and can do. Aggregate test scores from low-stakes testing programs are suspect, however, to the degree that these scores are influenced…
Descriptors: Motivation, Scores, Test Validity, Accountability

Direct link
