Publication Date
| In 2026 | 0 |
| Since 2025 | 72 |
| Since 2022 (last 5 years) | 332 |
| Since 2017 (last 10 years) | 657 |
| Since 2007 (last 20 years) | 1709 |
Descriptor
| Evaluation Methods | 4240 |
| Student Evaluation | 1492 |
| Testing | 1257 |
| Computer Assisted Testing | 1061 |
| Elementary Secondary Education | 722 |
| Foreign Countries | 720 |
| Educational Testing | 610 |
| Educational Assessment | 601 |
| Test Construction | 537 |
| Testing Problems | 516 |
| Higher Education | 465 |
| More ▼ | |
Source
Author
| Thurlow, Martha | 29 |
| Thurlow, Martha L. | 22 |
| Tindal, Gerald | 12 |
| Ysseldyke, James E. | 12 |
| Baker, Eva L. | 10 |
| Alonzo, Julie | 9 |
| Herman, Joan L. | 9 |
| Popham, W. James | 8 |
| Hambleton, Ronald K. | 7 |
| Jaeger, Richard M. | 7 |
| Lai, Cheng Fei | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 263 |
| Teachers | 138 |
| Researchers | 100 |
| Administrators | 67 |
| Policymakers | 36 |
| Students | 19 |
| Counselors | 11 |
| Parents | 10 |
| Community | 9 |
| Support Staff | 7 |
| Media Staff | 2 |
| More ▼ | |
Location
| United Kingdom | 85 |
| Australia | 72 |
| Canada | 68 |
| United Kingdom (England) | 44 |
| United States | 44 |
| California | 41 |
| Florida | 40 |
| Germany | 34 |
| Turkey | 31 |
| Netherlands | 29 |
| New York | 27 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Peer reviewedJoyce, John F. – Journal of Education, 1975
An analysis of the content, process, and purposes of common evaluation practices has revealed ten specific dehumanizing effects on participating students and educators. More humanistic, alternative evaluation practices have been suggested for each. (Author/BJG)
Descriptors: Change Strategies, Evaluation Methods, Human Dignity, Humanism
Peer reviewedWilliams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1984
This paper provides a list of 10 salient features of the standard error of measurement, contrasting it to the reliability coefficient. It is concluded that the standard error of measurement should be regarded as a primary characteristic of a mental test. (Author/DWH)
Descriptors: Educational Testing, Error of Measurement, Evaluation Methods, Psychological Testing
Hirshoren, Alfred; McGuigan, Corrine – B. C. Journal of Special Education, 1984
The authors provide information about test construction and testing practices in order to help parents and teachers to ask important and critical questions about them. Issues pertinent to the appropriate selection, use, and interpretation of tests are presented. (Author/CL)
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Methods, Student Evaluation
Harris, Douglas N. – Policy Analysis for California Education, PACE (NJ3), 2010
In this policy brief, the author explores the problems with attainment measures when it comes to evaluating performance at the school level, and explores the best uses of value-added measures. These value-added measures, the author writes, are useful for sorting out-of-school influences from school influences or from teacher performance, giving…
Descriptors: Principals, Observation, Teacher Evaluation, Measurement Techniques
Holling, Heinz; Bertling, Jonas P.; Zeuch, Nina – Studies in Educational Evaluation, 2009
Mathematical word problems represent a common item format for assessing student competencies. Automatic item generation (AIG) is an effective way of constructing many items with predictable difficulties, based on a set of predefined task parameters. The current study presents a framework for the automatic generation of probability word problems…
Descriptors: Word Problems (Mathematics), Probability, Automation, College Students
Moder, Carol Lynn; Halleck, Gene B. – Australian Review of Applied Linguistics, 2009
This study investigates the variation in oral proficiency demonstrated by 14 Air Traffic Controllers across two types of testing tasks: work-related radio telephony-based tasks and non-specific English tasks on aviation topics. Their performance was compared statistically in terms of level ratings on the International Civil Aviation Organization…
Descriptors: Testing, English (Second Language), Second Language Learning, Air Transportation
American Psychologist, 2009
Robert E. Ployhart, recipient of the Award for Distinguished Scientific Early Career Contributions to Psychology, is cited for innovative work in examining reactions to staffing practices and efforts to enhance the acceptability of recruitment and staffing practices; for exemplary use of applied statistical models in examining multilevel effects…
Descriptors: Recognition (Achievement), Personnel Selection, Psychology, Profiles
An Auto-Scoring Mechanism for Evaluating Problem-Solving Ability in a Web-Based Learning Environment
Chiou, Chuang-Kai; Hwang, Gwo-Jen; Tseng, Judy C. R. – Computers & Education, 2009
The rapid development of computer and network technologies has attracted researchers to investigate strategies for and the effects of applying information technologies in learning activities; simultaneously, learning environments have been developed to record the learning portfolios of students seeking web information for problem-solving. Although…
Descriptors: Learning Strategies, Problem Solving, Scoring, Student Evaluation
Arce-Ferrer, Alvaro J.; Guzman, Elvira Martinez – Educational and Psychological Measurement, 2009
This study investigates the effect of mode of administration of the Raven Standard Progressive Matrices test on distribution, accuracy, and meaning of raw scores. A random sample of high school students take counterbalanced paper-and-pencil and computer-based administrations of the test and answer a questionnaire surveying preferences for…
Descriptors: Factor Analysis, Raw Scores, Statistical Analysis, Computer Assisted Testing
Hutchings, Pat – Change: The Magazine of Higher Learning, 2009
Motivated in large part by accreditation pressures, campuses are turning to new providers for assistance with a wide range of assessment-related tasks and processes. Some offer help in formulating student learning outcomes--and in bringing (as one says) "the science of learning" to "the art of teaching." Several are in the rubric-development…
Descriptors: Higher Education, Campuses, Student Evaluation, Portfolio Assessment
Thompson, Sandra J.; Thurlow, Martha L.; Quenemoen, Rachel F.; Lehr, Camilla A. – 2002
With pressure to find more cost effective and less labor-intensive approaches to testing, states are seeing computer-based testing as a way to address the increasingly challenging prospect of assessing all students in a state at nearly all grades. Unfortunately, most states have not specifically considered the needs of students with disabilities.…
Descriptors: Accessibility (for Disabled), Achievement Tests, Check Lists, Computer Assisted Testing
Peer reviewedVockell, Edward L.; Hall, Jane – Social Studies, 1989
Examines the ways in which computers can assist teachers in developing good tests. Describes the program TESTWORKS in detail and provides charts comparing this program with 11 others in the areas of price, type of questions generated, computer functions, and the usefulness of each. Discusses the use of word processors and databases. (KO)
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Software, Computer Uses in Education
Nettles, Stephen M.; Petscher, Yaacov – Journal of Personnel Evaluation in Education, 2007
Measurement of principal implementation behaviors has proved difficult to researchers in educational leadership due to a lack of consensus on the operational definitions of leadership constructs. The Principal Implementation Questionnaire (PIQ) was developed and validated with the intention of providing clarity in the assessment of principal…
Descriptors: Reading Programs, Instructional Leadership, Hypothesis Testing, Causal Models
Christiansen, Peter – 1972
The activities of the Nucleus Testing Committee, particularly its Curriculum Related Subcommittee, of the Madison, Wisconsin public schools are described in relation to its effort to bring about increased awareness of the need for reexamination of both district-wide and local school evaluation procedures. The sub-committee's recommendations are…
Descriptors: Educational Testing, Evaluation Criteria, Evaluation Methods, Research Committees
Shuy, Roger W. – Georgetown Journal of Languages and Linguistics, 1990
Argues that reading comprehension is better measured in performance contexts than in decontextualized standardized tests (especially for the nonhearing) and that dialogue journal writing is a better method of testing reading comprehension. Five types of comprehension can be isolated in journals: propositions, questions, inferential messages,…
Descriptors: Deafness, Dialog Journals, Evaluation Methods, Journal Writing

Direct link
