Publication Date
| In 2026 | 0 |
| Since 2025 | 72 |
| Since 2022 (last 5 years) | 332 |
| Since 2017 (last 10 years) | 657 |
| Since 2007 (last 20 years) | 1709 |
Descriptor
| Evaluation Methods | 4240 |
| Student Evaluation | 1492 |
| Testing | 1257 |
| Computer Assisted Testing | 1061 |
| Elementary Secondary Education | 722 |
| Foreign Countries | 720 |
| Educational Testing | 610 |
| Educational Assessment | 601 |
| Test Construction | 537 |
| Testing Problems | 516 |
| Higher Education | 465 |
Author
| Thurlow, Martha | 29 |
| Thurlow, Martha L. | 22 |
| Tindal, Gerald | 12 |
| Ysseldyke, James E. | 12 |
| Baker, Eva L. | 10 |
| Alonzo, Julie | 9 |
| Herman, Joan L. | 9 |
| Popham, W. James | 8 |
| Hambleton, Ronald K. | 7 |
| Jaeger, Richard M. | 7 |
| Lai, Cheng Fei | 7 |
Audience
| Practitioners | 263 |
| Teachers | 138 |
| Researchers | 100 |
| Administrators | 67 |
| Policymakers | 36 |
| Students | 19 |
| Counselors | 11 |
| Parents | 10 |
| Community | 9 |
| Support Staff | 7 |
| Media Staff | 2 |
Location
| United Kingdom | 85 |
| Australia | 72 |
| Canada | 68 |
| United Kingdom (England) | 44 |
| United States | 44 |
| California | 41 |
| Florida | 40 |
| Germany | 34 |
| Turkey | 31 |
| Netherlands | 29 |
| New York | 27 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Alturki, Raad A. – Informatics in Education, 2016
Students' performances in introductory programming courses show large variation across students. There may be many reasons for these variations, such as methods of teaching, teacher competence in the subject, students' coding backgrounds and abilities, students' self-discipline, the teaching environment, and the resources available to students,…
Descriptors: Introductory Courses, Programming, Student Evaluation, Measurement Techniques
Chen, Junjun; Cowie, Bronwen – Educational Practice and Theory, 2016
This study investigated the responses of 531 preservice teachers to a "Beliefs About Assessment" questionnaire in China. The questionnaire focused on understanding the purposes, practices and principles of assessment for and of learning. Using factor analysis, an inter-correlated two-order model fitted well to the responses. This model…
Descriptors: Preservice Teachers, Student Attitudes, Foreign Countries, Questionnaires
DeSimone, Charles P. – ProQuest LLC, 2016
Evaluation of instruction has typically occurred during development, before implementation, and after course completion. The problem is that evaluation is typically post delivery; courses are not traditionally updated in real time with feedback from students in the classroom. However, the potential to evaluate and modify instruction during delivery…
Descriptors: Evaluation Methods, Qualitative Research, Delphi Technique, Attitude Measures
Hagley, Eric – Reading in a Foreign Language, 2017
Extensive graded reading (EGR) was carried out with a cohort of 600 engineering students in a university in northern Japan. Pre- and post-surveys were conducted to discover changes in the general reading habits of students, their attitudes toward the assessment method and how goals changed over the course of study. The first survey was carried out…
Descriptors: Foreign Countries, College Students, Engineering Education, Reading Instruction
Emenogu, Barnabas C.; Falenchuk, Olesya; Childs, Ruth A. – Alberta Journal of Educational Research, 2010
Most implementations of the Mantel-Haenszel differential item functioning procedure delete records with missing responses or replace missing responses with scores of 0. These treatments of missing data make strong assumptions about the causes of the missing data. Such assumptions may be particularly problematic when groups differ in their patterns…
Descriptors: Foreign Countries, Test Bias, Test Items, Educational Testing
Stark, Stephen; Chernyshenko, Oleksandr S. – International Journal of Testing, 2011
This article delves into a relatively unexplored area of measurement by focusing on adaptive testing with unidimensional pairwise preference items. The use of such tests is becoming more common in applied non-cognitive assessment because research suggests that this format may help to reduce certain types of rater error and response sets commonly…
Descriptors: Test Length, Simulation, Adaptive Testing, Item Analysis
Kim, Jiseon – ProQuest LLC, 2010
Classification testing has been widely used to make categorical decisions by determining whether an examinee has a certain degree of ability required by established standards. As computer technologies have developed, classification testing has become more computerized. Several approaches have been proposed and investigated in the context of…
Descriptors: Test Length, Computer Assisted Testing, Classification, Probability
Nimehchisalem, Vahid – International Journal of Education and Literacy Studies, 2015
Antony John Kunnan is a language assessment specialist. His research interests are fairness of tests and testing practice, assessment literacy, research methods and statistics, ethics and standards, and language assessment policy. His most recent publications include a four-volume edited collection of 140 chapters titled "The Companion to…
Descriptors: Language Tests, Testing, Specialists, College Faculty
Cawthon, Stephanie – American Annals of the Deaf, 2015
Designing assessments and tests is one of the more challenging aspects of creating an accessible learning environment for students who are deaf or hard of hearing (DHH), particularly for deaf students with a disability (DWD). Standardized assessments are a key mechanism by which the educational system in the United States measures student…
Descriptors: Deafness, Hearing Impairments, Standardized Tests, Student Characteristics
Fields, Lanny; Spear, Jack – Psychological Record, 2012
Joint stimulus control occurs when responding is determined by the correspondence of elements of a complex sample and a complex comparison stimulus. In academic settings, joint stimulus control of behavior would be evidenced by the selection of an accurate description of a complex graph in which each element of a graph corresponded to particular…
Descriptors: Stimuli, Graphs, Behavioral Science Research, Evaluation Methods
Doroudi, Shayan; Holstein, Kenneth; Aleven, Vincent; Brunskill, Emma – Grantee Submission, 2016
How should a wide variety of educational activities be sequenced to maximize student learning? Although some experimental studies have addressed this question, educational data mining methods may be able to evaluate a wider range of possibilities and better handle many simultaneous sequencing constraints. We introduce Sequencing Constraint…
Descriptors: Sequential Learning, Data Collection, Information Retrieval, Evaluation Methods
Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill – ETS Research Report Series, 2014
The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data. This study compares an "SAT"® critical reading anchor that contains more discrete items proportionally, compared to the total tests to be equated, to another anchor that…
Descriptors: Equated Scores, Test Items, College Entrance Examinations, Comparative Analysis
Lin, Pei-Ying; Lin, Yu-Cheng – Educational and Psychological Measurement, 2014
This exploratory study investigated potential sources of setting accommodation resulting in differential item functioning (DIF) on math and reading assessments for examinees with varied learning characteristics. The examinees were those who participated in large-scale assessments and were tested in either standardized or accommodated testing…
Descriptors: Test Bias, Multivariate Analysis, Testing Accommodations, Mathematics Tests
Scott, Cheryl M. – Topics in Language Disorders, 2011
Purpose: Older school-aged children and adolescents with persistent language and literacy impairments vary in their individual profiles of linguistic strengths and weaknesses. Given the multidimensional nature and complexity of language, designing an assessment protocol capable of uncovering linguistic variation is challenging. A process of…
Descriptors: Language Variation, Linguistics, Language Impairments, Testing
Kuentzel, Jeffrey G.; Hetterscheidt, Lesley A.; Barnett, Douglas – Journal of Psychoeducational Assessment, 2011
The rigors of standardized testing make for numerous opportunities for examiner error, including simple computational mistakes in scoring. Although experts recommend that test scoring be double-checked, the extent to which independent double-checking would reduce scoring errors is not known. A double-checking procedure was established at a…
Descriptors: Feedback (Response), Intelligence, Testing, Standardized Tests

