Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 23 |
Descriptor
Evaluation Methods | 123 |
Test Results | 123 |
Testing | 39 |
Educational Assessment | 36 |
Testing Programs | 35 |
Student Evaluation | 33 |
Educational Testing | 31 |
Elementary Secondary Education | 29 |
Standardized Tests | 29 |
Test Interpretation | 29 |
Academic Achievement | 27 |
More ▼ |
Source
Author
Publication Type
Education Level
Elementary Secondary Education | 18 |
Elementary Education | 6 |
Secondary Education | 4 |
Higher Education | 3 |
Postsecondary Education | 3 |
Grade 6 | 2 |
Grade 9 | 2 |
Adult Education | 1 |
Early Childhood Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
More ▼ |
Location
United Kingdom | 8 |
Canada | 5 |
California | 2 |
Minnesota | 2 |
Sweden | 2 |
United Kingdom (England) | 2 |
Vermont | 2 |
Alaska | 1 |
Arizona | 1 |
Arkansas | 1 |
Australia | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Education Consolidation… | 1 |
Hawkins Stafford Act 1988 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Richardson, Mary – UCL Press, 2022
Educational assessment is important. But in the twenty-first century it is easy to feel that schooling and other phases of education are shaped entirely by certain assessments, and that assessment is only about exam results. The idea that test grades can accurately describe the aims and outcomes of education is unfair and reductive. Yet it is a…
Descriptors: Educational Assessment, Trust (Psychology), Tests, Grades (Scholastic)
Mattern, Krista; Radunzel, Justine – ACT, Inc., 2019
When applicants take the ACT® more than once, how do colleges and universities reconcile and make sense of the multiple scores? In terms of validity, fairness, and impact on subgroup differences, are certain score-use polices better than others? The focus of this issue brief is to summarize evidence on the validity and fairness of various…
Descriptors: Scoring, College Entrance Examinations, Test Validity, Evaluation Methods
Ferrara, Steve; Perie, Marianne; Johnson, Eugene – Journal of Applied Testing Technology, 2008
Psychometricians continue to introduce new approaches to setting cut scores for educational assessments in an attempt to improve on current methods. In this paper we describe the Item-Descriptor (ID) Matching method, a method based on IRT item mapping. In ID Matching, test content area experts match items (i.e., their judgments about the knowledge…
Descriptors: Test Results, Test Content, Testing Programs, Educational Testing
Woods, Carol M. – Applied Psychological Measurement, 2011
Differential item functioning (DIF) occurs when an item on a test, questionnaire, or interview has different measurement properties for one group of people versus another, irrespective of true group-mean differences on the constructs being measured. This article is focused on item response theory based likelihood ratio testing for DIF (IRT-LR or…
Descriptors: Simulation, Item Response Theory, Testing, Questionnaires
Shermis, Mark D.; DiVesta, Francis J. – Rowman & Littlefield Publishers, Inc., 2011
"Classroom Assessment in Action" clarifies the multi-faceted roles of measurement and assessment and their applications in a classroom setting. Comprehensive in scope, Shermis and Di Vesta explain basic measurement concepts and show students how to interpret the results of standardized tests. From these basic concepts, the authors then…
Descriptors: Student Evaluation, Standardized Tests, Scores, Measurement
Klesch, Heather S. – ProQuest LLC, 2010
The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…
Descriptors: Feedback (Response), Test Results, Focus Groups, Educational Testing
Barnes, Susan Kubic – ProQuest LLC, 2010
In this era of increased accountability in education, there is a need for tools to use in assessing the abilities and instructional levels of young children. Computers have been used successfully to assess older children and adults. However, there is a dearth of empirical research to provide evidence that computer-based testing (CBT) is…
Descriptors: Test Results, Testing, Phonological Awareness, Preschool Children
Coe, Robert – Research Papers in Education, 2010
Much of the argument about comparability of examination standards is at cross-purposes; contradictory positions are in fact often both defensible, but they are using the same words to mean different things. To clarify this, two broad conceptualisations of standards can be identified. One sees the standard in the observed phenomena of performance…
Descriptors: Foreign Countries, Tests, Evaluation Methods, Standards
Woods, Carol M. – Applied Psychological Measurement, 2009
Differential item functioning (DIF) occurs when items on a test or questionnaire have different measurement properties for one group of people versus another, irrespective of group-mean differences on the construct. Methods for testing DIF require matching members of different groups on an estimate of the construct. Preferably, the estimate is…
Descriptors: Test Results, Testing, Item Response Theory, Test Bias
Bramley, Tom; Gill, Tim – Research Papers in Education, 2010
The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…
Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries
Newton, Paul E. – Research Papers in Education, 2010
Robert Coe has claimed that three broad conceptions of comparability can be identified from the literature: performance, statistical and conventional. Each of these he rejected, in favour of a single, integrated conception which relies upon the notion of a "linking construct" and which he termed "construct comparability".…
Descriptors: Psychometrics, Measurement Techniques, Foreign Countries, Tests
Bossone, Richard M., Ed. – 1978
The speeches presented at the second national conference on testing deal with models for program evaluation, reporting of test results, minimum competency programs, and the role of state and federal government in educational testing. Various approaches to program evaluation, and its relationship to testing are described by Michael Scriven, Lee J.…
Descriptors: Conference Reports, Educational Testing, Evaluation Methods, Federal Regulation

Koegel, Lynn Kern; Koegel, Robert L.; Smith, Annette – Journal of Autism and Developmental Disorders, 1997
A study of six children (ages 3-9) with autism assessed whether manipulation of variables related to motivation and attention in children with autism would influence performance on standardized tests. Results found the children showed higher scores when conditions were modified to improve the likelihood of responding to test stimuli. (Author/CR)
Descriptors: Attention, Autism, Evaluation Methods, Standardized Tests
Tindal, Gerald; Heath, Bill; Hollenbeck, Keith; Almond, Patricia; Harniss, Mark – 1998
In this study, fourth-grade special education students (n=78) and general education students (n=403) took a large-scale statewide test using standard test administration procedure and two major accommodations addressing response conditions and test administration. On both reading and math tests, students bubbled in answers on a separate sheet (the…
Descriptors: Disabilities, Educational Assessment, Elementary Education, Evaluation Methods
Holmen, Milton G.; Docter, Richard – 1971
The application of tests in clinical and counseling work, in educational achievement testing, and in personnel selection is discussed. An analysis of the organizations which comprise the testing industry, including the various publishers and developers of tests and also the test scoring organizations, is given. The concept of an assessment system…
Descriptors: Academic Achievement, Clinical Experience, Concept Formation, Counseling Services