Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 9 |
Descriptor
Educational Assessment | 13 |
Error of Measurement | 13 |
Data Collection | 4 |
Student Evaluation | 4 |
Test Reliability | 4 |
Evaluation Methods | 3 |
Sampling | 3 |
Scoring | 3 |
Test Construction | 3 |
Test Validity | 3 |
Achievement Rating | 2 |
More ▼ |
Source
Educational Measurement:… | 3 |
Biochemistry and Molecular… | 1 |
Educational Leadership | 1 |
Health Education Research | 1 |
National Centre for… | 1 |
Oxford Review of Education | 1 |
Practical Assessment,… | 1 |
Author
Anderson, Trevor R. | 1 |
Barr, James | 1 |
Boyer, Michelle | 1 |
Burkhardt, Amy | 1 |
Coverdale, Bradley J. | 1 |
Gardner, John | 1 |
Goldberg, Gail Lynn | 1 |
Hosman, Clemens M. H. | 1 |
Jin, Ying | 1 |
Kok, Gerjo J. | 1 |
Kolen, Michael J. | 1 |
More ▼ |
Publication Type
Reports - Descriptive | 13 |
Journal Articles | 8 |
Speeches/Meeting Papers | 3 |
Numerical/Quantitative Data | 1 |
Opinion Papers | 1 |
Reports - Evaluative | 1 |
Education Level
Elementary Secondary Education | 4 |
Higher Education | 4 |
Adult Education | 2 |
Elementary Education | 1 |
Grade 3 | 1 |
Grade 5 | 1 |
Grade 8 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 1 |
Teachers | 1 |
Location
Australia | 1 |
Maryland | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
National Assessment of… | 2 |
Iowa Tests of Basic Skills | 1 |
Iowa Tests of Educational… | 1 |
What Works Clearinghouse Rating
Lottridge, Sue; Burkhardt, Amy; Boyer, Michelle – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Sue Lottridge, Amy Burkhardt, and Dr. Michelle Boyer provide an overview of automated scoring. Automated scoring is the use of computer algorithms to score unconstrained open-ended test items by mimicking human scoring. The use of automated scoring is increasing in educational assessment programs because it allows…
Descriptors: Computer Assisted Testing, Scoring, Automation, Educational Assessment
Schafer, William D.; Coverdale, Bradley J.; Luxenberg, Harlan; Jin, Ying – Practical Assessment, Research & Evaluation, 2011
There are relatively few examples of quantitative approaches to quality control in educational assessment and accountability contexts. Among the several techniques that are used in other fields, Shewart charts have been found in a few instances to be applicable in educational settings. This paper describes Shewart charts and gives examples of how…
Descriptors: Charts, Quality Control, Educational Assessment, Statistical Analysis
Gardner, John – Oxford Review of Education, 2013
Evidence from recent research suggests that in the UK the public perception of errors in national examinations is that they are simply mistakes; events that are preventable. This perception predominates over the more sophisticated technical view that errors arise from many sources and create an inevitable variability in assessment outcomes. The…
Descriptors: Educational Assessment, Public Opinion, Error of Measurement, Foreign Countries
Popham, W. James – Educational Leadership, 2009
If a person were to ask an educator to identify the two most important attributes of an education test, the response most certainly would be "validity and reliability." These two tightly wedded concepts have become icons in the field of education assessment. As far as validity is concerned, the term doesn't refer to the accuracy of a test. Rather,…
Descriptors: Educational Testing, Educational Assessment, Student Evaluation, Test Reliability
National Centre for Vocational Education Research (NCVER), 2012
This publication presents information on tertiary education and training during 2010, including statistics on participation and outcomes. The definition of tertiary education and training adopted for this publication is formal study in vocational education and training (VET) and higher education, including enrolments in Australian Qualifications…
Descriptors: Higher Education, Foreign Countries, Vocational Education, Postsecondary Education
Tong, Ye; Kolen, Michael J. – Educational Measurement: Issues and Practice, 2010
"Scaling" is the process of constructing a score scale that associates numbers or other ordered indicators with the performance of examinees. Scaling typically is conducted to aid users in interpreting test results. This module describes different types of raw scores and scale scores, illustrates how to incorporate various sources of…
Descriptors: Test Results, Scaling, Measures (Individuals), Raw Scores
Anderson, Trevor R.; Rogan, John M. – Biochemistry and Molecular Biology Education, 2010
Student assessment is central to the educational process and can be used for multiple purposes including, to promote student learning, to grade student performance and to evaluate the educational quality of qualifications. It is, therefore, of utmost importance that assessment instruments are of a high quality. In this article, we present various…
Descriptors: Educational Assessment, Educational Quality, Student Evaluation, Educational Research
Wu, Margaret – Educational Measurement: Issues and Practice, 2010
In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…
Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness
Molleman, Gerard R. M.; Peters, Louk W. H.; Hosman, Clemens M. H.; Kok, Gerjo J.; Oosterveld, Paul – Health Education Research, 2006
Preffi 2.0 is an evidence-based Dutch quality assessment instrument for health promotion interventions. It is mainly intended for both planning and assessing one's own projects but can also be used to assess other people's projects (external use). This article reports a study on the reliability of Preffi as an external quality assessment…
Descriptors: Expertise, Evidence, Generalizability Theory, Health Promotion
Searls, Donald T., Ed. – 1983
The purpose of this paper is to provide an overview of the analysis of data collected by the National Assessment of Educational Progress (NAEP). In simplest terms, the analysis can be characterized as establishing baseline estimates of the percentages of young Americans possessing certain skills, knowledge, understandings, and attitudes and…
Descriptors: Data Analysis, Data Collection, Databases, Educational Assessment
Goldberg, Gail Lynn; Walker-Bartnick, Leslie – 1988
A scoring rubric transition study is described. It was designed to evaluate possible drift in scoring the Maryland Writing Test from year to year (when using a modified holistic scoring method), to evaluate strategies for revising swing rubrics from narrative and explanatory writing while maintaining original scoring standards, and to establish…
Descriptors: Educational Assessment, Elementary Secondary Education, Error of Measurement, Grading
Terenzini, Patrick T. – 1986
Unobtrusive measures are recommended as a means of assessing educational outcomes of colleges. Such measures can counteract the response bias which is common in questionnaires and interviews. Outcomes researchers are, in fact, asked to supplement standard measures with unobtrusive measures. Interesting data may result from observation of students'…
Descriptors: Colleges, Cost Effectiveness, Educational Assessment, Error of Measurement
Rasor, Richard E.; Barr, James – 1998
This paper provides an overview of common sampling methods (both the good and the bad) likely to be used in community college self-evaluations and presents the results from several simulated trials. The report begins by reviewing various survey techniques, discussing the negative and positive aspects of each method. The increased accuracy and…
Descriptors: Community Colleges, Comparative Analysis, Cost Effectiveness, Data Collection