Informing in the Information Age: How to Communicate Measurement Concepts to Education Policy Makers
Sireci, Stephen G.; Forte, Ellen – Educational Measurement: Issues and Practice, 2012
Current educational policies rely on educational assessments. However, the technical aspects of assessments are often unknown to policy makers, which is dangerous because sound assessment policy requires knowledge of the strengths and limitations of educational tests. In this article, we discuss the importance of informing policy makers of…
Descriptors: Educational Assessment, Psychometrics, Educational Policy, Educational Testing
Berk, Ronald A. – Journal of Faculty Development, 2016
Recently, student outcomes have bubbled to the top of debates about how to evaluate teaching in community and liberal arts colleges, universities, and professional schools, but even more international attention has been riveted on how outcomes are being used to evaluate teachers and administrators K-12 (Harris, 2012; Rowen & Raudenbush, 2016;…
Descriptors: Value Added Models, Academic Achievement, Outcomes of Education, Teacher Evaluation
Condon, William – Assessing Writing, 2013
Automated Essay Scoring (AES) has garnered a great deal of attention from the rhetoric and composition/writing studies community since the Educational Testing Service began using e-rater® and the "Criterion"® Online Writing Evaluation Service as products in scoring writing tests, and most of the responses have been negative. While the…
Descriptors: Measurement, Psychometrics, Evaluation Methods, Educational Testing
Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Multivariate Behavioral Research, 2010
Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…
Descriptors: Educational Testing, Scores, Reports, Psychometrics
Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors' experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.
Descriptors: Scoring, Reliability, Validity, Classification

Wang, Tianyou; Kolen, Michael J. – Journal of Educational Measurement, 2001
Reviews research literature on comparability issues in computerized adaptive testing (CAT) and synthesizes issues specific to comparability and test security. Develops a framework for evaluating comparability that contains three categories of criteria: (1) validity; (2) psychometric property/reliability; and (3) statistical assumption/test…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Criteria
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity
Sigel, Irving E. – 1978
This paper provides a theoretical discussion of educational program evaluation. Psychometric theory and developmental psychology are compared as they pertain to the testing of children. The nature of change in childhood makes it necessary to examine the assumptions and goals related to the testing of children as a means of evaluating educational…
Descriptors: Child Development, Cognitive Measurement, Developmental Psychology, Developmental Stages
Minnema, Jane; Thurlow, Martha; Bielinski, John – 2002
Two focus groups of test and measurement experts were held to explore the use of out-of-level testing for students with disabilities. The participants (n=17) included state and federal level assessment personnel, test company employees, and university professors. A content analysis of the narrative results indicated that there was no clear…
Descriptors: Academic Standards, Adaptive Testing, Criterion Referenced Tests, Disabilities