Katz, Daniel S. – Kappa Delta Pi Record, 2016
Including growth models based on student test scores in teacher evaluations effectively holds teachers individually accountable for students improving their test scores. While an attractive policy for state administrators and advocates of education reform, value-added measures have been fraught with problems, and their use in teacher evaluation is…
Descriptors: Teacher Evaluation, Models, Scores, Evaluation Criteria
Sessoms, John; Henson, Robert A. – Measurement: Interdisciplinary Research and Perspectives, 2018
Diagnostic classification models (DCMs) classify examinees based on the skills they have mastered given their test performance. This classification enables targeted feedback that can inform remedial instruction. Unfortunately, applications of DCMs have been criticized (e.g., no validity support). Generally, these evaluations have been brief and…
Descriptors: Literature Reviews, Classification, Models, Criticism
Kane, Michael – Language Testing, 2012
The argument-based approach to validation involves two steps: specification of the proposed interpretations and uses of the test scores as an interpretive argument, and the evaluation of the plausibility of the proposed interpretive argument. More ambitious interpretations and uses tend to involve an extended network of inferences and assumptions…
Descriptors: Testing, Language Tests, Inferences, Test Validity
Becker, Kirk A.; Bergstrom, Betty A. – Practical Assessment, Research & Evaluation, 2013
The need for increased exam security, improved test formats, more flexible scheduling, better measurement, and more efficient administrative processes has caused testing agencies to consider converting the administration of their exams from paper-and-pencil to computer-based testing (CBT). Many decisions must be made in order to provide an optimal…
Descriptors: Testing, Models, Testing Programs, Program Administration
Cavanagh, Robert F.; Koehler, Matthew J. – Journal of Research on Technology in Education, 2013
The impetus for this paper stems from a concern about directions and progress in the measurement of the Technological Pedagogical Content Knowledge (TPACK) framework for effective technology integration. In this paper, we develop the rationale for using a seven-criterion lens, based upon contemporary validity theory, for critiquing empirical…
Descriptors: Technological Literacy, Pedagogical Content Knowledge, Measurement Techniques, Technology Integration
Lindley, Patricia A.; Bartram, Dave – International Journal of Testing, 2012
In this article, we present the background to the development of test reviewing by the British Psychological Society (BPS) in the United Kingdom. We also describe the role played by the BPS in the development of the EFPA test review model and its adaptation for use in test reviewing in the United Kingdom. We conclude with a discussion of lessons…
Descriptors: Test Reviews, Professional Associations, Psychology, Global Approach
Lincove, Jane Arnold; Osborne, Cynthia; Dillon, Amanda; Mills, Nicholas – Journal of Teacher Education, 2014
Despite questions about validity and reliability, the use of value-added estimation methods has moved beyond academic research into state accountability systems for teachers, schools, and teacher preparation programs (TPPs). Prior studies of value-added measurement for TPPs test the validity of researcher-designed models and find that measuring…
Descriptors: Teacher Education Programs, Accountability, Politics of Education, School Statistics
Nellhaus, Jeffrey; Behuniak, Peter; Stancavage, Frances B. – American Institutes for Research, 2009
Most educational researchers have heard the adage, "If you want to measure change in performance, don't change the measure." At the same time, however, what students need to know and be able to do may change over time as research on teaching and learning provides new insights into the educational process, science and technology advance, and…
Descriptors: Educational Researchers, Evaluation Criteria, Educational Change, Test Validity
Lester, Dennis – ProQuest LLC, 2010
Conducting a worthwhile assessment of the performance of senior leaders such as university presidents poses unique challenges for public institutions of higher education. One of the most difficult issues is determining the "content" and "format" of the assessment instrument. Due to the breadth and complexity of the job, the…
Descriptors: Feedback (Response), Focus Groups, College Presidents, Test Construction
Seok, Soonhwa – Educational Technology Research and Development, 2009
The purpose of this study was to identify and validate items applicable to evaluating online courses at the postsecondary level. Items were derived from a review of the literature. Four judges rated the similarity of the items by making pair-wise comparisons utilizing multidimensional scaling (MDS). The study consisted of five stages. Stage I…
Descriptors: Online Courses, Multidimensional Scaling, Course Evaluation, Test Items
Conley, D. T. – Educational Policy Improvement Center (NJ1), 2009
In June 2007, the Educational Policy Improvement Center (EPIC) was awarded a grant from the Bill and Melinda Gates Foundation to develop the College Ready School Diagnostic, a web-based diagnostic instrument. The purpose of this tool is to provide individual school profiles and customized recommendations, enabling each institution to make…
Descriptors: Academic Achievement, Educational Policy, Profiles, Charter Schools
Oakland, Thomas – 1972
New strategies for evaluating criterion-referenced measures (CRM) are discussed. These strategies examine the following issues: (1) the use of norm-referenced measures (NRM) as CRM and then estimating the reliability and validity of such measures in terms of variance from an arbitrarily specified criterion score, (2) estimation of the…
Descriptors: Criterion Referenced Tests, Evaluation Criteria, Evaluation Methods, Item Analysis

Berk, Ronald A. – Journal of Experimental Education, 1976
Attempts to select empirically the optimal cutting score or criterion level for a test based on response data from validation samples of instructed and uninstructed students. This score maximizes the probability of correct mastery-nonmastery decisions (or minimizes the probability of incorrect decisions). (Author/RK)
Descriptors: Charts, Criterion Referenced Tests, Cutting Scores, Educational Testing
Jager, Siegfried; Duhm, Dieter – Linguistik und Didaktik, 1971
Descriptors: Educational Strategies, Evaluation Criteria, Grading, Instructional Improvement
Forehand, Garlie A. – 1982
Problems in validating ability tests for handicapped students and research approaches to predictive validity are discussed. Validity for handicapped persons tested under regular conditions, for applicants to special programs, and for tests taken under special administrative conditions are considered. Item analysis and the construction of new…
Descriptors: Academic Ability, Disabilities, Evaluation Criteria, Measures (Individuals)