ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	11

Descriptor

Evaluation Criteria	26
Models	26
Test Validity	26
Test Reliability	12
Evaluation Methods	8
Test Construction	6
Criterion Referenced Tests	4
Student Evaluation	4
Cutting Scores	3
Decision Making	3
Measurement Techniques	3
Program Effectiveness	3
Standards	3
Test Items	3
Testing	3
Testing Programs	3
Academic Achievement	2
Accountability	2
Achievement Gains	2
Classification	2
Construct Validity	2
Definitions	2
Diagnostic Tests	2
Educational Change	2
Educational Policy	2
More ▼

Source

American Institutes for…	1
Educational Policy…	1
Educational Technology…	1
Evaluation and the Health…	1
International Journal of…	1
Journal of Experimental…	1
Journal of Research on…	1
Journal of Teacher Education	1
Kappa Delta Pi Record	1
Language Testing	1
Linguistik und Didaktik	1
Measurement:…	1
Practical Assessment,…	1
ProQuest LLC	1
More ▼

Publication Type

Journal Articles	9
Reports - Evaluative	8
Reports - Research	6
Speeches/Meeting Papers	4
Reports - Descriptive	3
Information Analyses	2
Opinion Papers	2
Dissertations/Theses -…	1
Dissertations/Theses -…	1
Guides - Non-Classroom	1
Reference Materials -…	1
Tests/Questionnaires	1
More ▼

Education Level

Postsecondary Education	4
Higher Education	3
Elementary Secondary Education	1
Grade 12	1
Grade 4	1
Grade 8	1
High Schools	1
Secondary Education	1
Two Year Colleges	1

Audience

Researchers

Location

New Mexico	1
Texas	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

General Aptitude Test Battery

What Works Clearinghouse Rating

Showing 1 to 15 of 26 results Save | Export

Growth Models and Teacher Evaluation: What Teachers Need to Know and Do

Peer reviewed

Direct link

Katz, Daniel S. – Kappa Delta Pi Record, 2016

Including growth models based on student test scores in teacher evaluations effectively holds teachers individually accountable for students improving their test scores. While an attractive policy for state administrators and advocates of education reform, value-added measures have been fraught with problems, and their use in teacher evaluation is…

Descriptors: Teacher Evaluation, Models, Scores, Evaluation Criteria

Applications of Diagnostic Classification Models: A Literature Review and Critical Commentary

Peer reviewed

Direct link

Sessoms, John; Henson, Robert A. – Measurement: Interdisciplinary Research and Perspectives, 2018

Diagnostic classification models (DCMs) classify examinees based on the skills they have mastered given their test performance. This classification enables targeted feedback that can inform remedial instruction. Unfortunately, applications of DCMs have been criticized (e.g., no validity support). Generally, these evaluations have been brief and…

Descriptors: Literature Reviews, Classification, Models, Criticism

Validating Score Interpretations and Uses: Messick Lecture, Language Testing Research Colloquium, Cambridge, April 2010

Peer reviewed

Direct link

Kane, Michael – Language Testing, 2012

The argument-based approach to validation involves two steps; specification of the proposed interpretations and uses of the test scores as an interpretive argument, and the evaluation of the plausibility of the proposed interpretive argument. More ambitious interpretations and uses tend to involve an extended network of inferences and assumptions…

Descriptors: Testing, Language Tests, Inferences, Test Validity

Test Administration Models

Peer reviewed
PDF on ERIC

Download full text

Becker, Kirk A.; Bergstrom, Betty A. – Practical Assessment, Research & Evaluation, 2013

The need for increased exam security, improved test formats, more flexible scheduling, better measurement, and more efficient administrative processes has caused testing agencies to consider converting the administration of their exams from paper-and-pencil to computer-based testing (CBT). Many decisions must be made in order to provide an optimal…

Descriptors: Testing, Models, Testing Programs, Program Administration

A Turn toward Specifying Validity Criteria in the Measurement of Technological Pedagogical Content Knowledge (TPACK)

Peer reviewed

Direct link

Cavanagh, Robert F.; Koehler, Matthew J. – Journal of Research on Technology in Education, 2013

The impetus for this paper stems from a concern about directions and progress in the measurement of the Technological Pedagogical Content Knowledge (TPACK) framework for effective technology integration. In this paper, we develop the rationale for using a seven-criterion lens, based upon contemporary validity theory, for critiquing empirical…

Descriptors: Technological Literacy, Pedagogical Content Knowledge, Measurement Techniques, Technology Integration

Use of the EFPA Test Review Model by the UK and Issues Relating to the Internationalization of Test Standards

Peer reviewed

Direct link

Lindley, Patricia A.; Bartram, Dave – International Journal of Testing, 2012

In this article, we present the background to the development of test reviewing by the British Psychological Society (BPS) in the United Kingdom. We also describe the role played by the BPS in the development of the EFPA test review model and its adaptation for use in test reviewing in the United Kingdom. We conclude with a discussion of lessons…

Descriptors: Test Reviews, Professional Associations, Psychology, Global Approach

The Politics and Statistics of Value-Added Modeling for Accountability of Teacher Preparation Programs

Peer reviewed

Direct link

Lincove, Jane Arnold; Osborne, Cynthia; Dillon, Amanda; Mills, Nicholas – Journal of Teacher Education, 2014

Despite questions about validity and reliability, the use of value-added estimation methods has moved beyond academic research into state accountability systems for teachers, schools, and teacher preparation programs (TPPs). Prior studies of value-added measurement for TPPs test the validity of researcher-designed models and find that measuring…

Descriptors: Teacher Education Programs, Accountability, Politics of Education, School Statistics

Guiding Principles and Suggested Studies for Determining when the Introduction of a New Assessment Framework Necessitates a Break in Trend in NAEP

Download full text

Nellhaus, Jeffrey; Behuniak, Peter; Stancavage, Frances B. – American Institutes for Research, 2009

Most educational researchers have heard the adage, "If you want to measure change in performance, don't change the measure." At the same time, however, what students need to know and be able to do may change over time as research on teaching and learning provides new insights into the educational process, science and technology advance, and…

Descriptors: Educational Researchers, Evaluation Criteria, Educational Change, Test Validity

Developing an Effective Instrument for Assessing the Performance of Public University Presidents

Direct link

Lester, Dennis – ProQuest LLC, 2010

Conducting a worthwhile assessment of the performance of senior leaders such as university presidents poses unique challenges for public institutions of higher education. One of the most difficult issues is determining the "content" and "format" of the assessment instrument. Due to the breadth and complexity of the job, the…

Descriptors: Feedback (Response), Focus Groups, College Presidents, Test Construction

Item Validation of Online Postsecondary Courses: Rating the Proximity between Similarity and Dissimilarity among Item Pairs (Validation Study Series I: Multidimensional Scaling)

Peer reviewed

Direct link

Seok, Soonhwa – Educational Technology Research and Development, 2009

The purpose of this study was to identify and validate items applicable to evaluating online courses at the postsecondary level. Items were derived from a review of the literature. Four judges rated the similarity of the items by making pair-wise comparisons utilizing multidimensional scaling (MDS). The study consisted of five stages. Stage I…

Descriptors: Online Courses, Multidimensional Scaling, Course Evaluation, Test Items

Creating College Readiness: Profiles of 38 Schools That Know How

Download full text

Conley, D. T. – Educational Policy Improvement Center (NJ1), 2009

In June 2007, the Educational Policy Improvement Center (EPIC) was awarded a grant from the Bill and Melinda Gates Foundation to develop the College Ready School Diagnostic, a web-based diagnostic instrument. The purpose of this tool is to provide individual school profiles and customized recommendations, enabling each institution to make…

Descriptors: Academic Achievement, Educational Policy, Profiles, Charter Schools

An Evaluation of Available Models for Estimating the Reliability and Validity of Criterion Referenced Measures.

Download full text

Oakland, Thomas – 1972

New strategies for evaluation criterion referenced measures (CRM) are discussed. These strategies examine the following issues: (1) the use of normed referenced measures (NRM) as CRM and then estimating the reliability and validity of such measures in terms of variance from an arbitrarily specified criterion score, (2) estimation of the…

Descriptors: Criterion Referenced Tests, Evaluation Criteria, Evaluation Methods, Item Analysis

Determination of Optimal Cutting Scores in Criterion-Referenced Measurement

Peer reviewed

Berk, Ronald A. – Journal of Experimental Education, 1976

Attempts to select empirically the optimal cutting score or criterion level for a test based on response data from validation samples of instructed and uninstructed students. This score maximizes the probability of correct mastery-nonmastery decisions (or minimizes the probability of incorrect decisions). (Author/RK)

Descriptors: Charts, Criterion Referenced Tests, Cutting Scores, Educational Testing

Notengebung: Kritik und Alternativen (Grading: Criticism and Alternatives)

Jager, Siegfried; Duhm, Dieter – Linguistik und Didaktik, 1971

Descriptors: Educational Strategies, Evaluation Criteria, Grading, Instructional Improvement

Testing the Handicapped: Validation and Test Interpretation.

Download full text

Forehand, Garlie A. – 1982

Problems in validating ability tests for handicapped students and research approaches to predictive validity are discussed. Validity for handicapped persons tested under regular conditions; for applicants to special programs, and for tests taken under special administrative conditions are considered. Item analysis and the construction of new…

Descriptors: Academic Ability, Disabilities, Evaluation Criteria, Measures (Individuals)

Previous Page | Next Page »

Pages: 1 | 2

Cason, Carolyn L.	2
Bartram, Dave	1
Becker, Kirk A.	1
Behuniak, Peter	1
Bergstrom, Betty A.	1
Berk, Ronald A.	1
Cason, Gerald J.	1
Cavanagh, Robert F.	1
Conley, D. T.	1
Conway, Malcolm J.	1
Dillon, Amanda	1
Duhm, Dieter	1
Feezel, Jerry D.	1
Forehand, Garlie A.	1
Giesen, Linda A.	1
Henson, Robert A.	1
Herman, Joan L.	1
Jager, Siegfried	1
Kane, Michael	1
Katz, Daniel S.	1
Klein, Stephen P.	1
Koehler, Matthew J.	1
Lester, Dennis	1
Lincove, Jane Arnold	1
More ▼