Yesildag Hasancebi, Funda; Yuksel, Busra Tuncay; Mesci, Gunkut – International Journal of Assessment Tools in Education, 2022
The purpose of this study was to develop a reliable and valid rating scale for assessing and evaluating lesson plans and teaching practices based on argumentation-based inquiry (ABI). The study covered two academic years (four academic semesters). Qualitative and quantitative methods were utilized throughout the…
Descriptors: Foreign Countries, Rating Scales, Test Construction, Test Validity
Little, Todd D.; Chang, Rong; Gorrall, Britt K.; Waggenspack, Luke; Fukuda, Eriko; Allen, Patricia J.; Noam, Gil G. – International Journal of Behavioral Development, 2020
We revisit the merits of the retrospective pretest-posttest (RPP) design for repeated-measures research. The underutilized RPP method asks respondents to rate survey items twice during the same posttest measurement occasion from two specific frames of reference: "now" and "then." Individuals first report their current attitudes…
Descriptors: Pretesting, Alternative Assessment, Program Evaluation, Evaluation Methods
Donovan, Courtney; Green, Kathy E.; Seidel, Kent – Leadership and Research in Education, 2017
Core competencies essential for effective teaching were identified via a literature review and a review of standards for teacher education, and vetted by state groups with interests in teacher education. Survey items based on these competencies asked teacher candidates, graduates, and teacher education program faculty how well the program prepared…
Descriptors: Teacher Effectiveness, Item Response Theory, Item Analysis, Test Items
Brooks, D. Christopher; Marsh, Lauren; Wilcox, Kimerly; Cohen, Brad – Journal of Faculty Development, 2011
In response to the well-documented need for rigorous evaluations of faculty development programs and increasing demands for institutional accountability, University of Minnesota's Office of Information Technology (OIT) researchers have developed an approach to program evaluation that assesses individual level changes to participants' attitudes,…
Descriptors: Program Evaluation, Information Technology, Faculty Development, Accountability
Kosecoff, Jacqueline; Fink, Arlene – 1976
The feasibility of using criterion referenced tests (CRTs) in a large-scale evaluation conducted in an effectiveness evaluation context was investigated. The study began by examining the theory that structures the development and validation of CRTs to discover whether, on theoretical grounds alone, CRTs are suitable or not suitable for large-scale…
Descriptors: Criteria, Criterion Referenced Tests, Definitions, Feasibility Studies
Koretz, Daniel; And Others – 1994
The 1992-93 school year saw the second statewide implementation of the Vermont portfolio-assessment program, and RAND continued its ongoing evaluation of the program's implementation, effects, and data quality. While the first year's study found evidence of the impact of the assessment program and low reliability of portfolio scoring, this year's…
Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Mathematics
Fowler, R. Clarke – Phi Delta Kappan, 2001
Research says the school-improvement mechanisms favored by policymakers (more certification tests, like the Massachusetts Educator Certification Test that 59 percent of candidates failed in 1998; higher cut scores; and severe penalties for institutions not meeting pass rates) are unlikely to deliver increased accountability and better teachers.…
Descriptors: Cutting Scores, Elementary Secondary Education, Instructional Improvement, Mass Media
Barker, Pierce; Pelavin, Sol H. – 1976
This study was mounted to assess the validity of standard score transformations of raw test scores and test bias on the 1970 edition of the Metropolitan Achievement Test Battery, in the context of a controversial federally funded compensatory education program, the Educational Voucher Demonstration (EVD). On an individual level the validity of the…
Descriptors: Achievement Gains, Achievement Tests, Educationally Disadvantaged, Elementary Education
Powills, Judith A.; And Others – 1979
Language arts teachers participated in an inservice program, The Writer's Clinic, and evaluated their students' improvement in writing ability using holistic essay scoring techniques. Seventh and eighth grade students were administered an essay composition pretest in December; the same essay topic was given as the posttest in May. Students were…
Descriptors: Age Differences, Cost Effectiveness, Essay Tests, Essays
Koretz, Daniel – 1994
Since 1988 the Vermont Department of Education has been developing an innovative statewide performance assessment program. In 1990, the RAND Corporation began evaluating the Vermont assessment program, focusing specifically on the portfolio component of its assessment system. This report presents results from the evaluation in the 1992-93 school…
Descriptors: Educational Assessment, Elementary Secondary Education, Interviews, Mathematics Tests
Cohen, Allan S., Comp. – 1979
This partially annotated bibliography of journal articles, dissertations, convention papers, research reports, and a few books and unpublished manuscripts provides a comprehensive coverage of work on latent trait theory and practice. Documents are arranged alphabetically by author. The period covered ranges from the early 1950's to the present.…
Descriptors: Attitude Measures, Career Development, Computer Assisted Testing, Computer Programs
Millman, Jason – 1974
This chapter should not only acquaint the reader with the present state of the art on Criterion-Referenced (CR) measurement but also suggest possible directions for further inquiry. The goal of the first part of this chapter is to deal with the definitional dilemma of CR measurement by proceeding from the more traditional view of CR measurement to…
Descriptors: Analysis of Variance, Bayesian Statistics, Behavioral Objectives, Comparative Analysis