ERIC - Search Results

Showing all 12 results

A Rating Scale Development Study for the Evaluation of Lesson Plans and Teaching Practices on Argumentation-Based Inquiry

Peer reviewed

Yesildag Hasancebi, Funda; Yuksel, Busra Tuncay; Mesci, Gunkut – International Journal of Assessment Tools in Education, 2022

The purpose of this study was to develop a reliable and valid rating scale for the use of the assessment and evaluation of lesson plans and teaching practices that are based on argumentation-based inquiry (ABI). The study covered two academic years (four academic semesters). Qualitative and quantitative methods were utilized throughout the…

Descriptors: Foreign Countries, Rating Scales, Test Construction, Test Validity

The Retrospective Pretest-Posttest Design Redux: On Its Validity as an Alternative to Traditional Pretest-Posttest Measurement

Peer reviewed

Little, Todd D.; Chang, Rong; Gorrall, Britt K.; Waggenspack, Luke; Fukuda, Eriko; Allen, Patricia J.; Noam, Gil G. – International Journal of Behavioral Development, 2020

We revisit the merits of the retrospective pretest-posttest (RPP) design for repeated-measures research. The underutilized RPP method asks respondents to rate survey items twice during the same posttest measurement occasion from two specific frames of reference: "now" and "then." Individuals first report their current attitudes…

Descriptors: Pretesting, Alternative Assessment, Program Evaluation, Evaluation Methods

Differential Item Functioning on a Measure of Perceptions of Preparation for Teachers, Teacher Candidates, and Program Personnel

Peer reviewed

Donovan, Courtney; Green, Kathy E.; Seidel, Kent – Leadership and Research in Education, 2017

Core competencies essential for effective teaching were identified via a literature review and a review of standards for teacher education, and vetted by state groups with interests in teacher education. Survey items based on these competencies asked teacher candidates, graduates, and teacher education program faculty how well the program prepared…

Descriptors: Teacher Effectiveness, Item Response Theory, Item Analysis, Test Items

Beyond Satisfaction: Toward an Outcomes-Based, Procedural Model of Faculty Development Program Evaluation

Peer reviewed

Brooks, D. Christopher; Marsh, Lauren; Wilcox, Kimerly; Cohen, Brad – Journal of Faculty Development, 2011

In response to the well-documented need for rigorous evaluations of faculty development programs and increasing demands for institutional accountability, University of Minnesota's Office of Information Technology (OIT) researchers have developed an approach to program evaluation that assesses individual level changes to participants' attitudes,…

Descriptors: Program Evaluation, Information Technology, Faculty Development, Accountability

The Feasibility of Using Criterion-Referenced Tests for Large-Scale Evaluations.

Kosecoff, Jacqueline; Fink, Arlene – 1976

The feasibility of using criterion referenced tests (CRTs) in a large-scale evaluation conducted in an effectiveness evaluation context was investigated. The study began by examining the theory that structures the development and validation of CRTs to discover whether, on theoretical grounds alone, CRTs are suitable or not suitable for large-scale…

Descriptors: Criteria, Criterion Referenced Tests, Definitions, Feasibility Studies

The Reliability of Vermont Portfolio Scores in the 1992-93 School Year. Interim Report. RAND Reprints Series.

Koretz, Daniel; And Others – 1994

The 1992-93 school year saw the second statewide implementation of the Vermont portfolio-assessment program, and RAND continued its ongoing evaluation of the program's implementation, effects, and data quality. While the first year's study found evidence of the impact of the assessment program and low reliability of portfolio scoring, this year's…

Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Mathematics

What Did the Massachusetts Teacher Tests Say about American Education?

Fowler, R. Clarke – Phi Delta Kappan, 2001

Research says the school-improvement mechanisms favored by policymakers (more certification tests, like the Massachusetts Educator Certification Test that 59 percent of candidates failed in 1998; higher cut scores; and severe penalties for institutions not meeting pass rates) are unlikely to deliver increased accountability and better teachers.…

Descriptors: Cutting Scores, Elementary Secondary Education, Instructional Improvement, Mass Media

Issues of Reliability and Directional Bias in Standardized Achievement Tests: The Case of MAT70. P-5689.

Barker, Pierce; Pelavin, Sol H. – 1976

This study was mounted to assess the validity of standard score transformations of raw test scores and test bias on the 1970 edition of the Metropolitan Achievement Test Battery, in the context of a controversial federally funded compensatory education program, the Educational Voucher Demonstration (EVD). On an individual level the validity of the…

Descriptors: Achievement Gains, Achievement Tests, Educationally Disadvantaged, Elementary Education

Holistic Essay Scoring: An Application of the Model for the Evaluation of Writing Ability and the Measurement of Growth in Writing Ability Over Time.

Powills, Judith A.; And Others – 1979

Language arts teachers participated in an inservice program, The Writer's Clinic, and evaluated their students' improvement in writing ability using holistic essay scoring techniques. Seventh and eighth grade students were administered an essay composition pretest in December; the same essay topic was given as the post test in May. Students were…

Descriptors: Age Differences, Cost Effectiveness, Essay Tests, Essays

The Evolution of a Portfolio Program: The Impact and Quality of the Vermont Portfolio Program in Its Second Year (1992-93).

Koretz, Daniel – 1994

Since 1988 the Vermont Department of Education has been developing an innovative statewide performance assessment program. In 1990, the RAND Corporation began evaluating the Vermont assessment program, focusing specifically on the portfolio component of its assessment system. This report presents results from the evaluation in the 1992-93 school…

Descriptors: Educational Assessment, Elementary Secondary Education, Interviews, Mathematics Tests

Bibliography of Papers on Latent Trait Assessment.

Cohen, Allan S., Comp. – 1979

This partially annotated bibliography of journal articles, dissertations, convention papers, research reports, and a few books and unpublished manuscripts provides a comprehensive coverage of work on latent trait theory and practice. Documents are arranged alphabetically by author. The period covered ranges from the early 1950's to the present.…

Descriptors: Attitude Measures, Career Development, Computer Assisted Testing, Computer Programs

Criterion-Referenced Measurement.

Millman, Jason – 1974

This chapter should not only acquaint the reader with the present state of the art on Criterion-Referenced (CR) measurement but also suggest possible directions for further inquiry. The goal of the first part of this chapter is to deal with the definitional dilemma of CR measurement by proceeding from the more traditional view of CR measurement to…

Descriptors: Analysis of Variance, Bayesian Statistics, Behavioral Objectives, Comparative Analysis
