ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	42

Descriptor

Evaluation Research	50
Program Validation	50
Evaluation Methods	20
Item Analysis	17
Test Validity	13
Psychometrics	10
Research Methodology	10
Test Reliability	10
Foreign Countries	7
Program Evaluation	7
Intervention	6
Literature Reviews	6
Measures (Individuals)	6
Item Response Theory	5
Measurement Techniques	5
Replication (Evaluation)	5
Scoring Rubrics	5
Student Evaluation	5
Construct Validity	4
Educational Research	4
Evaluation Criteria	4
Interrater Reliability	4
Program Descriptions	4
Program Development	4
Program Effectiveness	4
More ▼

Publication Type

Journal Articles	44
Reports - Evaluative	20
Reports - Research	17
Reports - Descriptive	8
Dissertations/Theses -…	3
Information Analyses	3
Books	1
Guides - Non-Classroom	1
Opinion Papers	1

Education Level

Higher Education	10
Elementary Secondary Education	8
Postsecondary Education	4
Elementary Education	2
Middle Schools	2
Adult Education	1
Early Childhood Education	1
High Schools	1
Preschool Education	1
Secondary Education	1

Audience

Policymakers	1
Practitioners	1
Researchers	1

Location

Australia	3
California	1
Croatia	1
Hong Kong	1
Pennsylvania	1
United Kingdom	1
United Kingdom (England)	1
United States	1

Laws, Policies, & Programs

Individuals with Disabilities…

Assessments and Surveys

Minnesota Multiphasic…	2
California Psychological…	1
Graduate Record Examinations	1

What Works Clearinghouse Rating

Showing 1 to 15 of 50 results Save | Export

WWC Process Brief: The Study Review Process

Peer reviewed
PDF on ERIC

Download full text

What Works Clearinghouse, 2017

The What Works Clearinghouse (WWC) evaluates research studies that look at the effectiveness of education programs, products, practices, and policies, which the WWC calls "interventions." Many studies of education interventions make claims about impacts on students' outcomes. Some studies have designs that enable readers to make causal…

Descriptors: Program Design, Program Development, Program Effectiveness, Program Evaluation

Measuring Program Quality, Part 2: Addressing Potential Cultural Bias in a Rater Reliability Exam

Peer reviewed
PDF on ERIC

Download full text

Richer, Amanda; Charmaraman, Linda; Ceder, Ineke – Afterschool Matters, 2018

Like instruments used in afterschool programs to assess children's social and emotional growth or to evaluate staff members' performance, instruments used to evaluate program quality should be free from bias. Practitioners and researchers alike want to know that assessment instruments, whatever their type or intent, treat all people fairly and do…

Descriptors: Cultural Differences, Social Bias, Interrater Reliability, Program Evaluation

Student Online Readiness Assessment Tools: A Systematic Review Approach

Peer reviewed
PDF on ERIC

Download full text

Farid, Alem – Electronic Journal of e-Learning, 2014

Although there are tools to assess student's readiness in an "online learning context," little is known about the "psychometric" properties of the tools used or not. A systematic review of 5107 published and unpublished papers identified in a literature search on student online readiness assessment tools between 1990 and…

Descriptors: Online Courses, Electronic Learning, Learning Readiness, Psychometrics

Toward Universal Definitions for Direct and Indirect Assessment

Peer reviewed

Direct link

Elbeck, Matt; Bacon, Don – Journal of Education for Business, 2015

The absence of universally accepted definitions for direct and indirect assessment motivates the purpose of this article: to offer definitions that are literature-based and theoretically driven, meeting K. Lewin's (1945) dictum that, "There is nothing so practical as a good theory" (p. 129). The authors synthesize the literature to…

Descriptors: Definitions, Evaluation Methods, Global Approach, Evidence

Variety and Drift in the Functions and Purposes of Assessment in K-12 Education

Peer reviewed

Direct link

Ho, Andrew D. – Teachers College Record, 2014

Background/Context: The target of assessment validation is not an assessment but the use of an assessment for a purpose. Although the validation literature often provides examples of assessment purposes, comprehensive reviews of these purposes are rare. Additionally, assessment purposes posed for validation are generally described as discrete and…

Descriptors: Elementary Secondary Education, Standardized Tests, Measurement Objectives, Educational Change

Using Rasch Measurement to Score, Evaluate, and Improve Examinations in an Anatomy Course

Peer reviewed

Direct link

Royal, Kenneth D.; Gilliland, Kurt O.; Kernick, Edward T. – Anatomical Sciences Education, 2014

Any examination that involves moderate to high stakes implications for examinees should be psychometrically sound and legally defensible. Currently, there are two broad and competing families of test theories that are used to score examination data. The majority of instructors outside the high-stakes testing arena rely on classical test theory…

Descriptors: Item Response Theory, Scoring, Evaluation Methods, Anatomy

Exploring Alternative Approaches for Presenting Evaluation Results

Peer reviewed

Direct link

Johnson, Jeremiah; Hall, Jori; Greene, Jennifer C.; Ahn, Jeehae – American Journal of Evaluation, 2013

Evaluators have an obligation to present clearly the results of their evaluative efforts. Traditionally, such presentations showcase formal written and oral reports, with dispassionate language and graphs, tables, quotes, and vignettes. These traditional forms do not reach all audiences nor are they likely to include the most powerful presentation…

Descriptors: Evaluation Research, Change Strategies, Research Reports, Usability

An Empirical Examination of IRT Information for School Climate Surveys

Peer reviewed

Direct link

Mo, Lun; Yang, Fang; Hu, Xiangen – Educational Research and Evaluation, 2011

School climate surveys are widely applied in school districts across the nation to collect information about teacher efficacy, principal leadership, school safety, students' activities, and so forth. They enable school administrators to understand and address many issues on campus when used in conjunction with other student and staff data.…

Descriptors: Evidence, Academic Achievement, Questionnaires, Item Response Theory

Evaluation of the "e-rater"® Scoring Engine for the "GRE"® Issue and Argument Prompts. Research Report. ETS RR-12-02

Peer reviewed
PDF on ERIC

Download full text

Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012

Automated scoring models for the "e-rater"® scoring engine were built and evaluated for the "GRE"® argument and issue-writing tasks. Prompt-specific, generic, and generic with prompt-specific intercept scoring models were built and evaluation statistics such as weighted kappas, Pearson correlations, standardized difference in…

Descriptors: Scoring, Test Scoring Machines, Automation, Models

Self-Evaluation of Assessment Programs: A Cross-Case Analysis

Peer reviewed

Direct link

Baartman, Liesbeth K. J.; Prins, Frans J.; Kirschner, Paul A.; van der Vleuten, Cees P. M. – Evaluation and Program Planning, 2011

The goal of this article is to contribute to the validation of a self-evaluation method, which can be used by schools to evaluate the quality of their Competence Assessment Program (CAP). The outcomes of the self-evaluations of two schools are systematically compared: a novice school with little experience in competence-based education and…

Descriptors: Educational Innovation, Competency Based Education, Self Evaluation (Groups), Program Validation

A Comparison of Three Strategies for Scale Construction to Predict a Specific Behavioral Outcome

Peer reviewed

Direct link

Garb, Howard N.; Wood, James M.; Fiedler, Edna R. – Assessment, 2011

Using 65 items from a mental health screening questionnaire, the History Opinion Inventory-Revised (HOI-R), the present study compared three strategies of scale construction--(1) internal (based on factor analysis), (2) external (based on empirical performance) and (3) intuitive (based on clinicians' opinion)--to predict whether 203,595 U.S. Air…

Descriptors: Opinions, Mental Health, Test Validity, Measures (Individuals)

Performance Assessment for California Teachers (PACT): An Evaluation of Inter-Rater Reliability

Direct link

Porter, Jennifer Marie – ProQuest LLC, 2010

This research evaluated the inter-rater reliability of the Performance Assessment for California Teachers (PACT). Multiple methods for estimating overall rater consistency include percent agreement and Cohen's Kappa (1960), which yielded discrepancies between rater agreement in terms of whether candidates passed or failed particular PACT rubrics.…

Descriptors: Interrater Reliability, Program Effectiveness, Scoring Rubrics, Item Analysis

Development of the Policy Advocacy Behavior Scale: Initial Reliability and Validity

Peer reviewed

Direct link

Donaldson, Linda Plitt; Shields, Joseph – Research on Social Work Practice, 2009

Contemporary trends in social service delivery systems require human service agencies to engage in greater levels of advocacy to reform structures and protect programs that serve vulnerable populations. Objective: The purpose of this study was to develop an instrument to measure the policy advocacy behavior of nonprofit human service agencies.…

Descriptors: Human Services, Delivery Systems, Measures (Individuals), Social Work

Assessing Mentoring in Organizations: An Evaluation of Commercial Mentoring Instruments

Peer reviewed

Direct link

Gilbreath, Brad; Rose, Gail L.; Dietrich, Kim E. – Mentoring & Tutoring: Partnership in Learning, 2008

The purpose of this article is to inform readers about the types of instruments available for assessing and improving mentoring in organizations. Extensive review of the psychological, business and medical literature was conducted to identify commercially published, practitioner-oriented instruments. All of the instruments that were…

Descriptors: Mentors, Psychometrics, Literature Reviews, Evaluation Methods

Preliminary Validation of the Implementation Phases Inventory for Assessing Fidelity of Schoolwide Positive Behavior Supports

Peer reviewed

Direct link

Bradshaw, Catherine P.; Debnam, Katrina; Koth, Christine W.; Leaf, Philip – Journal of Positive Behavior Interventions, 2009

Schoolwide positive behavioral interventions and supports (SWPBIS) are becoming increasingly popular with schools across the country to help create safer learning environments for students. An important aspect of SWPBIS is the ongoing monitoring and evaluation of implementation fidelity. Although a few measures have been created to assess the…

Descriptors: Interrater Reliability, Positive Reinforcement, Behavior Modification, Program Validation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

American Journal of Evaluation	3
ProQuest LLC	3
Research on Social Work…	3
Journal of Positive Behavior…	2
Afterschool Matters	1
Anatomical Sciences Education	1
Assessment	1
Assessment for Effective…	1
British Journal of Music…	1
Bulletin of Science,…	1
ETS Research Report Series	1
Early Childhood Research…	1
Early Education and…	1
Educational Psychology in…	1
Educational Research and…	1
Electronic Journal of…	1
Evaluation and Program…	1
Evaluation and Research in…	1
Journal of Career Assessment	1
Journal of College Science…	1
Journal of Education for…	1
Journal of Education for…	1
Journal of Educational…	1
Journal of Information…	1
Journal of Interpersonal…	1
More ▼

Ahn, Jeehae	1
Baartman, Liesbeth K. J.	1
Babarovic, Toni	1
Bacon, Don	1
Benton, Tom	1
Bham, Mohammed	1
Bledsoe, Sarah E.	1
Bogaerts, Stefan	1
Boland, Joseph B.	1
Borgmeier, Christopher J.	1
Bracken, Stacey Storch	1
Bradshaw, Catherine P.	1
Bridgeman, Brent	1
Brown, Jacqueline A.	1
Campbell, Todd	1
Ceder, Ineke	1
Chamberlain, Tamsin	1
Charmaraman, Linda	1
Conklin, Sarah M.	1
Cornwall, Marie	1
Coryn, Chris L. S.	1
Coyne, Molly	1
Daniel, Ryan	1
Davey, Tim	1
Debnam, Katrina	1
More ▼