ERIC - Search Results

Showing all 13 results

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

Measuring Program Quality, Part 2: Addressing Potential Cultural Bias in a Rater Reliability Exam

Peer reviewed

Richer, Amanda; Charmaraman, Linda; Ceder, Ineke – Afterschool Matters, 2018

Like instruments used in afterschool programs to assess children's social and emotional growth or to evaluate staff members' performance, instruments used to evaluate program quality should be free from bias. Practitioners and researchers alike want to know that assessment instruments, whatever their type or intent, treat all people fairly and do…

Descriptors: Cultural Differences, Social Bias, Interrater Reliability, Program Evaluation

On the Validity of Educational Evaluation and Its Construction

Peer reviewed

Huang, Xiaoping; Hu, Zhongfeng – Higher Education Studies, 2015

The main problem of the educational evaluation validity is that it just copies the conceptual framework system of validity from educational measurement to its own conceptual system. The validity conceptual system that fits the need of theory and practice of educational evaluation has not been established yet. According to the inherent attributive…

Descriptors: Test Validity, Educational Assessment, Evaluation Problems, Theory Practice Relationship

The Public Understanding of Error in Educational Assessment

Peer reviewed

Gardner, John – Oxford Review of Education, 2013

Evidence from recent research suggests that in the UK the public perception of errors in national examinations is that they are simply mistakes; events that are preventable. This perception predominates over the more sophisticated technical view that errors arise from many sources and create an inevitable variability in assessment outcomes. The…

Descriptors: Educational Assessment, Public Opinion, Error of Measurement, Foreign Countries

Academic Achievement Survey and Educational Assessment Research

Peer reviewed

Tanaka, Koji – Educational Studies in Japan: International Yearbook, 2009

The recent "Nationwide academic achievement and study situation survey" was clearly influenced by the idea of "authentic assessment", an educational assessment perspective focused on "quality" and "engagement". However, when "performance assessment", the assessment method corresponding to this…

Descriptors: Educational Assessment, Performance Based Assessment, Academic Achievement, Educational Research

A Framework for Test Validity Research on Content Assessments Taken by English Language Learners

Peer reviewed

Young, John W. – Educational Assessment, 2009

In this article, I specify a conceptual framework for test validity research on content assessments taken by English language learners (ELLs) in U.S. schools in grades K-12. This framework is modeled after one previously delineated by Willingham et al. (1988), which was developed to guide research on students with disabilities. In this framework…

Descriptors: Test Validity, Evaluation Research, Achievement Tests, Elementary Secondary Education

Issues of Stability and Change in Interest Development

Peer reviewed

Tracey, Terence J. G.; Sodano, Sandro M. – Career Development Quarterly, 2008

Interest development is not an easily studied process. There are at least 4 methods for examining the process of stability and change over time: relative stability, absolute stability, profile stability, and structural stability. A program of research that focuses on examining these 4 types of stability is summarized relative to the issues…

Descriptors: Vocational Interests, Childhood Interests, Attitude Change, Research Projects

Keeping the Focus on the Child: Supporting and Reporting on Teaching and Learning with a Classroom-Based Performance Assessment System

Peer reviewed

Falk, Beverly; Ort, Suzanne Wichterle; Moirs, Katie – Educational Assessment, 2007

This article describes the findings of studies conducted on a large-scale, classroom-based performance assessment of literacy for the early grades designed to provide information that is useful for reporting, as well as teaching. Technical studies found the assessment to be a promising instrument that is reliable and valid. Follow-up studies of…

Descriptors: Program Effectiveness, Performance Based Assessment, Student Evaluation, Evaluation Research

Self-Evaluations in Adult Education and Training

Peer reviewed

Athanasou, James A. – Australian Journal of Adult Learning, 2005

This paper focuses on two key aspects of self-evaluation in adult education and training through the perspective of (a) a social-cognitive framework which is used to categorise those factors that enhance self-efficacy and self-evaluation, and (b) the accuracy of self-evaluation. The social-cognitive framework categorises the factors that enhance…

Descriptors: Self Efficacy, Adult Education, Self Evaluation (Individuals), Social Cognition

Quality in Early Childhood Care and Education Settings: A Compendium of Measures

Flood, Mirjam; Weinstein, Debra; Halle, Tamara; Martin, Laurie; Tout, Kathryn; Wandner, Laura; Vick, Jessica; Sherman, Juli; Hair, Elizabeth – Child Trends, 2007

Quality measures were originally developed for research aimed at describing the settings that children spend time in and identifying the characteristics of these environments that contribute to children's development. They were also developed to guide improvements in practice. Increasingly, however, measures of quality are being used for further…

Descriptors: Validity, Reliability, Child Care, Educational Quality

How Reliable Are Informal Reading Inventories?

Peer reviewed

Spector, Janet E. – Psychology in the Schools, 2005

Informal Reading Inventories (IRI) are often recommended as instructionally relevant measures of reading. However, they have also been criticized for inattention to technical quality. Examination of reliability evidence in nine recently revised IRIs revealed that fewer than half report reliability. Several appear to have sufficient reliability for…

Descriptors: Informal Reading Inventories, Reading Instruction, Reading Difficulties, Reading Research

Considerations for Creating Multi-Language Personality Norms: A Three-Component Model of Error

Peer reviewed

Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008

With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…

Descriptors: Global Approach, Cultural Differences, Norms, Human Resources

Item Difficulty Modeling of Paragraph Comprehension Items

Peer reviewed

Gorin, Joanna S.; Embretson, Susan E. – Applied Psychological Measurement, 2006

Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…

Descriptors: Difficulty Level, Test Items, Modeling (Psychology), Paragraph Composition
