Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
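As an illustration of the internal-consistency estimates named in the abstract above, the following is a minimal sketch of coefficient α and KR-20 using their standard textbook formulas. It is not taken from the article; the function names and the small response matrix are hypothetical, and population variances are assumed throughout.

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """Coefficient alpha for a matrix of item scores.

    scores: list of respondents, each a list of k item scores.
    Uses population variances; the choice cancels out as long as the
    same variance definition is used for items and totals.
    """
    k = len(scores[0])
    # Sum of the variances of each item (column).
    item_var_sum = sum(pvariance(col) for col in zip(*scores))
    # Variance of each respondent's total score.
    total_var = pvariance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - item_var_sum / total_var)


def kr20(scores):
    """KR-20 for dichotomous (0/1) items; a special case of alpha."""
    k = len(scores[0])
    n = len(scores)
    # For a 0/1 item, the population variance is p * (1 - p).
    pq_sum = 0.0
    for col in zip(*scores):
        p = sum(col) / n
        pq_sum += p * (1 - p)
    total_var = pvariance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - pq_sum / total_var)


# Hypothetical data: 5 respondents answering 4 right/wrong items.
responses = [
    [1, 1, 1, 0],
    [1, 0, 1, 0],
    [1, 1, 1, 1],
    [0, 0, 1, 0],
    [0, 0, 0, 0],
]

alpha = cronbach_alpha(responses)
kr = kr20(responses)
```

For dichotomous items the two estimates coincide, which is why KR-20 is usually described as a special case of α.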
Richer, Amanda; Charmaraman, Linda; Ceder, Ineke – Afterschool Matters, 2018
Like instruments used in afterschool programs to assess children's social and emotional growth or to evaluate staff members' performance, instruments used to evaluate program quality should be free from bias. Practitioners and researchers alike want to know that assessment instruments, whatever their type or intent, treat all people fairly and do…
Descriptors: Cultural Differences, Social Bias, Interrater Reliability, Program Evaluation
Huang, Xiaoping; Hu, Zhongfeng – Higher Education Studies, 2015
The main problem with validity in educational evaluation is that its conceptual framework has simply been copied from educational measurement. A conceptual system of validity that fits the theory and practice of educational evaluation has not yet been established. According to the inherent attributive…
Descriptors: Test Validity, Educational Assessment, Evaluation Problems, Theory Practice Relationship
Gardner, John – Oxford Review of Education, 2013
Evidence from recent research suggests that in the UK the public perception of errors in national examinations is that they are simply mistakes; events that are preventable. This perception predominates over the more sophisticated technical view that errors arise from many sources and create an inevitable variability in assessment outcomes. The…
Descriptors: Educational Assessment, Public Opinion, Error of Measurement, Foreign Countries
Tanaka, Koji – Educational Studies in Japan: International Yearbook, 2009
The recent "Nationwide academic achievement and study situation survey" was clearly influenced by the idea of "authentic assessment", an educational assessment perspective focused on "quality" and "engagement". However, when "performance assessment", the assessment method corresponding to this…
Descriptors: Educational Assessment, Performance Based Assessment, Academic Achievement, Educational Research
Young, John W. – Educational Assessment, 2009
In this article, I specify a conceptual framework for test validity research on content assessments taken by English language learners (ELLs) in U.S. schools in grades K-12. This framework is modeled after one previously delineated by Willingham et al. (1988), which was developed to guide research on students with disabilities. In this framework…
Descriptors: Test Validity, Evaluation Research, Achievement Tests, Elementary Secondary Education
Tracey, Terence J. G.; Sodano, Sandro M. – Career Development Quarterly, 2008
Interest development is not an easily studied process. There are at least 4 methods for examining the process of stability and change over time: relative stability, absolute stability, profile stability, and structural stability. A program of research that focuses on examining these 4 types of stability is summarized relative to the issues…
Descriptors: Vocational Interests, Childhood Interests, Attitude Change, Research Projects
Falk, Beverly; Ort, Suzanne Wichterle; Moirs, Katie – Educational Assessment, 2007
This article describes the findings of studies conducted on a large-scale, classroom-based performance assessment of literacy for the early grades designed to provide information that is useful for reporting, as well as teaching. Technical studies found the assessment to be a promising instrument that is reliable and valid. Follow-up studies of…
Descriptors: Program Effectiveness, Performance Based Assessment, Student Evaluation, Evaluation Research
Athanasou, James A. – Australian Journal of Adult Learning, 2005
This paper focuses on two key aspects of self-evaluation in adult education and training through the perspective of (a) a social-cognitive framework which is used to categorise those factors that enhance self-efficacy and self-evaluation, and (b) the accuracy of self-evaluation. The social-cognitive framework categorises the factors that enhance…
Descriptors: Self Efficacy, Adult Education, Self Evaluation (Individuals), Social Cognition
Flood, Mirjam; Weinstein, Debra; Halle, Tamara; Martin, Laurie; Tout, Kathryn; Wandner, Laura; Vick, Jessica; Sherman, Juli; Hair, Elizabeth – Child Trends, 2007
Quality measures were originally developed for research aimed at describing the settings that children spend time in and identifying the characteristics of these environments that contribute to children's development. They were also developed to guide improvements in practice. Increasingly, however, measures of quality are being used for further…
Descriptors: Validity, Reliability, Child Care, Educational Quality
Spector, Janet E. – Psychology in the Schools, 2005
Informal Reading Inventories (IRI) are often recommended as instructionally relevant measures of reading. However, they have also been criticized for inattention to technical quality. Examination of reliability evidence in nine recently revised IRIs revealed that fewer than half report reliability. Several appear to have sufficient reliability for…
Descriptors: Informal Reading Inventories, Reading Instruction, Reading Difficulties, Reading Research
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
Gorin, Joanna S.; Embretson, Susan E. – Applied Psychological Measurement, 2006
Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…
Descriptors: Difficulty Level, Test Items, Modeling (Psychology), Paragraph Composition