Riana Nurhayati; Suranto Aw; Siti Irene Astuti Dwiningrum; Mami Hajaroh; Herwin Herwin – International Journal of Educational Methodology, 2024
Evaluation of child-friendly school (CFS) policies is essential to determine the achievements of school efforts in reducing violence cases. This research aims to prove the reliability and validity of CFS policy evaluation instruments in elementary schools in different locations. This investigation uses the Context Input Process Product (CIPP)…
Descriptors: Validity, Reliability, School Policy, Program Evaluation
Geiger, Tray J.; Amrein-Beardsley, Audrey – AASA Journal of Scholarship & Practice, 2017
In this commentary, we discuss three types of data manipulations that can occur within teacher evaluation methods: artificial inflation, artificial deflation, and artificial conflation. These types of manipulation are more popularly known in the education profession as instances of Campbell's Law (1976), which states that the higher the…
Descriptors: Teacher Evaluation, Evaluation Methods, Data Analysis, Personnel Policy
Greenberg, Kathleen Puglisi – Teaching of Psychology, 2012
The scoring instrument described in this article is based on a deconstruction of the seven sections of an American Psychological Association (APA)-style empirical research report into a set of learning outcomes divided into content-, expression-, and format-related categories. A double-weighting scheme used to score the report yields a final grade…
Descriptors: Scoring, Research Reports, Grading, Outcome Measures
Geng, Yaoguo; Xia, Dan; Qin, Beibei – Child Psychiatry and Human Development, 2012
The purpose of this study was to evaluate the reliability and validity of the Chinese version of the Basic Empathy Scale (BES). The Chinese version of BES was administered to a sample (n = 1,524) aged 9-18 and 65 males with conduct disorder aged 13-18. The result of confirmatory factor analysis showed a two-factor structure with four items deleted…
Descriptors: Emotional Problems, Test Validity, Factor Structure, Measures (Individuals)
Hager, Karen D.; Slocum, Timothy A. – Education and Training in Developmental Disabilities, 2008
Alternate assessments are the means through which students with significant cognitive disabilities participate in accountability testing; thus, the measurement validity of alternate assessments is a critical aspect of state educational accountability systems. When evaluating the validity of assessment systems, it is important to take a broad view of…
Descriptors: Test Content, Student Evaluation, Alternative Assessment, Test Validity

Horan, John J.; Williams, John M. – Journal of Drug Education, 1975
Difficulties involved with the evaluation of drug abuse prevention programs are numerous. The Tentative Drug Use Scale (TDUS) was designed in response to a number of specific problems associated with obtaining behavioral data. Advantages of this scale over others are discussed. Reliability and validity information is provided. (Author)
Descriptors: Drug Abuse, Drug Addiction, Evaluation Methods, Prevention

Moody, Donna K.; And Others – Language, Speech, and Hearing Services in Schools, 1979
The validity and reliability of the Wilson Voice Profile System (WVPS) were assessed by using taped, matched voice samples from groups of 20 previously evaluated voice-disordered children and 20 normal children. A group of 11 trained listener judges evaluated these vocal samples. Results for differentiating normal from abnormal voiced children were similar to…
Descriptors: Evaluation Methods, Exceptional Child Research, Reliability, Speech Handicaps

Blakely, Craig H.; And Others – Criminal Justice and Behavior, 1980
This article presents the initial methodology involved in the construction of a self-report instrument of delinquent behavior, as well as the comparison of 10 frequency and seriousness weighting schemes. Results indicate that the inclusion of weighting schemes does not strengthen the instrument or add to its applicability. (Author)
Descriptors: Adolescents, Delinquency, Evaluation Methods, Males
The Constant Danger of Sacrificing Validity to Reliability: Making Writing Assessment Serve Writers.

Wiggins, Grant – Assessing Writing, 1994
Suggests that assessment must be built into the curriculum and focused upon the kinds of skills students need. Considers much educational testing in writing to be reductionist, unrealistic, and detrimental to learning. Critiques writing assessment's trust and reliance on a single or small sample of student work collected and scored outside of a…
Descriptors: Elementary Secondary Education, Evaluation Methods, Reliability, Student Evaluation

Kirkley, Karen N.; Fisher, Anne G. – Journal of Outcome Measurement, 1999
The alternate-forms reliability of the Assessment of Motor and Process Skills (AMPS) (A. Fisher, 1997), where alternate forms means different pairs of AMPS tasks, was studied with 91 people who had performed four AMPS tasks. Results support use of the AMPS activities of daily-living motor and process scales. (SLD)
Descriptors: Adults, Daily Living Skills, Diagnostic Tests, Disabilities

Ryser, Gail R. – Journal of Secondary Gifted Education, 1994
The meanings of reliability and validity as they apply to standardized measures are used as a framework for applying the concepts of reliability and validity to authentic assessments. This article sees reliability as scorability and stability, whereas validity is seen as students' ability to use knowledge authentically in the field. (DB)
Descriptors: Elementary Secondary Education, Evaluation Methods, Performance Based Assessment, Reliability

Stelmachers, Zigfrids T.; Sherman, Robert E. – Suicide and Life-Threatening Behavior, 1990
Presented 33 case histories of suicidal patients to crisis workers (N=19) for ratings of short- and long-term suicide risk. Ratings revealed considerable variability, raising questions about the reliability of such global assessments. The variability, as measured by the standard deviation, was comparable between short-term and long-term ratings.…
Descriptors: Anger, At Risk Persons, Case Studies, Crisis Intervention

Royer, James M. – Journal of Adolescent & Adult Literacy, 2001
Describes a team-based approach for creating Sentence Verification Technique (SVT) tests, a development procedure that allows teachers and other school personnel to develop comprehension tests from curriculum materials in use in their schools. Finds that if tests are based on materials that are appropriate for the population to be tested, the…
Descriptors: Elementary Secondary Education, Evaluation Methods, Listening Comprehension Tests, Reading Tests
Flood, Mirjam; Weinstein, Debra; Halle, Tamara; Martin, Laurie; Tout, Kathryn; Wandner, Laura; Vick, Jessica; Sherman, Juli; Hair, Elizabeth – Child Trends, 2007
Quality measures were originally developed for research aimed at describing the settings that children spend time in and identifying the characteristics of these environments that contribute to children's development. They were also developed to guide improvements in practice. Increasingly, however, measures of quality are being used for further…
Descriptors: Validity, Reliability, Child Care, Educational Quality

Zweizig, Douglas L. – Public Libraries, 1987
Discusses current issues in measuring library effectiveness, including rapid technological changes and increasing obsolescence in librarians' competencies; increased pressure for accountability; tools to help libraries become what they determine they should be rather than conforming to national standards; and how to determine the validity and…
Descriptors: Evaluation Methods, Library Administration, Library Automation, Library Planning