NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 15,391 to 15,405 of 27,122 results Save | Export
Griffin, Patrick – 1990
Results of the International English Language Testing System (IELTS) battery trials in Australia are reported. The IELTS tests of productive language skills use direct assessment strategies and subjective scoring according to detailed guidelines. The receptive skills tests use indirect assessment strategies and clerical scoring procedures.…
Descriptors: English (Second Language), Foreign Countries, Grammar, Interrater Reliability
Breland, Hunter M. – 1983
Direct assessment of writing skill, usually considered to be synonymous with assessment by means of writing samples, is reviewed in terms of its history and with respect to evidence of its reliability and validity. Reliability is examined as it is influenced by reader inconsistency, domain sampling, and other sources of error. Validity evidence is…
Descriptors: Essay Tests, Evaluation Needs, Higher Education, Interrater Reliability
Barter, Alice K.; And Others – 1980
A follow-up study of two instruments for evaluating college writing was conducted. The experimental scale (E Scale) was developed in 1976 and revised for this study. The control scale (C Scale) was described in the literature in 1977. Ten English majors graded ten essays from diagnostic entrance exams. Both the E Scale and the C Scale were used,…
Descriptors: College Entrance Examinations, Comparative Testing, Essay Tests, Evaluation Criteria
Van Velsor, Ellen; Leslie, Jean Brittain; Fleenor, John W. – 1997
This book presents a nontechnical, step-by-step process that shows how to evaluate any 360-degree-feedback instrument intended for management or leadership development. The 360-degree-feedback instruments collect information from different sources about a target manager's performance, and they offer multiple perspectives. The 16 steps in…
Descriptors: Administrator Characteristics, Evaluation Methods, Feedback, Interrater Reliability
Peer reviewed Peer reviewed
Smith, Philip L. – Journal of Educational Measurement, 1979
In this study, generalizability theory is used to examine the dependability of student rating data for making judgments about courses and instruction. The importance of giving adequate attention to the specification of the universe of admissible observations in generalizability theory is discussed. (Author/CTM)
Descriptors: Analysis of Variance, Course Evaluation, Definitions, Higher Education
Peer reviewed Peer reviewed
Whitely, Susan E. – Applied Psychological Measurement, 1979
Two sources of inconsistency were separated by reanalyzing data from a major study on short-term consistency. Little evidence was found for generalizability or behavioral predictability. Results supported the assumption that measurement error from short-term fluctuations is not due to systematic individual differences in response consistency.…
Descriptors: Behavior Change, Cognitive Processes, College Freshmen, Error of Measurement
Peer reviewed Peer reviewed
Eley, Malcolm G.; Stecher, Erica J. – Assessment & Evaluation in Higher Education, 1997
Three studies compared the common Likert agree/disagree question form to a behavioral observation form for faculty evaluation. The Likert-type format prompted global, impressionistic responses; the behavioral observation form prompted more objective responses. Results suggest use of behavioral observation rather than agree/disagree questions can…
Descriptors: Behavior Rating Scales, College Faculty, Faculty Evaluation, Higher Education
Peer reviewed Peer reviewed
Direct linkDirect link
Berg, Marie; Jahnsen, Reidun; Froslie, Kathrine Frey; Hussain, Aktahr – Physical & Occupational Therapy in Pediatrics, 2004
Pediatric Evaluation of Disability Inventory (PEDI) is an instrument for evaluating function in children with disabilities aged 6 months to 7.5 years. The PEDI measures both functional performance and capability in three domains: (1) self-care, (2) mobility, and (3) social function. The PEDI has recently been translated into Norwegian. The purpose…
Descriptors: Disabilities, Young Children, Measures (Individuals), Norwegian
Peer reviewed Peer reviewed
Direct linkDirect link
Scahill, Lawrence; McDougle, Christopher J.; Williams, Susan K.; Dimitropoulos, Anastasia; Aman, Michael G.; McCracken, James T.; Tierney, Elaine; Arnold, L. Eugene; Cronin, Pegeen; Grados, Marco; Ghuman, Jaswinder; Koenig, Kathleen; Lam, Kristen S. L.; McGough, James; Posey, David J.; Ritz, Louise; Swiezy, Naomi B.; Vitiello, Benedetto – Journal of the American Academy of Child and Adolescent Psychiatry, 2006
Objective: To examine the psychometric properties of the Children's Yale-Brown Obsessive Compulsive Scales (CYBOCS) modified for pervasive developmental disorders (PDDs). Method: Raters from five Research Units on Pediatric Psychopharmacology (RUPP) Autism Network were trained to reliability. The modified scale (CYBOCS-PDD), which contains only…
Descriptors: Children, Severity (of Disability), Test Reliability, Behavior Disorders
New Mexico Public Education Department, 2007
The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…
Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring
Alderson, J. Charles; And Others – 1995
The guide is intended for teachers who must construct language tests and for other professionals who may need to construct, evaluate, or use the results of language tests. Most examples are drawn from the field of English-as-a-Second-Language instruction in the United Kingdom, but the principles and practices described may be applied to the…
Descriptors: Educational Trends, English (Second Language), Interrater Reliability, Language Tests
Morgan, George A.; Bartholomew, Sheridan – 1998
This study examined the reliability and construct validity of two types of measures of mastery motivation for elementary school children: a new version of the Dimensions of Mastery Questionnaires (DMQ) and behavioral mastery tasks. Participating were 64 mostly middle class and Caucasian 7- and 10-year-olds living in a middle-sized western city.…
Descriptors: Childhood Attitudes, Construct Validity, Elementary Education, Elementary School Students
Rothman, M. L.; And Others – 1982
A practical application of generalizability theory, demonstrating how the variance components contribute to understanding and interpreting the data collected to evaluate a program, is described. The evaluation concerned 120 learning modules developed for the Dental Auxiliary Education Project. The goals of the project were to design, implement,…
Descriptors: Correlation, Data Collection, Dental Schools, Educational Research
Reed, Donald B.; And Others – 1988
An instrument was developed to assess principal leadership. Two studies were then conducted to assess the reliability, validity, and utility of the instrument. Leadership style is the relative intensity of the presence of four modes of authority (traditional, charismatic, legal, and expert authority) and four modes of power (moral, psychological,…
Descriptors: Administrator Evaluation, Administrators, Construct Validity, Educational Assessment
Cronin, Linda L.; Capie, William – 1985
The purpose of this study was to compare the scoring of Teacher Performance Assessment Instruments (TPAI) indicators using discrete descriptors when some are considered "essential" with the scoring of these same indicators, and when no descriptors are considered essential. The two questions addressed in this study were: (1) To what…
Descriptors: Analysis of Variance, Behavior Rating Scales, Classroom Observation Techniques, Data Collection
Pages: 1  |  ...  |  1023  |  1024  |  1025  |  1026  |  1027  |  1028  |  1029  |  1030  |  1031  |  ...  |  1809