Descriptor
Scoring | 19 |
Test Reliability | 19 |
Test Validity | 11 |
Interrater Reliability | 6 |
Writing Evaluation | 6 |
Test Construction | 4 |
Elementary Secondary Education | 3 |
Essay Tests | 3 |
Higher Education | 3 |
Item Analysis | 3 |
Scores | 3 |
More ▼ |
Author
Carlson, Sybil B. | 2 |
Alliger, R. J. | 1 |
Camp, Roberta | 1 |
Capie, William | 1 |
Costantino, Giuseppe | 1 |
Cronin, Linda L. | 1 |
Gilmer, Jerry S. | 1 |
Haladyna, Thomas M. | 1 |
Harvey, A. L. | 1 |
Johnson, William L. | 1 |
Koretz, Daniel | 1 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 19 |
Practitioners | 4 |
Policymakers | 2 |
Location
Vermont | 1 |
West Germany | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Graduate Record Examinations | 2 |
Test of English as a Foreign… | 2 |
Childrens Report of Parental… | 1 |
Teacher Performance… | 1 |
Thematic Apperception Test | 1 |
What Works Clearinghouse Rating
Alliger, R. J.; Harvey, A. L. – 1984
This article discusses practical and theoretical problems related to the measurement of formal operations. The first section of the article discusses problems in measuring formal operations using the clinical interview method. These problems include the lack of both a standardized interview and a uniform scoring procedure. Section two discusses…
Descriptors: Developmental Stages, Group Testing, Interviews, Objective Tests
Wainer, Howard – 1985
Techniques derived from item response theory are useful for estimating the reliability of test classification above and below the cutting score. Test developers can construct a test whose information is peaked in the region of the cutting score; users can select a test which provides the most information in this region. The Cut-Score…
Descriptors: Cutting Scores, Item Analysis, Latent Trait Theory, Mastery Tests

Sweitzer, H. Frederick; Weinstein, Gerald – 1985
Self-Knowledge Development Theory (SKDT) by Weinstein and A. Alschuler (1985) is a structural developmental theory positing four stages in the development of self-knowledge. The Experience Recall Test-2 (ERT2) is described, which is the most recent instrument developed for assessing the SKDT. Self-knowledge is defined as the ability to describe…
Descriptors: Classification, Developmental Stages, Group Testing, Individual Development
Haladyna, Thomas M. – 1984
The purpose of this study is to examine an option-weighting method as it affects pass-fail decisions in formative and summative evaluation of student achievement for instructional units, certification, advancement, licensure, admissions, placement, and selection. A database was constructed using high school achievement test data where a…
Descriptors: Achievement Tests, Cutting Scores, High Schools, Multiple Choice Tests
Lehmann, Rainer H. – 1987
A total of 1,487 eleventh grade students from the Hamburg (West Germany) school system were asked to complete four writing assignments used in an International Association for the Evaluation of Educational Achievement (IEA) study of writing assessment. In analyzing the writing samples, the study focused on: (1) between-rater effects; (2)…
Descriptors: Evaluation Problems, Foreign Countries, High Schools, International Programs
Livingston, Samuel A. – 1984
Much previously published material for estimating the reliability of classification has been based on the assumption that a test consists of a known number of equally weighted items. The test score is the number of those items answered correctly. These methods cannot be used with classifications based on weighted composite scores, especially if…
Descriptors: Equated Scores, Essay Tests, Estimation (Mathematics), Mathematical Models

Schwarz, J. Conrad; And Others – Child Development, 1985
Examines the reliability and validity of the scores of diverse informants from the Child's Report of Parental Behavior Inventory (CRPBI). Also considers the utility of aggregating scores of parental behavior derived from multiple observers. CRPBI items were adapted to obtain mother's, father's, sibling's, and subject's ratings of parental behavior…
Descriptors: Child Rearing, Data Collection, Measures (Individuals), Parent Child Relationship
Taylor, Marcia B; Porterfield, William D. – 1984
This paper describes the Measure of Epistemological Reflection (MER), an instrument to assess cognitive developmental level according to the Perry scheme of intellectual and ethical development. It contains sets of questions for each of the six cognitive domains: decision making, learner role, instructor role in the learning process, peer role in…
Descriptors: Cognitive Development, Cognitive Tests, Epistemology, Higher Education

Nelson, Larry R. – Educational Measurement: Issues and Practice, 1984
The author argues that scoring, reporting, and deriving final grades can be considerably assisted by using a computer. He also contends that the savings in time and the computer database formed will allow instructors to determine test quality and reflect on the quality of instruction. (BW)
Descriptors: Achievement Tests, Affective Objectives, Computer Assisted Testing, Educational Testing
Johnson, William L.; And Others – 1984
This report describes a diagnostic instrument designed for school districts to use in assessing the training needs of bilingual staff. The construction and trial administration of the instrument involved several stages. Among these were the development of a pool of subscaled questions, questionnaire forms, and a pool of instrument questions. The…
Descriptors: Bilingual Education, Bilingual Teachers, Inservice Teacher Education, Needs Assessment
Shale, Doug – 1986
This study is an attempt at a cohesive characterization of the concept of essay reliability. As such, it takes as a basic premise that previous and current practices in reporting reliability estimates for essay tests have certain shortcomings. The study provides an analysis of these shortcomings--partly to encourage a fuller understanding of the…
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Essay Tests
Younglove, William A. – 1983
In the early twentieth century behaviorist Edward L. Thorndike began the development and use of measurement scales to replace personal judgment to evaluate student compositions in U.S. public schools. In 1912, utilizing the Fullerton and Catell equal difference theorem, Milo B. Hillegas released the first scientifically designed scale to measure…
Descriptors: Behavior Theories, Educational History, Elementary Secondary Education, Evaluation Methods
Lenel, Julia C.; Gilmer, Jerry S. – 1986
In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…
Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)
Cronin, Linda L.; Capie, William – 1985
The purpose of this study was to compare the scoring of Teacher Performance Assessment Instruments (TPAI) indicators using discrete descriptors when some are considered "essential" with the scoring of these same indicators, and when no descriptors are considered essential. The two questions addressed in this study were: (1) To what…
Descriptors: Analysis of Variance, Behavior Rating Scales, Classroom Observation Techniques, Data Collection
Costantino, Giuseppe; And Others – 1985
The theoretical framework and cross-cultural validation of Tell-Me-A-Story (TEMAS), a projective test developed to measure personality development in ethnic minority children, is presented. The TEMAS test consists of 23 chromatic pictures which incorporate the following characteristics: (1) representation of antithetical concepts which the…
Descriptors: Black Students, Culture Fair Tests, Elementary Education, Hispanic Americans
Previous Page | Next Page ยป
Pages: 1 | 2