NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)0
Since 2006 (last 20 years)2
Audience
Researchers38
Practitioners5
Administrators2
Teachers2
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 38 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Porter, Andrew C.; Polikoff, Morgan S.; Goldring, Ellen; Murphy, Joseph; Elliott, Stephen N.; May, Henry – Educational Administration Quarterly, 2010
Research has consistently shown that principal leadership matters for successful schools. Evaluating principals on the behaviors shown to improve student learning should be an important leverage point for raising leadership quality. Yet principals are often evaluated with the use of instruments with no theoretical background and little, if any,…
Descriptors: Psychometrics, Instructional Leadership, Principals, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Beglar, David – Language Testing, 2010
The primary purpose of this study was to provide preliminary validity evidence for a 140-item form of the Vocabulary Size Test, which is designed to measure written receptive knowledge of the first 14,000 words of English. Nineteen native speakers of English and 178 native speakers of Japanese participated in the study. Analyses based on the Rasch…
Descriptors: Test Items, Native Speakers, Test Validity, Vocabulary
Holden, Ronald R. – 1985
Modern test construction strategies in the areas of personality and psychopathology differ in the use of disguise within test stimulus material. Previous research on the validity of using disguised test item content has favored the rational strategy of test construction which views disguise as a liability under normal test-taking circumstances.…
Descriptors: Adults, Evaluation Methods, Psychopathology, Test Construction
Peer reviewed Peer reviewed
Antonak, Richard F.; Harth, Robert – Mental Retardation, 1994
Psychometric analyses of data from 230 individuals yielded a 29-item 4-scale revision of the original 50-item 5-scale Mental Retardation Attitude Inventory. Results showed adequate item characteristics; adequate reliability and homogeneity; adequate reliability, homogeneity, specificity, and independence of the four scales; and initial validity…
Descriptors: Attitude Measures, Attitudes toward Disabilities, Mental Retardation, Psychometrics
Peer reviewed Peer reviewed
Brambring, M.; Troster, H. – Journal of Visual Impairment and Blindness, 1994
This study evaluated the Bielefeld Developmental Test for Blind Infants and Preschoolers by comparing cognitive performance of blind and sighted children (ages three and four). Results indicated that even this test (with "blind-neutral" items) did not permit a fair comparative assessment, though it did prove suitable for within-group…
Descriptors: Blindness, Cognitive Development, Cognitive Tests, Infants
Furst, Edward J. – 1983
Enough evidence has accumulated on Bloom's "Taxonomy of Educational Objectives" for the cognitive domain to justify a review of its communicability. This article covers both published and unpublished studies as well as certain informal reports that bear on this property. It also examines possibilities for improving agreement among…
Descriptors: Achievement Tests, Classification, Cognitive Processes, Diffusion (Communication)
Korpi, Meg; Haertel, Edward – 1984
The purpose of this paper is to further the cause of clarifying construct interpretations of tests, by proposing that non-metric multidimensional scaling may be more useful than factor analysis or other latent structure models for investigating the internal structure of tests. It also suggests that typical problems associated with scaling…
Descriptors: Correlation, Factor Structure, Intermediate Grades, Item Analysis
Peer reviewed Peer reviewed
Collis, Kevin F.; And Others – Journal for Research in Mathematics Education, 1986
Described are procedures followed in developing, administering, and scoring a set of mathematical problem-solving superitems and examining their construct validity through a recently developed evaluation technique associated with a taxonomy of the structure of learned outcomes. Data strongly support the validity of the underlying theoretical…
Descriptors: Educational Research, Elementary Secondary Education, Mathematics Education, Problem Solving
Jackson, Douglas N. – 1983
Concern for enhancing construct validity of vocational interest measures provides a focus for scale construction quite distinct from that derived from a criterion-referenced strategy: Construct-oriented measurement implies: (1) substantive definitions of dimensions; (2) concern for internal consistency reliability, as well as generalizability; (3)…
Descriptors: Career Counseling, Criterion Referenced Tests, Factor Analysis, Interest Inventories
Peer reviewed Peer reviewed
Shepard, Lorrie A.; And Others – Journal of Educational Measurement, 1985
The purpose of this research was to recommend an item bias procedure when the number of minority examinees is too small to use preferred three-parameter item response theory (IRT) methods. The chi-square, Angoff delta-plot, and pseudo-IRT indices were compared with both real and simulated data. (Author/DWH)
Descriptors: Estimation (Mathematics), Item Analysis, Latent Trait Theory, Minority Groups
Peer reviewed Peer reviewed
Germann, Paul J. – Journal of Research in Science Teaching, 1989
Describes a paper-and-pencil test for high school biology students measuring science process skills, such as developing hypotheses; making predictions; identifying assumptions; analyzing data; and formulating conclusions. Reports some data on reliability and validity of the test. Provides all 35 items of the test. (YP)
Descriptors: Biology, Science Materials, Science Tests, Secondary Education
Aghbar, Ali A.; Tang, Huixing – 1991
A study was undertaken to develop a partial credit scheme for scoring cloze-type questions on an English collocation test, obtain construct validity evidence for the test and the scoring scheme using the Rasch Partial Credit Model, and compare partial credit scoring with the more commonly used dichotomous scoring with the same test instrument.…
Descriptors: Cloze Procedure, College Students, English (Second Language), Language Tests
Haladyna, Thomas M.; Downing, Steven M. – 1985
In this paper 45 item-writing rules for multiple-choice tests presented in textbooks on educational measurement in a previous study are identified. The current study presents a quantitative review of the literature with respect to the empirical and theoretical evaluation of these principles of item-writing. Fifty-six studies that addressed at…
Descriptors: Educational Research, Elementary Secondary Education, Item Analysis, Multiple Choice Tests
Harvill, Leo M. – 1984
The objectives for this study were to: (1) develop a valid, reliable measure of test-wiseness with equivalent forms for use with students in the health sciences; and (2) determine the level of test-wiseness of entering medical students. The test-wiseness areas included in this study were: similar options, umbrella term, item give-away, convergence…
Descriptors: Higher Education, Measurement Techniques, Medical Students, Multiple Choice Tests
Sax, Gilbert; Reiter, Pauline B. – 1980
Despite the popularity of both multiple-choice (MC) and true-false (TF) items, most investigations comparing the two formats have done so to determine the optimum number of choices to be given to students within a given time period. The purpose of this investigation was to compare the reliabilities and the validities of both formats when the items…
Descriptors: Analysis of Variance, Correlation, Higher Education, Item Analysis
Previous Page | Next Page ยป
Pages: 1  |  2  |  3