NotesFAQContact Us
Collection
Advanced
Search Tips
Education Level
Audience
Researchers25
Practitioners2
Teachers2
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 25 results Save | Export
Wainer, Howard – 1985
Techniques derived from item response theory are useful for estimating the reliability of test classification above and below the cutting score. Test developers can construct a test whose information is peaked in the region of the cutting score; users can select a test which provides the most information in this region. The Cut-Score…
Descriptors: Cutting Scores, Item Analysis, Latent Trait Theory, Mastery Tests
Hodgin, Robert F. – 1984
Guidelines for the construction and use of an attitude instrument are presented, and the application of the instrument to measure student attitude toward economics is described. Attention is directed to the Likert-like summated forced-choice variety of attitude instrument, whereby attitude toward the object is inferred from the summed responses to…
Descriptors: Attitude Measures, Economics Education, Higher Education, Item Analysis
Bliss, Leonard B. – 1984
A model for the validation of standardized tests of academic achievement upon populations not represented in the samples used to standardize the tests is presented, and the results of a field testing of the model are described. The 1973 editions of the Stanford Achievement Test and the Test of Academic Skills were administered to a sample of…
Descriptors: Achievement Tests, Basic Skills, Elementary Secondary Education, Item Analysis
Peer reviewed Peer reviewed
Wilson, Mark – Journal for Research in Mathematics Education, 1990
Summarizes a reanalysis of the data from an investigation of a test designed to measure a learning sequence in geometry based on the work of van Hiele (1986). Discusses the test based on the Rasch model. (YP)
Descriptors: Geometric Concepts, Geometry, Item Analysis, Mathematical Concepts
Rojahn, Johannes; Tasse, Marc J.; Sturmey, Peter – American Journal on Mental Retardation, 1997
Development of the Stereotyped Behavior Scale for adolescents and adults with mental retardation is described. Use with 600 individuals resulted in refinement and a 26-item scale with an internal consistency alpha of 0.88, test-retest reliability of p=0.90, and interrater reliability of p=0.76. (DB)
Descriptors: Adolescents, Adults, Behavior Patterns, Behavior Rating Scales
Peer reviewed Peer reviewed
Jacobs, Stanley S. – Research in Higher Education, 1995
Comparison of college freshman performance on two different forms of the California Critical Thinking Skills Test (n=684, 692) found a lack of equivalence between forms and low internal consistency reliability. It is suggested that, although the test may be useful for research, it is not appropriate for decision making about individual students.…
Descriptors: College Freshmen, Comparative Analysis, Critical Thinking, Educational Research
Haladyna, Thomas M.; Downing, Steven M. – 1985
In this paper 45 item-writing rules for multiple-choice tests presented in textbooks on educational measurement in a previous study are identified. The current study presents a quantitative review of the literature with respect to the empirical and theoretical evaluation of these principles of item-writing. Fifty-six studies that addressed at…
Descriptors: Educational Research, Elementary Secondary Education, Item Analysis, Multiple Choice Tests
Sax, Gilbert; Reiter, Pauline B. – 1980
Despite the popularity of both multiple-choice (MC) and true-false (TF) items, most investigations comparing the two formats have done so to determine the optimum number of choices to be given to students within a given time period. The purpose of this investigation was to compare the reliabilities and the validities of both formats when the items…
Descriptors: Analysis of Variance, Correlation, Higher Education, Item Analysis
Snyder, Scott; Sheehan, Robert – Diagnostique, 1992
Rasch calibration procedures were applied to item-response data for the 1,262 infants and toddlers comprising the standardization sample for the Mental Scale of the Bayley Scales of Infant Development. Analyses tend to confirm the psychometric integrity of the instrument. (Author)
Descriptors: Child Development, Cognitive Tests, Concurrent Validity, Construct Validity
Peer reviewed Peer reviewed
Nelson, Larry R. – Educational Measurement: Issues and Practice, 1984
The author argues that scoring, reporting, and deriving final grades can be considerably assisted by using a computer. He also contends that the savings in time and the computer database formed will allow instructors to determine test quality and reflect on the quality of instruction. (BW)
Descriptors: Achievement Tests, Affective Objectives, Computer Assisted Testing, Educational Testing
Torardi, Mary Montag – 1985
This report describes the procedures used to construct and validate the Standardized Test of Computer Literacy (STCL), a criterion-referenced instrument designed to assess students' computer literacy. Following a statement of the study problem and purpose, a description of the study methodology outlines the use of a 12-step model for development…
Descriptors: Academic Achievement, Competence, Computer Literacy, Computer Science Education
Chissom, Brad; Chukabarah, Prince C. O. – 1985
The comparative effects of various sequences of test items were examined for over 900 graduate students enrolled in an educational research course at The University of Alabama, Tuscaloosa. experiment, which was conducted a total of four times using four separate tests, presented three different arrangements of 50 multiple-choice items: (1)…
Descriptors: Analysis of Variance, Comparative Testing, Difficulty Level, Graduate Students
Lenel, Julia C.; Gilmer, Jerry S. – 1986
In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…
Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)
Forster, Fred – 1987
Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests administered in grades 3-8 of reading, mathematics, and science items, as well as standardized test results were used to explore the possible effects of many factors on item calibrations. In general, the results…
Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis
Phillips, Gary W.; Huynh, Huynh – 1985
A procedure which may be used to project the frequency distribution of one test onto that of another test is described and illustrated. The procedure is useful when a test developer wishes to construct an alternate form with preferred distributional characteristics. For example, the test developer may wish to construct a new test form with a…
Descriptors: Achievement Tests, Elementary Secondary Education, Item Analysis, Item Banks
Previous Page | Next Page ยป
Pages: 1  |  2