NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 14 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Wallace, Colin S.; Prather, Edward E.; Duncan, Douglas K. – Astronomy Education Review, 2011
This is the first in a series of five articles describing a national study of general education astronomy students' conceptual and reasoning difficulties with cosmology. In this paper, we describe the process by which we designed four new surveys to assess general education astronomy students' conceptual cosmology knowledge. These surveys focused…
Descriptors: General Education, Astronomy, Surveys, Evolution
Strasler, Gregg M. – 1980
The relationship between classical discrimination indices (CDI) and criterion-referenced discrimination indices (CRDI) and the appropriateness of each for use on criterion-referenced tests are investigated. A CRDI is proposed that attempts to separate those who master material from those who do not master material. A 26 item multiple-choice…
Descriptors: Criterion Referenced Tests, Discriminant Analysis, Higher Education, Mastery Learning
Peer reviewed Peer reviewed
Blixt, Sonya L.; Shama, Deborah D. – Educational and Psychological Measurement, 1986
Methods of estimating the standard error at different ability levels were compared. Overall, it was found that at a given ability level the standard errors calculated using different formulas are not appreciably different. Further, for most situations the traditional method of calculating a standard error probably provides sufficient precision.…
Descriptors: College Freshmen, Error of Measurement, Higher Education, Mathematics Achievement
Peer reviewed Peer reviewed
Balch, William R. – Teaching of Psychology, 1989
Studies the effect of item order on test scores and completion time. Students scored slightly higher when test items were grouped sequentially (relating to text and lectures) than on tests when test items were grouped by text chapter but ordered randomly, or when test items were ordered randomly. Found no differences in completion time. (Author/LS)
Descriptors: Educational Research, Higher Education, Performance, Psychology
Ross, Steven; Hua, Te-Fang – 1994
A general issue related to language program development involves the empirical rationalization of cut score decisions in criterion-referenced language tests. Cut score dependability focuses on the consistency of the decisions in repeated testing or the assessment of language learner performances. In this case, the issue is to determine the optimal…
Descriptors: Achievement Gains, Criterion Referenced Tests, English (Second Language), Higher Education
Cope, Ronald T.; Kolen, Michael J. – 1987
This study compared five density estimation techniques applied to samples from a population of 272,244 examinees' ACT English Usage and Mathematics Usage raw scores. Unsmoothed frequencies, kernel method, negative hypergeometric, four-parameter beta compound binomial, and Cureton-Tukey methods were applied to 500 replications of random samples of…
Descriptors: College Entrance Examinations, Estimation (Mathematics), Higher Education, Mathematical Models
Gamache, LeAnn M. – 1983
Scales constructed under procedures and criteria outlined by the various traditional and latent trait methods were examined as to whether they varied in characteristics related to scale quality. Scales were constructed from a common pool of items analyzed in full form according to Likert and a one-parameter Rasch model for non-dichotomous data.…
Descriptors: Comparative Analysis, Correlation, Higher Education, Item Analysis
McDaniel, Barbara A. – 1985
A study was conducted to determine whether evaluators of large scale essay tests respond the same way toward essays written by English as a second language (ESL) and non-ESL students. The data examined came from the English Placement Test (EPT) administered in the province of British Columbia, Canada, in March 1979. The test was used to identify…
Descriptors: Chinese, Comparative Analysis, English (Second Language), Higher Education
Levine, Michael V.; Drasgow, Fritz – 1984
Some examinees' test-taking behavior may be so idiosyncratic that their scores are not comparable to the scores of more typical examinees. Appropriateness indices, which provide quantitative measures of response-pattern atypicality, can be viewed as statistics for testing a null hypothesis of normal test-taking behavior against an alternative…
Descriptors: Cheating, College Entrance Examinations, Computer Simulation, Estimation (Mathematics)
Boldt, R. F.; And Others – 1986
A review of the literature concerned with validity data and policies for various methods of treating multiple scores is reported, as are analyses of data from the College Board Validity Study Service. The analyses evaluated the use of Scholastic Aptitude Test--Verbal alone, Scholastic Aptitude Test--Mathematical alone, and the use of both in…
Descriptors: Analysis of Variance, College Entrance Examinations, College Students, Evaluation Methods
Sullivan, Francis J. – 1986
A study examined how pragmatic form influences evaluation of student essays in university placement testing. Specifically, the study documented how patterns in students' use of information (assumed to be either old, inferable, or new for readers) affected the holistic scores for quality given to the essays. Subjects, 99 randomly selected entering…
Descriptors: College Freshmen, Essay Tests, Evaluation Criteria, Evaluation Methods
College Entrance Examination Board, Princeton, NJ. – 1990
This guide is designed to provide essential background material about the College Board's Computerized Placement Tests (CPTs). It is recommended for administrators and staff alike. It contains the theory on which the tests are based, information concerning how to administer them, and discussions of the reports produced and how to interpret the…
Descriptors: Adaptive Testing, Algebra, Arithmetic, College Entrance Examinations
Hambleton, Ronald K.; Rogers, H. Jane – 1986
This report was designed to respond to two major methodological shortcomings in the item bias literature: (1) misfitting test models; and (2) the use of significance tests. Specifically, the goals of the research were to describe a newly developed method known as the "plot method" for identifying potentially biased test items and to…
Descriptors: Criterion Referenced Tests, Culture Fair Tests, Difficulty Level, Estimation (Mathematics)
Educational Testing Service, Princeton, NJ. – 1977
The 1976 Educational Testing Service (ETS) Invitational Conference served as a platform for individuals who have been prominent in educational measurement and research to present their views on issues surrounding the testing controversy. The 1976 ETS "The Testing Scene: Chaos and Controversy," presents a historical review of events surrounding the…
Descriptors: Achievement Tests, Adaptive Testing, Awards, Career Development