Showing 1,006 to 1,020 of 1,161 results
Cason, Gerald J.; And Others – 1983
Prior research in a single clinical training setting has shown Cason and Cason's (1981) simplified model of their performance rating theory can improve rating reliability and validity through statistical control of rater stringency error. Here, the model was applied to clinical performance ratings of 14 cohorts (about 250 students and 200 raters)…
Descriptors: Clinical Experience, Error of Measurement, Evaluation Methods, Higher Education
Goodstein, H. A. – 1982
The proposed standard for judging proficiency test score reliability requires that the proportion of items passed for each objective assessed be a dependable estimate of the universe score for the domain strata established by the objective. Domain breadth is the focusing issue. Data from a field trial of the Tennessee Proficiency Test are analyzed…
Descriptors: Basic Skills, Criterion Referenced Tests, Educational Testing, Elementary Secondary Education
Slotnick, Henry B. – 1984
This manual is designed to assist faculty members at the University of North Dakota (UND) in the construction, scoring, and analysis of their classroom tests. A computer program is described which will assist staff in test scoring and analysis. The program was developed by the author in the Office of Medical Education and Evaluation of the…
Descriptors: Answer Sheets, Computer Software, Criterion Referenced Tests, Educational Testing
White, Karl; And Others – 1981
To explain discrepancies in Utah's elementary school test results under the Elementary and Secondary Education Act's Title I Evaluation and Reporting System (TIERS), researchers investigated the adequacy and validity of TIERS evaluation models. Model A (norm-referenced testing) is used in most Utah school districts, in preference to Models B or C…
Descriptors: Achievement Tests, Elementary Education, Evaluation Methods, Norm Referenced Tests
Tatsuoka, Kikumi – 1980
This paper presents a new method for estimating a given latent trait variable by the least-squares approach. The beta weights are obtained recursively with the help of Fourier series and expressed as functions of item parameters of response curves. The values of the latent trait variable estimated by this method and by maximum likelihood method…
Descriptors: Computer Assisted Testing, Error of Measurement, Higher Education, Latent Trait Theory
Mellenbergh, Gideon J.; Vijn, Pieter – 1980
Data are summarized in Scheuneman's Score x Group x Response frequency table in order to investigate item bias. The data can arise from two different sampling models: (1) multinomial sampling in which a fixed sample size is used and the responses are cross-classified according to score, group, and response; and (2) product-multinomial sampling in…
Descriptors: Black Students, Cognitive Measurement, Foreign Countries, Latent Trait Theory
Marshall, J. Laird – 1976
A summary is provided of the rationale for questioning the applicability of classical reliability measures to criterion referenced tests; an extension of the classical theory of true and error scores to incorporate a theory of dichotomous decisions; a presentation of the mean split-half coefficient of agreement, a single-administration test index…
Descriptors: Career Development, Computer Programs, Criterion Referenced Tests, Decision Making
Peer reviewed
Campbell, J. F.; Tiller, Dale K. – Educational and Psychological Measurement, 1987
A visual analogue mood scale was developed that included an option indicating subjects didn't understand the meaning of an adjective in an item. The item content significantly affected responses, raising questions about the adequacy of recently proposed affective taxonomies that were based on restricted samples of emotion-descriptive adjectives.…
Descriptors: Adjectives, Affective Measures, Check Lists, Comprehension
Shrock, Sharon; And Others – Performance and Instruction, 1986
Presents major stages in design and development of criterion referenced tests (CRT) with emphasis on differences between CRT construction and norm-referenced test construction. Discussion covers test interpretation; test theory; preparation for test construction (hierarchical analysis, item type selection, and choosing number of items); test…
Descriptors: Adoption (Ideas), Comparative Analysis, Criterion Referenced Tests, Industrial Training
Peer reviewed
Mitchell, G.; And Others – Medical Teacher, 1986
Describes a study designed to determine if the amount of time allocated for answering multiple true/false type questions affects the grades of the medical students taking the tests. Students who had 2-1/4 minutes to answer each question scored significantly better than those who had 1-1/2 minutes or 3 minutes. (TW)
Descriptors: Biochemistry, College Science, Higher Education, Medical Education
Peer reviewed
Everett, Kenneth G.; DeLoach, Will S. – Journal of Chemical Education, 1986
Analyzes an old chemistry examination that was given to a college chemistry class in 1984. Contrasts several of the examination's features against most modern ones. Notes the emphasis on observable properties and the practical applications of substances. Argues that the newer examinations may stress too much theory and don't stress communication…
Descriptors: Chemistry, College Science, Evaluation Methods, Higher Education
Peer reviewed
Ludlow, Larry H.; Bell, Karen N. – Educational and Psychological Measurement, 1996
Fifty education majors in two sections responded to an Attitudes toward Mathematics and Its Teaching (ATMAT) scale. Results with two psychometric models, classical true-score theory and the one-parameter Rasch model, supported the ATMAT's reliability, content and construct validity, and invariance over three time points. (SLD)
Descriptors: College Students, Construct Validity, Education Majors, Elementary Education
Peer reviewed
Brown, James Dean – TESOL Quarterly, 1989
Criterion-referenced testing was used to complement norm-referenced procedures in a revision of a university's English-as-a-Second-Language placement test for reading. Test validation results indicated that the revised test better matched the university's program and included more items related to the content and skills that students were…
Descriptors: Criterion Referenced Tests, English (Second Language), Higher Education, Language Tests
Peer reviewed
Dassa, Clement – Alberta Journal of Educational Research, 1990
Describes two Quebec studies integrating educational diagnosis with classroom assessment to develop learning strategies. Outlines new model based on studies and new mastery learning developments. Argues that the unit of measurement should shift from the student to the teacher-student interaction, and the methodology should be…
Descriptors: Classroom Communication, Classroom Research, Educational Diagnosis, Evaluation Methods
Peer reviewed
Traub, Ross E. – Alberta Journal of Educational Research, 1990
Describes five propositions concerning classroom assessment. Uses propositions to review seven conference papers. Propositions refer to the following: nature of achievement; student information necessary to interpret assessments; tension between need to describe and need to praise; distinction between formal and informal assessment; and need for…
Descriptors: Achievement, Educational Research, Evaluation Methods, Measures (Individuals)