Showing 211 to 225 of 418 results
Federico, Pat-Anthony; Liggett, Nina L. – 1989
Seventy-five subjects (Naval F-14 and E-2C crew members) were administered computer-based and paper-based tests of threat-parameter knowledge represented as a semantic network in order to determine the relative reliabilities and validities of these two assessment modes. Estimates of internal consistencies, equivalences, and discriminant validities…
Descriptors: Comparative Analysis, Computer Assisted Testing, Knowledge Level, Military Personnel
Brittain, Mary M.; Brittain, Clay V. – 1981
A behavioral domain is well-defined when it is clear to both test developers and test users which categories of performance should or should not be considered for potential test items. Only those tests that are keyed to well-defined domains meet the definition of criterion-referenced tests. The greatest proliferation of criterion-referenced tests…
Descriptors: Criterion Referenced Tests, Reading Achievement, Reading Tests, Test Construction
Peer reviewed
Drain, Susan; Manos, Kenna – English Quarterly, 1986
Reviews a writing abilities competency test based on samples of essay writing. A copy of the test is appended. (NKA)
Descriptors: Essays, Higher Education, Language Tests, Test Construction
Peer reviewed
Askegaard, Lewis D.; Umila, Benwardo V. – Journal of Educational Measurement, 1982
Multiple matrix sampling of items and examinees was applied to an 18-item rank order instrument administered to a randomly assigned group and compared to the ordering and ranking of all items by control subjects. High correlations between ranks suggest the methodology may viably reduce respondent effort on long rank ordering tasks. (Author/CM)
Descriptors: Evaluation Methods, Item Sampling, Junior High Schools, Student Reaction
Peer reviewed
Plastre, Guy – Canadian Modern Language Review, 1981
Starting from the premise that the assessment of second language learners' competence is a must at several points during the learning process, discusses the usefulness of laboratory testing. Argues that it allows for standardized testing, facility of test administration, easy scoring, objective measures, reliability and validity of results. (MES)
Descriptors: English, French, Language Laboratories, Scoring
Peer reviewed
Singer, Peter A.; And Others – Academic Medicine, 1996
Final-year Ontario medical students (n=88) took a 4-station objective structured clinical examination (OSCE) using standardized patients and involving decisions to forgo life-sustaining treatment. Performance was scored on a checklist of behaviors unique to each case. Results indicated that because of low reliability, the OSCE is not a feasible…
Descriptors: Clinical Experience, Competency Based Education, Ethics, Foreign Countries
Peer reviewed
Schriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1989
Three studies explored the effects of grouping versus randomized items in questionnaires on internal consistency and test-retest reliability with samples of 80, 80, and 100, respectively, university students and undergraduates. The 2 correlational and 1 experimental studies were reasonably consistent in demonstrating that neither format was…
Descriptors: Classification, College Students, Evaluation Methods, Higher Education
Peer reviewed
Demsky, Yvonne I.; Gass, Carlton S.; Golden, Charles J. – Assessment, 1998
Standardization data based on responses of 616 Puerto Ricans to the Spanish version of the Wechsler Adult Intelligence Scale (D. Wechsler, 1981) reveal reliability data and base rates to assist in evaluating the clinical significance of differences between Performance Intelligence Quotient (PIQ) and Verbal Intelligence Quotient (VIQ).…
Descriptors: Adults, Clinical Diagnosis, Intelligence Tests, Performance Factors
Peer reviewed
Paik, Chie; Michael, William B. – Educational and Psychological Measurement, 1999
Studied the internal consistency reliability and construct validity of scores on each of five dimensions of a Japanese version of the Dimensions of Self-Concept Scale. Results for 354 female high school students show that a five-factor oblique model accounts for the greatest proportion of covariance in the matrix of 15 subtests. Contains 20…
Descriptors: Construct Validity, Factor Structure, Females, Foreign Countries
Mircea-Pines, Walter J. – ProQuest LLC, 2009
This dissertation study examined the reliability and validity claims of a modified version of the Spanish Modern Language Association Foreign Language Proficiency Test for Teachers and Advanced Students administered at George Mason University (GMU). The study used the 1999 computerized GMU version that was administered to 277 test-takers via…
Descriptors: College Students, Advanced Students, Second Language Learning, Test Validity
Vansickle, Timothy R.; Kapes, Jerome T. – 1988
First-, second-, and third-year students enrolled in an introductory educational psychology class at Texas A&M University in College Station were administered either a pencil-and-paper or computerized version of the Strong-Campbell Interest Inventory. The same or other version of the test was administered after 2 weeks. Focus is on equivalence…
Descriptors: College Students, Comparative Analysis, Computer Assisted Testing, Computer Uses in Education
Sachse, Thomas P. – 1981
Consistent with its mission of synthesizing available information on performance assessment in a variety of skill areas, the Clearinghouse for Applied Performance Assessment has prepared this summary of the role of performance assessment in selected published tests of problem-solving skill. Performance assessment is defined in terms of assessment…
Descriptors: Cognitive Tests, Mastery Tests, Measurement Objectives, Performance Based Assessment
Peer reviewed
Smith, Lawrence L.; Johns, Jerry L. – Reading Psychology, 1984
Finds some evidence that out-of-level tests are more suitable and reliable for poor readers than are on-level tests. (FL)
Descriptors: Academic Aptitude, Intermediate Grades, Reading Difficulties, Reading Instruction
Peer reviewed
Hodson, Derek – Journal of Research in Science Teaching, 1984
Investigated the validity of the assumption that arranging items in objective tests in order of increasing difficulty increases student motivation and produces more reliable tests. Results indicate that test reliability was largely independent of item sequence. (Author/JN)
Descriptors: Chemistry, High Schools, Multiple Choice Tests, Science Education
National Evaluation and Technical Assistance Center for the Education of Children and Youth Who Are Neglected, Delinquent, or At-Risk, 2006
This guide was developed for State, agency, and/or facility administrators who provide education for children and youth who are neglected, delinquent, or at risk (N or D). The guide provides basic information about the ideal characteristics of a pre-post test and highlights important features to consider when requesting and evaluating information…
Descriptors: Test Selection, Pretests Posttests, Child Neglect, Delinquency