NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 316 to 330 of 418 results Save | Export
Hambleton, Ronald K. – 1986
The problem of determining optimal test lengths with fixed total testing time has proved to be a difficult one for criterion-referenced test developers. An algorithm is needed which can be used by test developers to allocate available testing time to maximize the validity of their total criterion-referenced tests or testing programs. To be…
Descriptors: Algorithms, Criterion Referenced Tests, Elementary Secondary Education, Psychometrics
Federico, Pat-Anthony – 1989
To determine the relative reliabilities and validities of paper-based and computer-based measurement procedures, 83 male student pilots and radar intercept officers were administered computer and paper-based tests of aircraft recognition. The subject matter consisted of line drawings of front, side, and top silhouettes of aircraft. Reliabilities…
Descriptors: Armed Forces, Comparative Analysis, Computer Assisted Testing, Discriminant Analysis
Bruno, Rachelle M.; Walker, Stephen C. – Diagnostique, 1999
This article describes the Comprehensive Test of Phonological Processing, an assessment instrument designed to assess the phonological processing skills of individuals between the ages of 5 and 24.11 years and to identify those with phonological processing difficulties. Its administration, standardization, reliability, and validity are discussed.…
Descriptors: Adolescents, Children, Disability Identification, Language Impairments
Bachor, Dan G. – Diagnostique, 1999
This article reviews the revised KeyMath Revised-Normative Update American and Canadian editions, and a draft version of the 1999 pending revision. The assessment instrument is intended to test basic mathematical concepts, operations, and applications of students in grades K-12. Its administration, standardization, reliability, and validity are…
Descriptors: Disabilities, Elementary Secondary Education, Foreign Countries, Mathematical Concepts
Peer reviewed Peer reviewed
Direct linkDirect link
Embretson, Susan E. – Measurement: Interdisciplinary Research and Perspectives, 2004
The last century was marked by dazzling changes in many areas, such as technology and communications. Predictions into the second century of testing are seemingly difficult in such a context. Yet, looking back to the turn of the last century, Kirkpatrick (1900), in his American Psychological Association presidential address, presented fundamental…
Descriptors: Ability, Testing, Futures (of Society), Psychometrics
Dirir, Mohamed A. – 1995
The effectiveness of an optimal item selection method in designing parallel test forms was studied during the development of two forms that were parallel to an existing form for each of three language arts tests for fourth graders used in the Connecticut Mastery Test. Two listening comprehension forms, two reading comprehension forms, and two…
Descriptors: Elementary School Students, Grade 4, Intermediate Grades, Item Banks
Henning, Grant – 1993
This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…
Descriptors: Difficulty Level, English (Second Language), Error of Measurement, Estimation (Mathematics)
Kahl, Stuart R. – 1995
Although few question the positive impacts alternative forms of assessment can have on instruction, concerns about the psychometric quality of data obtained from such assessments are taking their toll. Scoring issues are at the heart of many of these concerns. This paper addresses the causes of these concerns: misinformation about psychometric…
Descriptors: Alternative Assessment, Educational Assessment, Equated Scores, Performance Based Assessment
Larson, Jerry W. – 1985
A study at Brigham Young University (Utah) investigated the feasibility of computer-assisted language placement testing in higher education. Benefits and problems of this approach for test administration, individualization of item selection, and recordkeeping were examined. Four steps were followed in production of a test for Spanish placement:…
Descriptors: College Second Language Programs, Computer Assisted Testing, Higher Education, Language Tests
Trevisan, Michael S.; Sax, Gilbert – 1991
The purpose of this study was to compare the reliabilities of two-, three-, four-, and five-choice tests using an incremental option paradigm. Test forms were created incrementally, a method approximating actual test construction procedures. Participants were 154 12th-grade students from the Portland (Oregon) area. A 45-item test with two options…
Descriptors: Comparative Testing, Distractors (Tests), Estimation (Mathematics), Grade 12
Chang, Lei – 1993
Equivalence in reliability and validity across 4-point and 6-point scales was assessed by fitting different measurement models through confirmatory factor analysis of a multitrait-multimethod covariance matrix. Responses to nine Likert-type items designed to measure perceived quantitative ability, self-perceived usefulness of quantitative…
Descriptors: Ability, Comparative Testing, Education Majors, Graduate Students
Meld, Andrea – 1990
Surveys used for program and institutional evaluation, such as self-studies conducted for accreditation review, are discussed. Frequently, these evaluations take the form of faculty surveys and student surveys. This paper explores the following general considerations associated with mail surveys and other surveys: avoidance of response bias;…
Descriptors: Accreditation (Institutions), Comparative Analysis, Higher Education, Mail Surveys
Phillips, Gary W.; Huynh, Huynh – 1985
A procedure which may be used to project the frequency distribution of one test onto that of another test is described and illustrated. The procedure is useful when a test developer wishes to construct an alternate form with preferred distributional characteristics. For example, the test developer may wish to construct a new test form with a…
Descriptors: Achievement Tests, Elementary Secondary Education, Item Analysis, Item Banks
Peer reviewed Peer reviewed
Budescu, David V. – Applied Psychological Measurement, 1988
A multiple matching test--a 24-item Hebrew vocabulary test--was examined, in which distractors from several items are pooled into one list at the test's end. Construction of such tests was feasible. Reliability, validity, and reduction of random guessing were satisfactory when applied to data from 717 applicants to Israeli universities. (SLD)
Descriptors: College Applicants, Feasibility Studies, Foreign Countries, Guessing (Tests)
Peer reviewed Peer reviewed
Zeidner, Moshe – Higher Education, 1986
A study of possible test bias in the Arabic and Hebrew versions of a standardized scholastic aptitude test used in Israel found a slight overprediction of performance for Arabs, but the findings appear to be more consistent with psychometric than cultural bias. (Author/MSE)
Descriptors: Aptitude Tests, Arabic, Arabs, College Bound Students
Pages: 1  |  ...  |  18  |  19  |  20  |  21  |  22  |  23  |  24  |  25  |  26  |  27  |  28