Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 1 |
Descriptor
Test Format | 22 |
Test Length | 22 |
Test Validity | 22 |
Test Reliability | 14 |
Test Construction | 9 |
Test Items | 9 |
Testing Problems | 6 |
Intelligence Tests | 5 |
Comparative Analysis | 4 |
Test Use | 4 |
Computer Assisted Testing | 3 |
More ▼ |
Source
Author
Hambleton, Ronald K. | 2 |
Wainer, Howard | 2 |
Alonso, Jordi | 1 |
Arbet, Scott E. | 1 |
Boer, Marian | 1 |
Browne, Janet | 1 |
Camilli, Gregory | 1 |
Coats, Pamela K. | 1 |
Donders, Jacques | 1 |
Eignor, Daniel R. | 1 |
Embretson, Susan E. | 1 |
More ▼ |
Publication Type
Reports - Research | 13 |
Journal Articles | 10 |
Reports - Evaluative | 3 |
Speeches/Meeting Papers | 3 |
Guides - Non-Classroom | 2 |
Reference Materials -… | 2 |
Reports - Descriptive | 2 |
Information Analyses | 1 |
Opinion Papers | 1 |
Education Level
Elementary Secondary Education | 1 |
Audience
Practitioners | 2 |
Community | 1 |
Support Staff | 1 |
Location
New Jersey | 1 |
United Kingdom | 1 |
Vermont | 1 |
Laws, Policies, & Programs
Job Training Partnership Act… | 1 |
Assessments and Surveys
Minnesota Multiphasic… | 2 |
Wechsler Adult Intelligence… | 2 |
Wechsler Intelligence Scale… | 2 |
Bar Examinations | 1 |
Kaufman Brief Intelligence… | 1 |
Marlowe Crowne Social… | 1 |
National Assessment of… | 1 |
What Works Clearinghouse Rating
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format

Streiner, David L.; Miller, Harold R. – Journal of Clinical Psychology, 1986
Numerous short forms of the Minnesota Multiphasic Personality Inventory have been proposed in the last 15 years. In each case, the initial enthusiasm has been replaced by the questions about the clinical utility of the abbreviated version. Argues that the statistical properties of the test and reduced reliability due to shortening the scales…
Descriptors: Test Construction, Test Format, Test Length, Test Reliability

Silverstein, A. B. – Perceptual and Motor Skills, 1983
Formulas for estimating the validity of random short forms were applied to the standardization data for the Wechsler Adult Intelligence Scale-Revised, the Minnesota Multiphasic Personality Inventory, and the Marlowe-Crowne Social Desirability Scale. These formulas demonstrated how much "better than random" the best short forms of these…
Descriptors: Comparative Analysis, Intelligence Tests, Measures (Individuals), Test Format

Thompson, Anthony; Browne, Janet; Schmidt, Fred; Boer, Marian – Assessment, 1997
The validity of a four-subtest short form of the third edition of the Wechsler Intelligence Scale for Children (WISC-III) and the Kaufman Brief Intelligence Test (K-BIT) was evaluated with 42 adolescent offenders. Findings support the clinical use of the short form as a good estimate of WISC-III full-scale IQ. (SLD)
Descriptors: Adolescents, Criminals, Delinquency, Intelligence Quotient

Owen, Steven V.; Froman, Robin D. – Educational and Psychological Measurement, 1987
To test further for efficacy of three-option achievement items, parallel three- and five-option item tests were distributed randomly to college students. Results showed no differences in mean item difficulty, mean discrimination or total test score, but a substantial reduction in time spent on three-option items. (Author/BS)
Descriptors: Achievement Tests, Higher Education, Multiple Choice Tests, Test Format

Donders, Jacques – Psychological Assessment, 1997
Eight subtests were selected from the Wechsler Intelligence Scale for Children--Third Edition (WISC-III) to make a short form for clinical use. Results with the 2,200 children from the WISC-III standardization sample indicated the adequate reliability and validity of the short form for clinical use. (SLD)
Descriptors: Children, Clinical Diagnosis, Intelligence Tests, Test Format

Ward, L. Charles; Ryan, Joseph J. – Psychological Assessment, 1996
Validity and reliability were calculated from data in the standardization sample of the Wechsler Adult Intelligence Scale--Revised for 565 proposed short forms. Time saved in comparison with use of the long form was estimated. The most efficient combinations were generally those composed of subtests that were quick to administer. (SLD)
Descriptors: Cost Effectiveness, Intelligence Tests, Selection, Test Format

Prieto, Luis; Alonso, Jordi; Lamarca, Rosa; Wright, Benjamin D. – Journal of Outcome Measurement, 1998
Data from 45 studies involving 9,149 people were used to develop a short form of the Spanish version of the Nottingham Health Profile through Rasch analysis. Results confirmed the validity of using the developed 22-item short form to measure different groups of people categorized by gender, clinical, and health status. (SLD)
Descriptors: Groups, Health, Individual Characteristics, Item Response Theory

Green, Kathy – Journal of Experimental Education, 1979
Reliabilities and concurrent validities of teacher-made multiple-choice and true-false tests were compared. No significant differences were found even when multiple-choice reliability was adjusted to equate testing time. (Author/MH)
Descriptors: Comparative Testing, Higher Education, Multiple Choice Tests, Test Format
Wainer, Howard; And Others – 1991
A series of computer simulations was run to measure the relationship between testlet validity and the factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Results confirmed the generality of earlier empirical findings of H. Wainer and others (1991) that making a testlet adaptive yields only marginal…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Simulation, Item Banks
Kunce, Charles S.; Arbet, Scott E. – 1994
The National Conference of Bar Examiners commissioned American College Testing, Inc., to help them in the development and evaluation of a performance test for use in bar admissions decisions. Because it was recognized that candidate perceptions would provide valuable information, a candidate-perception questionnaire was developed to be…
Descriptors: Attitudes, Demography, Languages, Lawyers
Haladyna, Tom; Roid, Gale – 1981
Two approaches to criterion-referenced test construction are compared. Classical test theory is based on the practice of random sampling from a well-defined domain of test items; latent trait theory suggests that the difficulty of the items should be matched to the achievement level of the student. In addition to these two methods of test…
Descriptors: Criterion Referenced Tests, Error of Measurement, Latent Trait Theory, Test Construction
Metropolitan Atlanta Consortium of Consultants and Lead Speech-Language Pathologists, GA. – 1990
This guide presents ratings of assessment instruments for use by speech-language pathologists with preschool students. Tests are reviewed in alphabetical order on forms filled out by practicing speech-language pathologists, including data on speech components covered by each test, age range, factors of norms where norms are used, reliability,…
Descriptors: Diagnostic Tests, Examiners, Preschool Education, Preschool Tests
Oosterhof, Albert C.; Coats, Pamela K. – 1981
Instructors who develop classroom examinations that require students to provide a numerical response to a mathematical problem are often very concerned about the appropriateness of the multiple-choice format. The present study augments previous research relevant to this concern by comparing the difficulty and reliability of multiple-choice and…
Descriptors: Comparative Analysis, Difficulty Level, Grading, Higher Education
Eignor, Daniel R.; Hambleton, Ronald K. – 1979
The purpose of the investigation was to obtain some relationships among (1) test lengths, (2) shape of domain-score distributions, (3) advancement scores, and (4) several criterion-referenced test score reliability and validity indices. The study was conducted using computer simulation methods. The values of variables under study were set to be…
Descriptors: Comparative Analysis, Computer Assisted Testing, Criterion Referenced Tests, Cutting Scores
Previous Page | Next Page ยป
Pages: 1 | 2