Descriptor
Computer Programs | 18 |
Test Reliability | 18 |
Test Validity | 18 |
Test Construction | 6 |
Item Analysis | 5 |
Scoring Formulas | 5 |
Statistical Analysis | 5 |
Testing | 5 |
Measurement Techniques | 4 |
Test Items | 4 |
Adaptive Testing | 3 |
More ▼ |
Author
Publication Type
Reports - Research | 11 |
Journal Articles | 3 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
General Aptitude Test Battery | 1 |
Stanford Binet Intelligence… | 1 |
What Works Clearinghouse Rating

Callender, John C.; Osburn, H. G. – Educational and Psychological Measurement, 1977
A FORTRAN program for maximizing and cross-validating split-half reliability coefficients is described. Externally computed arrays of item means and covariances are used as input for each of two samples. The user may select a number of subsets from the complete set of items for analysis in a single run. (Author/JKS)
Descriptors: Computer Programs, Item Analysis, Test Reliability, Test Validity

Schafer, William D. – Educational and Psychological Measurement, 1972
A listing of the program and a program description may be obtained by writing the author at the Department of Measurement and Statistics, College of Education, University of Maryland. (Author/MB)
Descriptors: Computer Programs, Program Descriptions, Test Reliability, Test Validity

Aleamoni, Lawrence M. – Educational and Psychological Measurement, 1971
Descriptors: Computer Programs, Data Analysis, Feedback, Questionnaires
Stocking, Martha; And Others – 1973
For two tests measuring the same trait, the program, BIV20, equates the scores using the two True score distributions estimated by the univariate method 20 program (see Wingersky, Lees, Lennon, and Lord, 1969) and, with these equated true scores and their distributions, estimates the bivariate distribution scores and the relative efficiency of the…
Descriptors: Computer Programs, Equated Scores, Statistical Analysis, Test Reliability

Wackerly, D. D.; Robinson, D. H. – Psychometrika, 1983
A statistical method for testing the agreement between a judge's assessment of an object or subject and a known standard is developed and shown to be superior to two other methods which appear in the literature. (Author/JKS)
Descriptors: Algorithms, Computer Programs, Judges, Measurement Techniques

Krus, David J.; Ceurvorst, Robert W. – Educational and Psychological Measurement, 1978
An algorithm for updating the means of variances of a norm group after each computer-assisted administration of a test is described. The algorithm does not require storage of the whole data set, and provides for unlimited, continuous expansion of the test norms. (Author)
Descriptors: Computer Assisted Testing, Computer Programs, Norms, Statistical Data

Linden, Kathryn W.; Garrison, Wayne M. – NALLD Journal, 1977
This paper provides users of teacher-made tests with a computer program designed to improve reporting of student performance on academic tasks. Test planning and construction and form and technique for reporting student performance are described. (CHK)
Descriptors: Achievement Tests, Computer Programs, Computers, Objective Tests

Hambleton, Ronald K.; And Others – Journal of Educational Measurement, 1983
A new method was developed to assist in the selection of a test length by utilizing computer simulation procedures and item response theory. A demonstration of the method presents results which address the influences of item pool heterogeneity matched to the objectives of interest and the method of item selection. (Author/PN)
Descriptors: Computer Programs, Criterion Referenced Tests, Item Banks, Latent Trait Theory

Aiken, Lewis R. – Educational and Psychological Measurement, 1980
Procedures for computing content validity and consistency reliability coefficients and determining the statistical significance of these coefficients are described. Procedures employing the multinomial probability distribution for small samples and normal curve probability estimates for large samples, can be used where judgments are made on…
Descriptors: Computer Programs, Measurement Techniques, Probability, Questionnaires
Ree, Malcolm James – 1976
A method for developing statistically parallel tests based on the analysis of unique item variance was developed. A test population of 907 basic airmen trainees were required to estimate the angle at which an object in a photograph was viewed, selecting from eight possibilities. A FORTRAN program known as VARSEL was used to rank all the test items…
Descriptors: Comparative Analysis, Computer Programs, Enlisted Personnel, Item Analysis
Reckase, Mark D. – 1977
The reliability and validity of a tailored testing procedure based on the simple logistic model was determined for an achievement test in statistics and measurement. The test was administered on a CRT terminal to students from graduate and undergraduate measurement courses. Equivalent form reliability over a one-week interval was found to be 0.595…
Descriptors: Achievement Tests, Adaptive Testing, College Students, Computer Programs
Manpower Administration (DOL), Washington, DC. U.S. Training and Employment Service. – 1969
The United States Training and Employment Service General Aptitude Test Battery (GATB), first published in 1947, has been included in a continuing program of research to validate the tests against success in many different occupations. The GATB consists of 12 tests which measure nine aptitudes: General Learning Ability; Verbal Aptitude; Numerical…
Descriptors: Aptitude Tests, Career Guidance, Computer Programs, Cutting Scores
Larkin, Kevin C.; Weiss, David J. – 1975
A 15-stage pyramidal test and a 40-item two-stage test were constructed and administered by computer to 111 college undergraduates. The two-stage test was found to utilize a smaller proportion of its potential score range than the pyramidal test. Score distributions for both tests were positively skewed but not significantly different from the…
Descriptors: Ability, Aptitude Tests, Comparative Analysis, Computer Programs
Hansen, Duncan N.; And Others – 1977
A computerized adaptive testing model was assessed in a technical training system. The model, a modification of Lord's flexilevel paradigm, consisted of: the sequencing of test items in a difficulty hierarchy, adaptive entry of students into the test at a difficulty level appropriate to their predicted score, and systematic movement of students…
Descriptors: Adaptive Testing, Branching, Comparative Analysis, Computer Programs
Brennan, Robert L. – 1974
The first four chapters of this report primarily provide an extensive, critical review of the literature with regard to selected aspects of the criterion-referenced and mastery testing fields. Major topics treated include: (a) definitions, distinctions, and background, (b) the relevance of classical test theory, (c) validity and procedures for…
Descriptors: Computer Programs, Confidence Testing, Criterion Referenced Tests, Error of Measurement
Previous Page | Next Page ยป
Pages: 1 | 2