Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 2 |
Descriptor
| Mathematical Models | 41 |
| Test Validity | 41 |
| Test Reliability | 18 |
| Test Construction | 14 |
| Testing Problems | 14 |
| Adaptive Testing | 12 |
| Item Analysis | 11 |
| Statistical Analysis | 11 |
| Computer Assisted Testing | 9 |
| Test Bias | 8 |
| Measurement Techniques | 7 |
| More ▼ | |
Source
Author
| Reckase, Mark D. | 3 |
| Wainer, Howard | 3 |
| Hambleton, Ronald K. | 2 |
| Samejima, Fumiko | 2 |
| Weiss, David J., Ed. | 2 |
| Ackerman, Terry A. | 1 |
| Baker, Frank B. | 1 |
| Bergstrom, Betty | 1 |
| Braun, Henry I. | 1 |
| Cliff, Norman | 1 |
| Cudeck, Robert | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 23 |
| Journal Articles | 13 |
| Reports - Evaluative | 5 |
| Speeches/Meeting Papers | 5 |
| Collected Works - Proceedings | 2 |
| Reports - Descriptive | 1 |
Education Level
Audience
| Researchers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| Armed Services Vocational… | 3 |
| SAT (College Admission Test) | 1 |
| Stanford Binet Intelligence… | 1 |
What Works Clearinghouse Rating
Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024
A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…
Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability
Dickison, Philip; Luo, Xiao; Kim, Doyoung; Woo, Ada; Muntean, William; Bergstrom, Betty – Journal of Applied Testing Technology, 2016
Designing a theory-based assessment with sound psychometric qualities to measure a higher-order cognitive construct is a highly desired yet challenging task for many practitioners. This paper proposes a framework for designing a theory-based assessment to measure a higher-order cognitive construct. This framework results in a modularized yet…
Descriptors: Thinking Skills, Cognitive Tests, Test Construction, Nursing
Peer reviewedCudeck, Robert – Multivariate Behavioral Research, 1985
Twelve structural models of similarity were fitted to data from conventional and computer adaptive test (CAT) batteries measuring the same aptitude in a double cross-validation design. Three of the 12 models, including a multiplicative structure model, performed well, providing support for using CATs as replacements for conventional tests. (NSF)
Descriptors: Adaptive Testing, Aptitude Tests, Comparative Testing, Computer Assisted Testing
Ackerman, Terry A.; Davey, Tim C. – 1991
An adaptive test can usually match or exceed the measurement precision of conventional tests several times its length. This increased efficiency is not without costs, however, as the models underlying adaptive testing make strong assumptions about examinees and items. Most troublesome is the assumption that item pools are unidimensional. Truly…
Descriptors: Ability, Adaptive Testing, Computer Assisted Testing, Equations (Mathematics)
Samejima, Fumiko – 1990
Test validity is a concept that has often been ignored in the context of latent trait models and in modern test theory, particularly as it relates to computerized adaptive testing. Some considerations about the validity of a test and of a single item are proposed. This paper focuses on measures that are population-free and that will provide local…
Descriptors: Adaptive Testing, Computer Assisted Testing, Equations (Mathematics), Item Response Theory
Peer reviewedDrasgow, Fritz; And Others – Applied Psychological Measurement, 1991
Extensions of unidimensional appropriateness indices are developed for multiunidimensional tests (multidimensional tests composed of unidimensional subtests). Simulated and real data (scores of 2,978 students on the Armed Services Vocational Aptitude Battery) were used to evaluate the indices' effectiveness in determining individuals who are…
Descriptors: Comparative Testing, Computer Simulation, Equations (Mathematics), Graphs
Wainer, Howard; Kiely, Gerard L. – 1986
Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns are principally involved with the concept of having the computer construct the test from a precalibrated item pool, and substituting statistical characteristics for the test developer's skills. Problems with…
Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity
Peer reviewedVargha, Andras; Delaney, Harold D. – Journal of Educational and Behavioral Statistics, 1998
A theorem is developed to show that the Kruskal-Wallis test (W. Kruskal and W. Wallis, 1952) (KWt) can be used validly to test a hypothesis about location under much less restrictive conditions than required by the recently accepted, mathematically correct shift model. The specific null hypothesis for which the KWt can be used validly is…
Descriptors: Hypothesis Testing, Mathematical Models, Test Validity
Cliff, Norman; And Others – 1977
TAILOR is a computer program that uses the implied orders concept as the basis for computerized adaptive testing. The basic characteristics of TAILOR, which does not involve pretesting, are reviewed here and two studies of it are reported. One is a Monte Carlo simulation based on the four-parameter Birnbaum model and the other uses a matrix of…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Programs, Difficulty Level
Peer reviewedWainer, Howard; And Others – Journal of Educational Measurement, 1991
Hierarchical (adaptive) and linear methods of testlet construction were compared. The performance of 2,080 ninth and tenth graders on a 4-item testlet was used to predict performance on the entire test. The adaptive test was slightly superior as a predictor, but the cost of obtaining that superiority was considerable. (SLD)
Descriptors: Adaptive Testing, Algebra, Comparative Testing, High School Students
Samejima, Fumiko – 1990
This paper is the final report of a multi-year project sponsored by the Office of Naval Research (ONR) in 1987 through 1990. The main objectives of the research summarized were to: investigate the non-parametric approach to the estimation of the operating characteristics of discrete item responses; revise and strengthen the package computer…
Descriptors: Adaptive Testing, Computer Assisted Testing, Distractors (Tests), Equations (Mathematics)
Lord, Frederic M. – 1972
An elementary survey of item characteristic curve theory, centered around the problems of individualized (tailored) testing, is presented. Following the introduction, discussions are provided of the following: Test Theory for Itemized Tests; The Guttman Scale; Item Characteristic Curve Theory; An Alternative Model; Specialization, Application, and…
Descriptors: Achievement Tests, Bulletins, Citations (References), Evaluation Methods
Weiss, David J., Ed. – 1980
This report is the Proceedings of the third conference of its type. Included are 23 of the 25 papers presented at the conference, discussion of these papers by invited discussants, and symposium papers by a group of leaders in adaptive testing and latent trait test theory research and applications. The papers are organized into the following…
Descriptors: Academic Ability, Academic Achievement, Comparative Testing, Computer Assisted Testing
Koch, William R.; Reckase, Mark D. – 1979
Tailored testing procedures for achievement testing were applied in a situation that failed to meet some of the specifications generally considered to be necessary for tailored testing. Discrepancies from the appropriate conditions included the use of small samples for calibrating items, and the use of an item pool that was not designed to be…
Descriptors: Achievement Tests, Adaptive Testing, Educational Testing, Higher Education
Peer reviewedHubert, Lawrence J.; Baker, Frank B. – Multivariate Behavioral Research, 1978
The strategy for investigating convergent and discriminant test validity, known as the multitrait-multimethod matrix, is investigated. A nonparametric significance testing procedure is suggested and demonstrated. (JKS)
Descriptors: Correlation, Hypothesis Testing, Mathematical Models, Matrices

Direct link
