The Comparison of Differential Item Functioning Predicted through Experts and Statistical Techniques
Dogan, Nuri; Hambleton, Ronald K.; Yurtcu, Meltem; Yavuz, Sinan – Cypriot Journal of Educational Sciences, 2018
Validity is one of the psychometric properties of achievement tests. One way to examine validity is through item bias studies, which are based on differential item functioning (DIF) analyses and field experts' opinions. In this study, field experts were asked to estimate the DIF levels of the items to compare the estimations…
Descriptors: Test Bias, Comparative Analysis, Predictor Variables, Statistical Analysis
Hambleton, Ronald K.; Traub, Ross E. – 1970
The purpose of this study was to determine the efficiency of the estimates of ability provided by the one-parameter logistic model as compared to the estimates provided by the more general two- and three-parameter models. Several tests were simulated with item parameters meeting the assumptions of either the two- or three-parameter model. For each…
Descriptors: Ability Identification, Data Collection, Models, Scoring
Hambleton, Ronald K.; And Others – Journal of Educational Measurement, 1970
Descriptors: Comparative Analysis, Evaluation Methods, Multiple Choice Tests, Test Reliability
Hambleton, Ronald K.; Bollwark, John – 1991
The validity of results from international assessments depends on the correctness of the test translations. If the tests presented in one language are more or less difficult because of the manner in which they are translated, the validity of any interpretation of the results can be questioned. Many test translation methods exist in the literature,…
Descriptors: Cultural Differences, Educational Assessment, English, Foreign Countries

Hambleton, Ronald K. – Educational and Psychological Measurement, 1987
This paper presents an algorithm for determining the number of items to measure each objective in a criterion-referenced test when testing time is fixed and when the objectives vary in their levels of importance, reliability, and validity. Results of four special applications of the algorithm are presented. (BS)
Descriptors: Algorithms, Behavioral Objectives, Criterion Referenced Tests, Test Construction
Rovinelli, Richard J.; Hambleton, Ronald K. – 1976
Essential for an effective criterion-referenced testing program is a set of test items that are "valid" indicators of the objectives they have been designed to measure. Unfortunately, the complex matter of assessing item validity has received only limited attention from educational measurement specialists. One promising approach to the item…
Descriptors: Content Analysis, Criterion Referenced Tests, Data Collection, Evaluation Methods

Traub, Ross E.; Hambleton, Ronald K. – Educational and Psychological Measurement, 1973
Descriptors: Grade 8, Guessing (Tests), Multiple Choice Tests, Pacing

Hambleton, Ronald K.; And Others – Journal of Educational Measurement, 1983
A new method was developed to assist in the selection of a test length by utilizing computer simulation procedures and item response theory. A demonstration of the method presents results which address the influences of item pool heterogeneity matched to the objectives of interest and the method of item selection. (Author/PN)
Descriptors: Computer Programs, Criterion Referenced Tests, Item Banks, Latent Trait Theory

Hambleton, Ronald K.; Novick, Melvin R. – Journal of Educational Measurement, 1973
In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. (Editor)
Descriptors: Bayesian Statistics, Criterion Referenced Tests, Decision Making, Definitions
Hambleton, Ronald K.; And Others – 1988
Tests to assess problem-solving ability being provided for the Air Force are described, and some details on the development and validation of these computer-administered diagnostic achievement tests are discussed. Three measurement approaches were employed: (1) sequential problem solving; (2) context-free assessment of fundamental skills and…
Descriptors: Achievement Tests, Aircraft Pilots, Computer Assisted Testing, Occupational Tests
Hambleton, Ronald K.; Novick, Melvin R. – 1972
In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. Since criterion-referenced testing is viewed from a decision-theoretic point of view, approaches to reliability and validity…
Descriptors: Criterion Referenced Tests, Measurement Instruments, Measurement Techniques, Scaling

Traub, Ross E.; Hambleton, Ronald K. – Educational and Psychological Measurement, 1972
Findings of this study suggest that it is preferable to attempt to control guessing through the use of the reward instruction rather than to attempt to control it using the penalty instruction or to encourage it using the instruction to guess. (Authors/MB)
Descriptors: Grade 8, Guessing (Tests), Multiple Choice Tests, Pacing
Hambleton, Ronald K.; And Others – 1993
The development and evaluation of methods for detecting potentially biased items or differentially functioning items (DIF) represent a critical area of research for psychometricians because of the negative impact of biased items on test validity. A summary is provided of the authors' 12 years of research at the University of Massachusetts…
Descriptors: Educational Research, Effect Size, Guidelines, Item Bias
Hambleton, Ronald K.; Eignor, Daniel R. – 1978
In light of the widespread use of competency testing, the authors consider that it is important to determine ways of developing and using competency testing to insure that it achieves its full potential. The paper, in three parts, introduces a model for the development and validation of competency tests, reviews several methods for setting…
Descriptors: Competence, Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education
Gifford, Janice A.; Hambleton, Ronald K. – 1980
Technical considerations associated with item selection and reliability assessment are considered in relation to criterion-referenced tests constructed to provide group information. The purpose is to emphasize test building and the evaluation of test scores in program evaluation studies. It is stressed that an evaluator employ a performance or…
Descriptors: Criterion Referenced Tests, Group Testing, Item Sampling, Models