ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	1

Descriptor

Test Validity	27
Test Construction	16
Test Reliability	15
Criterion Referenced Tests	14
Models	6
Test Interpretation	6
Test Length	6
Psychometrics	5
Test Format	5
Test Items	5
Achievement Tests	4
Cutting Scores	4
Elementary Secondary Education	4
Evaluation Methods	4
Item Analysis	4
Norm Referenced Tests	4
Scores	4
Scoring	4
Statistical Analysis	4
Testing	4
Testing Problems	4
Comparative Analysis	3
Cultural Differences	3
Data Collection	3
Foreign Countries	3
More ▼

Source

Educational and Psychological…	3
Applied Measurement in…	2
Journal of Educational…	2
Cypriot Journal of…	1
Educational Measurement:…	1
J Educ Meas	1
Review of Educational Research	1

Author

Hambleton, Ronald K.	27
Eignor, Daniel R.	3
Traub, Ross E.	3
Novick, Melvin R.	2
Bollwark, John	1
Dogan, Nuri	1
Gifford, Janice A.	1
Gorth, William P.	1
Kanjee, Anil	1
Linn, Robert L.	1
Rovinelli, Richard J.	1
Slater, Sharon C.	1
Smith, I. Leon	1
Yavuz, Sinan	1
Yurtcu, Meltem	1
More ▼

Publication Type

Reports - Research	9
Reports - Evaluative	8
Journal Articles	6
Speeches/Meeting Papers	6
Information Analyses	3
Guides - Classroom - Teacher	1
Tests/Questionnaires	1

Education Level

Elementary Education	1
Grade 8	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Practitioners

Location

Turkey

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 27 results Save | Export

The Comparison of Differential Item Functioning Predicted through Experts and Statistical Techniques

Peer reviewed
PDF on ERIC

Download full text

Dogan, Nuri; Hambleton, Ronald K.; Yurtcu, Meltem; Yavuz, Sinan – Cypriot Journal of Educational Sciences, 2018

Validity is one of the psychometric properties of the achievement tests. To determine the validity, one of the examination is item bias studies, which are based on differential item functioning (DIF) analyses and field experts' opinion. In this study, field experts were asked to estimate the DIF levels of the items to compare the estimations…

Descriptors: Test Bias, Comparative Analysis, Predictor Variables, Statistical Analysis

Information Curves and Efficiency of Three Logistic Test Models.

Download full text

Hambleton, Ronald K.; Traub, Ross E. – 1970

The purpose of this study was to determine the efficiency of the estimates of ability provided by the one-parameter logistic model as compared to the estimates provided by the more general two- and three-parameter models. Several tests were simulated with item parameters meeting the assumptions of either the two- or three-parameter model. For each…

Descriptors: Ability Identification, Data Collection, Models, Scoring

A A Comparison of the Reliability and Validity of Two Methods for Assessing Partial Knowledge on a Multiple-Choice Test

Hambleton, Ronald K.; And Others – J Educ Meas, 1970

Descriptors: Comparative Analysis, Evaluation Methods, Multiple Choice Tests, Test Reliability

Adapting Tests for Use in Different Cultures: Technical Issues and Methods.

Download full text

Hambleton, Ronald K.; Bollwark, John – 1991

The validity of results from international assessments depends on the correctness of the test translations. If the tests presented in one language are more or less difficult because of the manner in which they are translated, the validity of any interpretation of the results can be questioned. Many test translation methods exist in the literature,…

Descriptors: Cultural Differences, Educational Assessment, English, Foreign Countries

Determining Optimal Test Lengths with a Fixed Total Testing Time.

Peer reviewed

Hambleton, Ronald K. – Educational and Psychological Measurement, 1987

This paper presents an algorithm for determining the number of items to measure each objective in a criterion-referenced test when testing time is fixed and when the objectives vary in their levels of importance, reliability, and validity. Results of four special applications of the algorithm are presented. (BS)

Descriptors: Algorithms, Behavioral Objectives, Criterion Referenced Tests, Test Construction

On the Use of Content Specialists in the Assessment of Criterion-Referenced Test Item Validity.

Download full text

Rovinelli, Richard J.; Hambleton, Ronald K. – 1976

Essential for an effective criterion-referenced testing program is a set of test items that are "valid" indicators of the objectives they have been designed to measure. Unfortunately, the complex matter of assessing item validity has received only limited attention from educational measurement specialists. One promising approach to the item…

Descriptors: Content Analysis, Criterion Referenced Tests, Data Collection, Evaluation Methods

Note of Correction on the Article Entitled: The Effect of Scoring Instructions and Degree of Speededness on the Validity and Reliability of Multiple-Choice Tests

Peer reviewed

Traub, Ross E.; Hambleton, Ronald K. – Educational and Psychological Measurement, 1973

Descriptors: Grade 8, Guessing (Tests), Multiple Choice Tests, Pacing

Determining the Lengths for Criterion-Referenced Tests.

Peer reviewed

Hambleton, Ronald K.; And Others – Journal of Educational Measurement, 1983

A new method was developed to assist in the selection of a test length by utilizing computer simulation procedures and item response theory. A demonstration of the method presents results which address the influences of item pool heterogeneity matched to the objectives of interest and the method of item selection. (Author/PN)

Descriptors: Computer Programs, Criterion Referenced Tests, Item Banks, Latent Trait Theory

Toward an Integration of Theory and Method for Criterion-Referenced Tests

Peer reviewed

Hambleton, Ronald K.; Novick, Melvin R. – Journal of Educational Measurement, 1973

Descriptors: Bayesian Statistics, Criterion Referenced Tests, Decision Making, Definitions

New Testing Methods to Assess Technical Problem-Solving Ability.

Download full text

Hambleton, Ronald K.; And Others – 1988

Tests to assess problem-solving ability being provided for the Air Force are described, and some details on the development and validation of these computer-administered diagnostic achievement tests are discussed. Three measurement approaches were employed: (1) sequential problem solving; (2) context-free assessment of fundamental skills and…

Descriptors: Achievement Tests, Aircraft Pilots, Computer Assisted Testing, Occupational Tests

Toward an Integration of Theory and Method for Criterion-Referenced Tests.

Download full text

Hambleton, Ronald K.; Novick, Melvin R. – 1972

In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. Since criterion-referenced testing is viewed from a decision-theoretic point of view, approaches to reliability and validity…

Descriptors: Criterion Referenced Tests, Measurement Instruments, Measurement Techniques, Scaling

The Effect of Scoring Instructions and Degree of Speededness on the Validity and Reliability of Multiple-Choice Tests

Peer reviewed

Traub, Ross E.; Hambleton, Ronald K. – Educational and Psychological Measurement, 1972

Findings of this study suggest that it is preferable to attempt to control guessing through the use of the reward instruction rather than to attempt to control it using the penalty instruction or to encourage it using the insttruction to guess. (Authors/MB)

Descriptors: Grade 8, Guessing (Tests), Multiple Choice Tests, Pacing

Advances in the Detection of Differentially Functioning Test Items.

Download full text

Hambleton, Ronald K.; And Others – 1993

The development and evaluation of methods for detecting potentially biased items or differentially functioning items (DIF) represent a critical area of research for psychometricians because of the negative impact of biased items on test validity. A summary is provided of the authors' 12 years of research at the University of Massachusetts…

Descriptors: Educational Research, Effect Size, Guidelines, Item Bias

Competency Test Development, Validation, and Standard-Setting.

Download full text

Hambleton, Ronald K.; Eignor, Daniel R. – 1978

In light of the widespread use of competency testing, the authors consider that it is important to determine ways of developing and using competency testing to insure that it achieves its full potential. The paper, in three parts, introduces a model for the development and validation of competency tests, reviews several methods for setting…

Descriptors: Competence, Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education

Construction and Use of Criterion-Referenced Tests in Program Evaluation Studies. Laboratory of Psychometric and Evaluation Research Report No. 102.

Download full text

Gifford, Janice A.; Hambleton, Ronald K. – 1980

Technical considerations associated with item selection and reliability assessment are considered in relation to criterion-referenced tests constructed to provide group information. The purpose is to emphasize test building and the evaluation of test scores in program evaluation studies. It is stressed that an evaluator employ a performance or…

Descriptors: Criterion Referenced Tests, Group Testing, Item Sampling, Models

Previous Page | Next Page »

Pages: 1 | 2