ERIC - Search Results

Descriptor

Item Analysis	16
Item Sampling	16
Test Reliability	16
Test Validity	9
Test Construction	8
Criterion Referenced Tests	7
Mathematical Models	6
Test Interpretation	6
Test Theory	5
Achievement Tests	4
Career Development	4
Latent Trait Theory	4
Difficulty Level	3
Error of Measurement	3
Individual Differences	3
Item Banks	3
Mastery Tests	3
Norm Referenced Tests	3
Test Items	3
Check Lists	2
Comparative Analysis	2
Decision Making	2
Matrices	2
Performance Tests	2
Response Style (Tests)	2
More ▼

Source

Applied Psychological…	1
Educational and Psychological…	1
Illinois School Research	1

Publication Type

Reports - Research	9
Speeches/Meeting Papers	2
Guides - General	1
Reference Materials -…	1
Reports - Evaluative	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Adjective Check List	1
National Assessment of…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 16 results Save | Export

Conceptualization of Issues in Construct and Content Validity. Studies in Measurement and Methodology, Work Unit No. 1: Conceptual and Design Problems in Competency-Based Measurements.

Linn, Robert – 1978

A series of studies on conceptual and design problems in competency-based measurements are explained. The concept of validity within the context of criterion-referenced measurement is reviewed. The authors believe validation should be viewed as a process rather than an end product. It is the process of marshalling evidence to support…

Descriptors: Criterion Referenced Tests, Item Analysis, Item Sampling, Test Bias

The Effects of Various Item Selection Methods on the Classification Accuracy and Classification Consistency of Criterion-Referenced Instruments.

Smith, Douglas U. – 1978

This study examined the effects of certain item selection methods on the classification accuracy and classification consistency of criterion-referenced instruments. Three item response data sets, representing varying situations of instructional effectiveness, were simulated. Five methods of item selection were then applied to each data set for the…

Descriptors: Criterion Referenced Tests, Item Analysis, Item Sampling, Latent Trait Theory

Criterion-Referenced Test Interpretations of "Classical" Measurement Theory.

Download full text

Epstein, Kenneth I.; Knerr, Claramae S. – 1976

The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…

Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling

Latent Trait Estimation: Theory vs. Practice.

Download full text

Kolakowski, Donald – 1972

Empirical results are presented as regards the implementation of a latent-trait psychometric model by means of conditional maximum likelihood estimation. Items are scored polychotomously into varying numbers of nominal categories and the test and item characteristic curves and information functions are examined. It is concluded that scoring items…

Descriptors: Error of Measurement, Item Analysis, Item Sampling, Measurement Techniques

Individuality of Item Interpretation in Interchangeable ACL Scales

Peer reviewed

Fiske, Donald W.; Barack, Leonard I. – Educational and Psychological Measurement, 1976

The diversity among interpretations of single items in personality questionnaires has been noted previously. Using adjectives from the Adjective Check List (ACL), the study sought evidence bearing on these questions: Does such diversity make the responses to an item not comparable across subjects? If so, what are the implications for scores based…

Descriptors: Adjectives, Check Lists, Individual Differences, Item Analysis

Decision Reliability and Classification Validity for Decision Oriented Criterion-Referenced Tests.

Faggen, Jane – 1978

Formulas are presented for decision reliability and for classification validity for mastery/nonmastery decisions based on criterion referenced tests. Two item parameters are used: the probability of a master answering an item correctly, and the probability of a nonmaster answering an item incorrectly. The theory explores the relationships of…

Descriptors: Bayesian Statistics, Criterion Referenced Tests, Item Analysis, Item Banks

A Basic Test Theory Generalizable to Tailored Testing. Technical Report No. 1.

Download full text

Cliff, Norman – 1975

Measures of consistency and completeness of order relations derived from test-type data are proposed. The measures are generalized to apply to incomplete data such as tailored testing. The measures are based on consideration of the items-plus-persons by items-plus-persons matrix as an adjacency matrix in which a 1 means that the row element…

Descriptors: Adaptive Testing, Career Development, Computer Oriented Programs, Individual Differences

Aspects and Applications of Criterion-Referenced Tests

Kriewall, Thomas E. – Illinois School Research, 1972

Author discusses and defines criterion tests in the context of classroom needs that have created much of the interest in the theory at this time. The primary source of interest is related to the growing implementation of individualized curricula. (Author/CB)

Descriptors: Criterion Referenced Tests, Difficulty Level, Individualized Instruction, Item Analysis

Some Item Analysis and Test Theory for a System of Computer-Assisted Test Construction for Individualized Instruction

Peer reviewed

Lord, Frederic M. – Applied Psychological Measurement, 1977

Under given conditions, conventional testing and computer-generated repeatable testing (CGRT) are equally effective for estimating examinee ability; CGRT is more effective for estimating the mean ability level of a group and less effective for estimating ability differences among individuals. These conclusion are drawn from domain-referenced test…

Descriptors: Career Development, Computer Assisted Testing, Difficulty Level, Group Norms

Scale-Score Reporting of National Assessment Data (Final Report).

Download full text

Mislevy, Robert J.; And Others – 1982

An approach was developed based on item-response models defined at the level of salient subject groups rather than at the level of individuals, designed for use with multiple-matrix sampling designs. In each of three National Assessment of Educational Progress (NAEP) mathematics subtopics, Reiser's group-effects latent trait model was fitted to…

Descriptors: Educational Assessment, Item Analysis, Item Sampling, Latent Trait Theory

Achievement Test Items--Methods of Study. CSE Monograph Series in Evaluation, 6.

Harris, Chester W.; And Others – 1977

The implications of a mathematical model of test scores are explored where the data are limited to a random sample of items without replacement from an indefinitely large population or item domain in which items are scored either zero or one. The purpose is to obtain an unbiased estimate of a student's proportion of items correct in the item…

Descriptors: Academic Achievement, Achievement Tests, Annotated Bibliographies, Bibliographies

Riding the Rasch Tiger. Part 1: Laying the Item Bank Foundation (Paul Volker Would Approve).

Forster, Fred – 1987

Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests administered in grades 3-8 of reading, mathematics, and science items, as well as standardized test results were used to explore the possible effects of many factors on item calibrations. In general, the results…

Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis

Characteristics of Samples and Linking Items Affecting a Partial Pre-Calibrations Design.

Download full text

Cook, Linda L.; And Others – 1987

This study tests several explanations for discrepant results in an earlier study (Cook et al., 1985) which presented a partial pre-calibration method for equating new editions of the Scholastic Aptitude Test (SAT) to the same scale as older editions. In contrast to full pre-calibration, which seeks to equate all items from two or more editions,…

Descriptors: College Entrance Examinations, Concurrent Validity, Equated Scores, Estimation (Mathematics)

An Approach to Measuring the Achievement or Proficiency of an Examinee.

Wilcox, Rand R. – 1979

Mastery tests are analyzed in terms of the number of skills to be mastered and the number of items per skill, in order that correct decisions of mastery or nonmastery will be made to a desired degree of probability. It is assumed that a random sample of skills will be selected for measurement, that each skill will be measured by the same number of…

Descriptors: Achievement Tests, Cutting Scores, Decision Making, Equivalency Tests

Guidebook for Developing Criterion-Referenced Tests.

Download full text

Swezey, Robert W.; Pearlstein, Richard B. – 1975

This manual outlines the rationale for using the Criterion Referenced Test (CRT) approach and suggests specific guidelines for test developers to use in constructing test items. Methods for assessing the adequacy of a CRT are also provided. (Author/RC)

Descriptors: Behavioral Objectives, Check Lists, Comparative Analysis, Criterion Referenced Tests

Previous Page | Next Page »

Pages: 1 | 2

Barack, Leonard I.	1
Cliff, Norman	1
Cook, Linda L.	1
Epstein, Kenneth I.	1
Faggen, Jane	1
Fiske, Donald W.	1
Forster, Fred	1
Haladyna, Tom	1
Harris, Chester W.	1
Knerr, Claramae S.	1
Kolakowski, Donald	1
Kriewall, Thomas E.	1
Linn, Robert	1
Lord, Frederic M.	1
Mislevy, Robert J.	1
Pearlstein, Richard B.	1
Smith, Douglas U.	1
Swezey, Robert W.	1
Wilcox, Rand R.	1
More ▼