ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	4

Descriptor

Item Sampling	23
Test Reliability	23
Test Construction	10
Item Analysis	9
Test Validity	8
Criterion Referenced Tests	7
Test Items	7
Mathematical Models	6
Achievement Tests	5
Item Banks	5
Statistical Analysis	5
Test Interpretation	5
Latent Trait Theory	4
Mastery Tests	4
Sampling	4
Test Theory	4
Career Development	3
Comparative Analysis	3
Decision Making	3
Educational Assessment	3
Elementary Secondary Education	3
Norm Referenced Tests	3
Analysis of Variance	2
College Students	2
Difficulty Level	2
More ▼

Source

Assessment & Evaluation in…	1
International Journal of…	1
Journal of Educational…	1
Physical Review Physics…	1
Practical Assessment,…	1

Publication Type

Reports - Research	23
Speeches/Meeting Papers	7
Journal Articles	5

Education Level

Elementary Education	1
Grade 4	1
Higher Education	1
Intermediate Grades	1
Postsecondary Education	1

Audience

Researchers

Location

Bosnia and Herzegovina	1
Croatia	1
Slovenia	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	1
Pennsylvania Educational…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 23 results Save | Export

A Practical Guide to Item Bank Calibration with Multiple Matrix Sampling

Peer reviewed
PDF on ERIC

Download full text

Eren Can Aybek; Serkan Arikan; Günes Ertas – International Journal of Assessment Tools in Education, 2024

When it is required to estimate item parameters of a large item bank, Multiple Matrix Sampling (MMS) design provides an efficient way while minimizing the test burden on students. The current study exemplifies how to calibrate a large item pool using MMS design for various purposes, such as developing a CAT administration. The purpose of the…

Descriptors: Elementary School Mathematics, Elementary School Students, Grade 4, Item Banks

Maintaining Item Banks with the Rasch Model: An Example from Wave Optics

Peer reviewed

Direct link

Glamocic, Džana Salibašic; Mešic, Vanes; Neumann, Knut; Sušac, Ana; Boone, William J.; Aviani, Ivica; Hasovic, Elvedin; Erceg, Nataša; Repnik, Robert; Grubelnik, Vladimir – Physical Review Physics Education Research, 2021

Item banks are generally considered the basis of a new generation of educational measurement. In combination with specialized software, they can facilitate the computerized assembling of multiple pre-equated test forms. However, for advantages of item banks to become fully realized it is important that the item banks store a relatively large…

Descriptors: Item Banks, Test Items, Item Response Theory, Item Sampling

Determining Item Screening Criteria Using Cost-Benefit Analysis

Peer reviewed
PDF on ERIC

Download full text

Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019

Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…

Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy

Sampling Knowledge and Understanding: How Long Should a Test Be?

Peer reviewed

Direct link

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2006

Many academic tests (e.g. short-answer and multiple-choice) sample required knowledge with questions scoring 0 or 1 (dichotomous scoring). Few textbooks give useful guidance on the length of test needed to do this reliably. Posey's binomial error model of 1932 provides the best starting point, but allows neither for heterogeneity of question…

Descriptors: Item Sampling, Tests, Test Length, Test Reliability

An Evaluation of a Multiple Matrix Sampling Procedure for a State Assessment Program.

Download full text

Kohr, Richard L. – 1976

Pennsylvania's Educational Quality Assessment Program provides each participating school with a building level report in which state percentiles are a prominent part. Multiple matrix sampling was being considered as a technique to reduce testing time. However, there was great concern that the error associated with estimating the school mean might…

Descriptors: Educational Assessment, Elementary Secondary Education, Item Sampling, Measurement Techniques

Techniques for Analyzing Test Response Data.

Download full text

Harris, Chester W. – 1975

Achievement tests which are specifically linked to an instructional program and have been developed in relation to an objectives base and/or to an item generation rule are considered, as well as student response data. Three types of studies are outlined and the kind of procedures thought useful illustrated. As various methods for examining…

Descriptors: Achievement Tests, Instructional Programs, Item Banks, Item Sampling

An Empirical Investigation of the Applicability of Multiple Matrix Sampling to the Method of Rank Order.

Peer reviewed

Askegaard, Lewis D.; Umila, Benwardo V. – Journal of Educational Measurement, 1982

Multiple matrix sampling of items and examinees was applied to an 18-item rank order instrument administered to a randomly assigned group and compared to the ordering and ranking of all items by control subjects. High correlations between ranks suggest the methodology may viably reduce respondent effort on long rank ordering tasks. (Author/CM)

Descriptors: Evaluation Methods, Item Sampling, Junior High Schools, Student Reaction

Content Validity in Behavioral Assessment.

Linehan, Marsha M. – 1976

Both criterion-referenced testing and behavioral assessment share the basic assumption that test behavior is a sample rather than a sign. In addition, both types of assessment focus on response capabilities and performance in specified content domains. Although content validity has been traditionally recognized as essential to criterion-referenced…

Descriptors: Behavior Patterns, Content Analysis, Criterion Referenced Tests, Informal Assessment

The Effects of Various Item Selection Methods on the Classification Accuracy and Classification Consistency of Criterion-Referenced Instruments.

Smith, Douglas U. – 1978

This study examined the effects of certain item selection methods on the classification accuracy and classification consistency of criterion-referenced instruments. Three item response data sets, representing varying situations of instructional effectiveness, were simulated. Five methods of item selection were then applied to each data set for the…

Descriptors: Criterion Referenced Tests, Item Analysis, Item Sampling, Latent Trait Theory

Criterion-Referenced Test Interpretations of "Classical" Measurement Theory.

Download full text

Epstein, Kenneth I.; Knerr, Claramae S. – 1976

The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…

Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling

Decision Reliability and Classification Validity for Decision Oriented Criterion-Referenced Tests.

Faggen, Jane – 1978

Formulas are presented for decision reliability and for classification validity for mastery/nonmastery decisions based on criterion referenced tests. Two item parameters are used: the probability of a master answering an item correctly, and the probability of a nonmaster answering an item incorrectly. The theory explores the relationships of…

Descriptors: Bayesian Statistics, Criterion Referenced Tests, Item Analysis, Item Banks

Estimating a Correlation Coefficient Using a Multiple Matrix Sampling Disign.

PDF pending restoration

Estes, Carole; Estes, Gary D. – 1980

Multiple matrix sampling is a sampling design in which both test items and examinees are randomly sampled from their respective populations. This study was designed to develop and assess a method for computing an estimate of a correlation coefficient when a multiple matrix sampling design is used. The examinee populations included 212 third-grade…

Descriptors: Correlation, Elementary Secondary Education, Evaluation Methods, Grade 3

A Basic Test Theory Generalizable to Tailored Testing. Technical Report No. 1.

Download full text

Cliff, Norman – 1975

Measures of consistency and completeness of order relations derived from test-type data are proposed. The measures are generalized to apply to incomplete data such as tailored testing. The measures are based on consideration of the items-plus-persons by items-plus-persons matrix as an adjacency matrix in which a 1 means that the row element…

Descriptors: Adaptive Testing, Career Development, Computer Oriented Programs, Individual Differences

Construction and Use of Criterion-Referenced Tests in Program Evaluation Studies. Laboratory of Psychometric and Evaluation Research Report No. 102.

Download full text

Gifford, Janice A.; Hambleton, Ronald K. – 1980

Technical considerations associated with item selection and reliability assessment are considered in relation to criterion-referenced tests constructed to provide group information. The purpose is to emphasize test building and the evaluation of test scores in program evaluation studies. It is stressed that an evaluator employ a performance or…

Descriptors: Criterion Referenced Tests, Group Testing, Item Sampling, Models

Scale-Score Reporting of National Assessment Data (Final Report).

Download full text

Mislevy, Robert J.; And Others – 1982

An approach was developed based on item-response models defined at the level of salient subject groups rather than at the level of individuals, designed for use with multiple-matrix sampling designs. In each of three National Assessment of Educational Progress (NAEP) mathematics subtopics, Reiser's group-effects latent trait model was fitted to…

Descriptors: Educational Assessment, Item Analysis, Item Sampling, Latent Trait Theory

Previous Page | Next Page »

Pages: 1 | 2

Askegaard, Lewis D.	1
Aviani, Ivica	1
Bashkov, Bozhidar M.	1
Boone, William J.	1
Brown, James Dean	1
Burton, Richard F.	1
Carloni, John A.	1
Clauser, Jerome C.	1
Cliff, Norman	1
Cook, Linda L.	1
Epstein, Kenneth I.	1
Erceg, Nataša	1
Eren Can Aybek	1
Estes, Carole	1
Estes, Gary D.	1
Faggen, Jane	1
Forster, Fred	1
Gifford, Janice A.	1
Gillmore, Gerald M.	1
Glamocic, Džana Salibašic	1
Grubelnik, Vladimir	1
Günes Ertas	1
Haladyna, Tom	1
Hambleton, Ronald K.	1
More ▼