ERIC - Search Results

Showing all 5 results

Sampling Knowledge and Understanding: How Long Should a Test Be?

Peer reviewed

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2006

Many academic tests (e.g. short-answer and multiple-choice) sample required knowledge with questions scoring 0 or 1 (dichotomous scoring). Few textbooks give useful guidance on the length of test needed to do this reliably. Posey's binomial error model of 1932 provides the best starting point, but allows neither for heterogeneity of question…

Descriptors: Item Sampling, Tests, Test Length, Test Reliability

Content-Sampling as an Evaluation and Research Technique

Peer reviewed

Poggio, John P.; Glasnapp, Douglas R. – Educational and Psychological Measurement, 1973

Descriptors: Academic Achievement, Evaluation Methods, Formative Evaluation, Item Sampling

Latent Trait Estimation: Theory vs. Practice.

Kolakowski, Donald – 1972

Empirical results are presented as regards the implementation of a latent-trait psychometric model by means of conditional maximum likelihood estimation. Items are scored polychotomously into varying numbers of nominal categories and the test and item characteristic curves and information functions are examined. It is concluded that scoring items…

Descriptors: Error of Measurement, Item Analysis, Item Sampling, Measurement Techniques

Construction and Use of Criterion-Referenced Tests in Program Evaluation Studies. Laboratory of Psychometric and Evaluation Research Report No. 102.

Gifford, Janice A.; Hambleton, Ronald K. – 1980

Technical considerations associated with item selection and reliability assessment are considered in relation to criterion-referenced tests constructed to provide group information. The purpose is to emphasize test building and the evaluation of test scores in program evaluation studies. It is stressed that an evaluator employ a performance or…

Descriptors: Criterion Referenced Tests, Group Testing, Item Sampling, Models

Aspects and Applications of Criterion-Referenced Tests.

Kriewall, Thomas E. – 1972

The measurement information generated by CRT's is designed for use in instructional management systems where classifications of pupils for treatment are to be decided on the basis of minimal data consistent with predetermined limits for the errors of misclassification. The measures obtained are content specific estimates of proficiency useful for…

Descriptors: Ability Grouping, Academic Achievement, Criterion Referenced Tests, Decision Making