ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	1

Descriptor

Difficulty Level	5
Item Sampling	5
Test Reliability	5
Item Analysis	3
Test Items	2
Test Theory	2
Test Validity	2
True Scores	2
Achievement Tests	1
Career Development	1
Computer Assisted Testing	1
Criterion Referenced Tests	1
Elementary Education	1
Error of Measurement	1
Group Norms	1
Individual Differences	1
Individualized Instruction	1
Item Banks	1
Knowledge Level	1
Latent Trait Theory	1
Mathematical Models	1
Measurement Objectives	1
Models	1
Norm Referenced Tests	1
Scoring	1
More ▼

Source

Applied Psychological…	1
Assessment & Evaluation in…	1
Educational and Psychological…	1
Illinois School Research	1

Author

Burton, Richard F.	1
Forster, Fred	1
Kriewall, Thomas E.	1
Lord, Frederic M.	1
Shoemaker, David M.	1

Publication Type

Reports - Research	2
Journal Articles	1
Speeches/Meeting Papers	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Sampling Knowledge and Understanding: How Long Should a Test Be?

Peer reviewed

Direct link

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2006

Many academic tests (e.g. short-answer and multiple-choice) sample required knowledge with questions scoring 0 or 1 (dichotomous scoring). Few textbooks give useful guidance on the length of test needed to do this reliably. Posey's binomial error model of 1932 provides the best starting point, but allows neither for heterogeneity of question…

Descriptors: Item Sampling, Tests, Test Length, Test Reliability

Standard Errors of Estimate in Item-Examinee Sampling as a Function of Test Reliability, Variation in Item Difficulty Indices and Degree of Skewness in the Normative Distribution

Peer reviewed

Shoemaker, David M. – Educational and Psychological Measurement, 1972

Descriptors: Difficulty Level, Error of Measurement, Item Sampling, Simulation

Aspects and Applications of Criterion-Referenced Tests

Kriewall, Thomas E. – Illinois School Research, 1972

Author discusses and defines criterion tests in the context of classroom needs that have created much of the interest in the theory at this time. The primary source of interest is related to the growing implementation of individualized curricula. (Author/CB)

Descriptors: Criterion Referenced Tests, Difficulty Level, Individualized Instruction, Item Analysis

Some Item Analysis and Test Theory for a System of Computer-Assisted Test Construction for Individualized Instruction

Peer reviewed

Lord, Frederic M. – Applied Psychological Measurement, 1977

Under given conditions, conventional testing and computer-generated repeatable testing (CGRT) are equally effective for estimating examinee ability; CGRT is more effective for estimating the mean ability level of a group and less effective for estimating ability differences among individuals. These conclusion are drawn from domain-referenced test…

Descriptors: Career Development, Computer Assisted Testing, Difficulty Level, Group Norms

Riding the Rasch Tiger. Part 1: Laying the Item Bank Foundation (Paul Volker Would Approve).

Forster, Fred – 1987

Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests administered in grades 3-8 of reading, mathematics, and science items, as well as standardized test results were used to explore the possible effects of many factors on item calibrations. In general, the results…

Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis