Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 1 |
Descriptor
Difficulty Level | 5 |
Item Sampling | 5 |
Test Reliability | 5 |
Item Analysis | 3 |
Test Items | 2 |
Test Theory | 2 |
Test Validity | 2 |
True Scores | 2 |
Achievement Tests | 1 |
Career Development | 1 |
Computer Assisted Testing | 1 |
More ▼ |
Source
Applied Psychological… | 1 |
Assessment & Evaluation in… | 1 |
Educational and Psychological… | 1 |
Illinois School Research | 1 |
Publication Type
Reports - Research | 2 |
Journal Articles | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Audience
Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Burton, Richard F. – Assessment & Evaluation in Higher Education, 2006
Many academic tests (e.g. short-answer and multiple-choice) sample required knowledge with questions scoring 0 or 1 (dichotomous scoring). Few textbooks give useful guidance on the length of test needed to do this reliably. Posey's binomial error model of 1932 provides the best starting point, but allows neither for heterogeneity of question…
Descriptors: Item Sampling, Tests, Test Length, Test Reliability

Shoemaker, David M. – Educational and Psychological Measurement, 1972
Descriptors: Difficulty Level, Error of Measurement, Item Sampling, Simulation
Kriewall, Thomas E. – Illinois School Research, 1972
Author discusses and defines criterion tests in the context of classroom needs that have created much of the interest in the theory at this time. The primary source of interest is related to the growing implementation of individualized curricula. (Author/CB)
Descriptors: Criterion Referenced Tests, Difficulty Level, Individualized Instruction, Item Analysis

Lord, Frederic M. – Applied Psychological Measurement, 1977
Under given conditions, conventional testing and computer-generated repeatable testing (CGRT) are equally effective for estimating examinee ability; CGRT is more effective for estimating the mean ability level of a group and less effective for estimating ability differences among individuals. These conclusion are drawn from domain-referenced test…
Descriptors: Career Development, Computer Assisted Testing, Difficulty Level, Group Norms
Forster, Fred – 1987
Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests administered in grades 3-8 of reading, mathematics, and science items, as well as standardized test results were used to explore the possible effects of many factors on item calibrations. In general, the results…
Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis