Descriptor
| Difficulty Level | 5 |
| Item Sampling | 5 |
| Test Reliability | 5 |
| Item Analysis | 3 |
| Test Items | 2 |
| Test Theory | 2 |
| Test Validity | 2 |
| True Scores | 2 |
| Achievement Tests | 1 |
| Career Development | 1 |
| Computer Assisted Testing | 1 |
| More ▼ | |
Source
| Applied Psychological… | 1 |
| Assessment & Evaluation in… | 1 |
| Educational and Psychological… | 1 |
| Illinois School Research | 1 |
Publication Type
| Reports - Research | 2 |
| Journal Articles | 1 |
| Speeches/Meeting Papers | 1 |
Education Level
Audience
| Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedShoemaker, David M. – Educational and Psychological Measurement, 1972
Descriptors: Difficulty Level, Error of Measurement, Item Sampling, Simulation
Burton, Richard F. – Assessment & Evaluation in Higher Education, 2006
Many academic tests (e.g. short-answer and multiple-choice) sample required knowledge with questions scoring 0 or 1 (dichotomous scoring). Few textbooks give useful guidance on the length of test needed to do this reliably. Posey's binomial error model of 1932 provides the best starting point, but allows neither for heterogeneity of question…
Descriptors: Item Sampling, Tests, Test Length, Test Reliability
Kriewall, Thomas E. – Illinois School Research, 1972
Author discusses and defines criterion tests in the context of classroom needs that have created much of the interest in the theory at this time. The primary source of interest is related to the growing implementation of individualized curricula. (Author/CB)
Descriptors: Criterion Referenced Tests, Difficulty Level, Individualized Instruction, Item Analysis
Peer reviewedLord, Frederic M. – Applied Psychological Measurement, 1977
Under given conditions, conventional testing and computer-generated repeatable testing (CGRT) are equally effective for estimating examinee ability; CGRT is more effective for estimating the mean ability level of a group and less effective for estimating ability differences among individuals. These conclusion are drawn from domain-referenced test…
Descriptors: Career Development, Computer Assisted Testing, Difficulty Level, Group Norms
Forster, Fred – 1987
Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests administered in grades 3-8 of reading, mathematics, and science items, as well as standardized test results were used to explore the possible effects of many factors on item calibrations. In general, the results…
Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis

Direct link
