Descriptor
Test Items | 9 |
Test Length | 9 |
Latent Trait Theory | 6 |
Item Analysis | 5 |
Difficulty Level | 4 |
Equated Scores | 4 |
Test Construction | 4 |
Statistical Studies | 3 |
Testing Problems | 3 |
Computer Simulation | 2 |
High Schools | 2 |
More ▼ |
Source
Author
Bashaw, W. L. | 1 |
Boyd, Thomas A. | 1 |
Chang, S. Tai | 1 |
Cleary, T. Anne | 1 |
Gilmer, Jerry S. | 1 |
Hambleton, Ronald K. | 1 |
Harnisch, Delwyn L. | 1 |
Hwang, Chi-en | 1 |
Lenel, Julia C. | 1 |
Livingston, Samuel A. | 1 |
Noble, Christopher S. | 1 |
More ▼ |
Publication Type
Speeches/Meeting Papers | 9 |
Reports - Research | 8 |
Information Analyses | 1 |
Education Level
Audience
Researchers | 9 |
Location
Laws, Policies, & Programs
Assessments and Surveys
New Jersey College Basic… | 1 |
Otis Lennon School Ability… | 1 |
Texas Assessment of Basic… | 1 |
Texas Educational Assessment… | 1 |
Wechsler Intelligence Scale… | 1 |
Wechsler Intelligence Scales… | 1 |
What Works Clearinghouse Rating
Chang, S. Tai; Bashaw, W. L. – 1984
The purpose of this study was twofold: to investigate to what extent characteristics of anchor tests may affect precision of item calibration, and to estimate to what extent precision of item calibration may be affected by removal of persons whose response patterns deviate from those normally expected from the Rasch one-parameter logistic model.…
Descriptors: Aptitude Tests, Difficulty Level, Equated Scores, Junior High Schools
Hwang, Chi-en; Cleary, T. Anne – 1986
The results obtained from two basic types of pre-equatings of tests were compared: the item response theory (IRT) pre-equating and section pre-equating (SPE). The simulated data were generated from a modified three-parameter logistic model with a constant guessing parameter. Responses of two replication samples of 3000 examinees on two 72-item…
Descriptors: Computer Simulation, Equated Scores, Latent Trait Theory, Mathematical Models
Hambleton, Ronald K.; And Others – 1987
The study compared two promising item response theory (IRT) item-selection methods, optimal and content-optimal, with two non-IRT item selection methods, random and classical, for use in fixed-length certification exams. The four methods were used to construct 20-item exams from a pool of approximately 250 items taken from a 1985 certification…
Descriptors: Comparative Analysis, Content Validity, Cutting Scores, Difficulty Level
Lenel, Julia C.; Gilmer, Jerry S. – 1986
In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…
Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)
Samejima, Fumiko – 1986
Item analysis data fitting the normal ogive model were simulated in order to investigate the problems encountered when applying the three-parameter logistic model. Binary item tests containing 10 and 35 items were created, and Monte Carlo methods simulated the responses of 2,000 and 500 examinees. Item parameters were obtained using Logist 5.…
Descriptors: Computer Simulation, Difficulty Level, Guessing (Tests), Item Analysis
Livingston, Samuel A. – 1987
The effect of increased writing or planning time on a test of basic college level writing ability was studied. The essay portion of the New Jersey College Basic Skills Placement Test was given to students in nine New Jersey public colleges and three New Jersey public high schools. Each student wrote two essays on two different topics. The first…
Descriptors: Academic Ability, Difficulty Level, Essay Tests, High Schools
Boyd, Thomas A.; Tramontana, Michael G. – 1984
To examine the validity of short forms of the Wechsler Intelligence Scale for Children-Revised (WISC-R), the WISC-R was first administered to 106 hospitalized psychiatric patients, aged 8-16. No subjects had a primary diagnosis of mental retardation or learning disability, and one-third were receiving psychotropic medication. WISC-R IQ scores…
Descriptors: Adolescents, Children, Correlation, Elementary Secondary Education
Harnisch, Delwyn L. – 1985
Computer adaptive testing systems are feasible for certification and licensure testing. This is in part due to the availability of extensive yet inexpensive computers. Modern item response theory, combined with computerized adaptive testing, yields a powerful new method of testing which provides greater accuracy and efficiency and less boredom for…
Descriptors: Adaptive Testing, Certification, Computer Assisted Testing, Cost Effectiveness
Noble, Christopher S.; And Others – 1986
The relationship between item omission and item position on criterion-referenced tests in the Texas state assessment program is examined. Item statistics from the Texas Educational Assessment of Minimum Skills (TEAMS) and Texas Assessment of Basic Skills (TABS) mathematics and reading tests from 1983 through 1985 are examined for three ethnic…
Descriptors: Basic Skills, Blacks, Criterion Referenced Tests, Ethnic Groups