Roberts, James S. – Applied Psychological Measurement, 2008
Orlando and Thissen (2000) developed an item fit statistic for binary item response theory (IRT) models known as S-X². This article generalizes their statistic to polytomous unfolding models. Four alternative formulations of S-X² are developed for the generalized graded unfolding model (GGUM). The GGUM is a…
Descriptors: Item Response Theory, Goodness of Fit, Test Items, Models
van Ginkel, Joost R.; van der Ark, L. Andries; Sijtsma, Klaas – Multivariate Behavioral Research, 2007
The performance of five simple multiple imputation methods for dealing with missing data was compared. In addition, random imputation and multivariate normal imputation were used as lower and upper benchmarks, respectively. Test data were simulated and item scores were deleted such that they were either missing completely at random, missing at…
Descriptors: Evaluation Methods, Psychometrics, Item Response Theory, Scores
Holden, Ronald R. – 1985
Modern test construction strategies in the areas of personality and psychopathology differ in the use of disguise within test stimulus material. Previous research on the validity of using disguised test item content has favored the rational strategy of test construction which views disguise as a liability under normal test-taking circumstances.…
Descriptors: Adults, Evaluation Methods, Psychopathology, Test Construction

Van Der Flier, Henk; And Others – Journal of Educational Measurement, 1984
Two strategies for assessing item bias are discussed: methods comparing item difficulties unconditional on ability and methods comparing probabilities of response conditional on ability. Results suggest that the iterative logit method is an improvement on the noniterative one and is efficient in detecting biased and unbiased items. (Author/DWH)
Descriptors: Algorithms, Evaluation Methods, Item Analysis, Scores

Reckase, Mark D.; McKinley, Robert L. – 1984
A new indicator of item difficulty, which identifies effectiveness ranges, overcomes the limitations of other item difficulty indexes in describing the difficulty of an item or a test as a whole and in aiding the selection of appropriate ability level items for a test. There are three common uses of the term "item difficulty": (1) the probability…
Descriptors: Difficulty Level, Evaluation Methods, Item Analysis, Latent Trait Theory
Furst, Edward J. – 1983
Enough evidence has accumulated on Bloom's "Taxonomy of Educational Objectives" for the cognitive domain to justify a review of its communicability. This article covers both published and unpublished studies as well as certain informal reports that bear on this property. It also examines possibilities for improving agreement among…
Descriptors: Achievement Tests, Classification, Cognitive Processes, Diffusion (Communication)
Hiscox, Michael D. – 1983
Educational item banking presents observers with a considerable paradox. The development of test items from scratch is viewed as wasteful, a luxury in times of declining resources. On the other hand, item banking has failed to become a mature technology despite large amounts of money and the efforts of talented professionals. The question of which…
Descriptors: Computer Assisted Testing, Cost Effectiveness, Cost Estimates, Educational Testing

Snyder, Scott; Sheehan, Robert – Journal of Early Intervention, 1992
This examination of the Rasch scaling model concludes that the model could potentially facilitate objective comparisons of status and change of young children with disabilities at individual and group levels. The paper discusses applications of the model to early childhood assessment in the areas of item banking, test analysis, and subject…
Descriptors: Disabilities, Evaluation Methods, Item Response Theory, Measurement Techniques
Thomas, Julia Anne – 1985
A sample of 234 fifth- and 259 sixth-grade students scaled the items of the Piers-Harris, Tennessee, Coopersmith, and Lipsett self-concept measures. The scaling of the Piers-Harris and the Tennessee inventories was examined in reference to their subscales. The present technique placed items on a bivariate plane of two orthogonal dimensions…
Descriptors: Evaluation Methods, Factor Structure, Intermediate Grades, Orthogonal Rotation
Cook, Linda L.; Petersen, Nancy S. – 1986
This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…
Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods
Valentine, Jerry W.; Bowman, Michael L. – 1986
This technical manual is presented to assist those using the Audit of Principal Effectiveness, an 80-statement evaluation instrument designed to determine teachers' perceptions of principals' effectiveness, allow principals to obtain feedback from teachers regarding strengths and weaknesses, and provide a useful tool for researchers studying…
Descriptors: Administrator Evaluation, Elementary Secondary Education, Evaluation Methods, Feedback
Rengel, Elizabeth – 1986
The Ball Aptitude Battery (BAB) was examined for item bias in a sample of 577 high school students in which males and females, as well as three ethnic groups (Blacks, Whites, and Hispanics), were represented. The objectives of the investigation were: (1) to assess the level of interrater agreement for the judgmental method; (2) to find the level of…
Descriptors: Academic Aptitude, Aptitude Tests, Black Students, Culture Fair Tests