ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	2

Descriptor

Evaluation Methods	12
Test Items	12
Test Construction	6
Psychometrics	4
Item Analysis	3
Item Response Theory	3
Scaling	3
Scores	3
Latent Trait Theory	2
Measurement Techniques	2
Models	2
Simulation	2
Statistical Analysis	2
Test Bias	2
Test Validity	2
Academic Aptitude	1
Achievement Tests	1
Administrator Evaluation	1
Adults	1
Algorithms	1
Aptitude Tests	1
Benchmarking	1
Black Students	1
Classification	1
Cognitive Processes	1
More ▼

Source

Applied Psychological…	1
Journal of Early Intervention	1
Journal of Educational…	1
Multivariate Behavioral…	1

Author

Bowman, Michael L.	1
Cook, Linda L.	1
Furst, Edward J.	1
Hiscox, Michael D.	1
Holden, Ronald R.	1
McKinley, Robert L.	1
Petersen, Nancy S.	1
Reckase, Mark D.	1
Rengel, Elizabeth	1
Roberts, James S.	1
Sheehan, Robert	1
Sijtsma, Klaas	1
Snyder, Scott	1
Thomas, Julia Anne	1
Valentine, Jerry W.	1
Van Der Flier, Henk	1
van Ginkel, Joost R.	1
van der Ark, L. Andries	1
More ▼

Publication Type

Reports - Research	6
Speeches/Meeting Papers	6
Journal Articles	4
Reports - Descriptive	3
Guides - Non-Classroom	1
Information Analyses	1
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Audience

Researchers	12
Administrators	1
Practitioners	1

Location

Laws, Policies, & Programs

Assessments and Surveys

Piers Harris Childrens Self…	1
Tennessee Self Concept Scale	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Modified Likelihood-Based Item Fit Statistics for the Generalized Graded Unfolding Model

Peer reviewed

Direct link

Roberts, James S. – Applied Psychological Measurement, 2008

Orlando and Thissen (2000) developed an item fit statistic for binary item response theory (IRT) models known as S-X[superscript 2]. This article generalizes their statistic to polytomous unfolding models. Four alternative formulations of S-X[superscript 2] are developed for the generalized graded unfolding model (GGUM). The GGUM is a…

Descriptors: Item Response Theory, Goodness of Fit, Test Items, Models

Multiple Imputation of Item Scores in Test and Questionnaire Data, and Influence on Psychometric Results

Peer reviewed

Direct link

van Ginkel, Joost R.; van der Ark, L. Andries; Sijtsma, Klaas – Multivariate Behavioral Research, 2007

The performance of five simple multiple imputation methods for dealing with missing data were compared. In addition, random imputation and multivariate normal imputation were used as lower and upper benchmark, respectively. Test data were simulated and item scores were deleted such that they were either missing completely at random, missing at…

Descriptors: Evaluation Methods, Psychometrics, Item Response Theory, Scores

Test Item Disguise and the Structured Assessment of Clinical Psychopathology.

Download full text

Holden, Ronald R. – 1985

Modern test construction strategies in the areas of personality and psychopathology differ in the use of disguise within test stimulus material. Previous research on the validity of using disguised test item content has favored the rational strategy of test construction which views disguise as a liability under normal test-taking circumstances.…

Descriptors: Adults, Evaluation Methods, Psychopathology, Test Construction

An Iterative Item Bias Detection Method.

Peer reviewed

Van Der Flier, Henk; And Others – Journal of Educational Measurement, 1984

Two strategies for assessing item bias are discussed: methods comparing item difficulties unconditional on ability and methods comparing probabilities of response conditional on ability. Results suggest that the iterative logit method is an improvement on the noniterative one and is efficient in detecting biased and unbiased items. (Author/DWH)

Descriptors: Algorithms, Evaluation Methods, Item Analysis, Scores

Item Difficulty Reconsidered: An IRT Perspective.

PDF pending restoration

Reckase, Mark D.; McKinley, Robert L. – 1984

A new indicator of item difficulty, which identifies effectiveness ranges, overcomes the limitations of other item difficulty indexes in describing the difficulty of an item or a test as a whole and in aiding the selection of appropriate ability level items for a test. There are three common uses of the term "item difficulty": (1) the probability…

Descriptors: Difficulty Level, Evaluation Methods, Item Analysis, Latent Trait Theory

Communicability of the Taxonomy of Educational Objectives for the Cognitive Domain.

Furst, Edward J. – 1983

Enough evidence has accumulated on Bloom's "Taxonomy of Educational Objectives" for the cognitive domain to justify a review of its communicability. This article covers both published and unpublished studies as well as certain informal reports that bear on this property. It also examines possibilities for improving agreement among…

Descriptors: Achievement Tests, Classification, Cognitive Processes, Diffusion (Communication)

A Balance Sheet for Educational Item Banking.

Hiscox, Michael D. – 1983

Educational item banking presents observers with a considerable paradox. The development of test items from scratch is viewed as wasteful, a luxury in times of declining resources. On the other hand, item banking has failed to become a mature technology despite large amounts of money and the efforts of talented professionals. The question of which…

Descriptors: Computer Assisted Testing, Cost Effectiveness, Cost Estimates, Educational Testing

The Rasch Measurement Model: An Introduction.

Peer reviewed

Snyder, Scott; Sheehan, Robert – Journal of Early Intervention, 1992

This examination of the Rasch scaling model concludes that the model could potentially facilitate objective comparisons of status and change of young children with disabilities at individual and group levels. The paper discusses applications of the model to early childhood assessment in the areas of item banking, test analysis, and subject…

Descriptors: Disabilities, Evaluation Methods, Item Response Theory, Measurement Techniques

Evaluation of the Construction of the Subscales for the Piers-Harris and Tennessee Inventories.

Thomas, Julia Anne – 1985

A sample of 234 fifth- and 259 sixth-grade students scaled the items of the Piers-Harris, Tennessee, Coopersmith, and Lipsett self-concept measures. The scaling of the Piers-Harris and the Tennessee inventories was examined in reference to their subscales. The present technique placed items on a bivariate plane of two orthogonal dimensions…

Descriptors: Evaluation Methods, Factor Structure, Intermediate Grades, Orthogonal Rotation

Download full text

Cook, Linda L.; Petersen, Nancy S. – 1986

This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…

Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods

Audit of Principal Effectiveness: A User's Technical Manual. Designed and Tested for Principal Assessment in Elementary, Middle and Secondary Schools. Revised.

Download full text

Valentine, Jerry W.; Bowman, Michael L. – 1986

This technical manual is presented to assist those using the Audit of Principal Effectiveness, an 80-statement evaluation instrument designed to determine teachers' perceptions of principals' effectiveness, allow principals to obtain feedback from teachers regarding strengths and weaknesses, and provide a useful tool for researchers studying…

Descriptors: Administrator Evaluation, Elementary Secondary Education, Evaluation Methods, Feedback

Agreement between Statistical and Judgmental Item Bias Methods.

Rengel, Elizabeth – 1986

The Ball Aptitude Battery (BAB) was examined for item bias in a sample of 577 high school students in which males and females, as well as three ethnic groups (Blacks, Whites, and Hispanics) were represented. The objectives of the investigation were: (1) to assess the level of interrater agreement for the judgmental method; (2) to find the level of…

Descriptors: Academic Aptitude, Aptitude Tests, Black Students, Culture Fair Tests