ERIC - Search Results

Showing 1 to 15 of 21 results

Digital Module 08: Foundations of Operational Item Analysis https://ncme.elevate.commpartners.com

Peer reviewed

Yoo, Hanwook; Hambleton, Ronald K. – Educational Measurement: Issues and Practice, 2019

Item analysis is an integral part of operational test development and is typically conducted within two popular statistical frameworks: classical test theory (CTT) and item response theory (IRT). In this digital ITEMS module, Hanwook Yoo and Ronald K. Hambleton provide an accessible overview of operational item analysis approaches within these…

Descriptors: Item Analysis, Item Response Theory, Guidelines, Test Construction

The Effect of Rating Unfamiliar Items on Angoff Passing Scores

Peer reviewed

Clauser, Jerome C.; Hambleton, Ronald K.; Baldwin, Peter – Educational and Psychological Measurement, 2017

The Angoff standard setting method relies on content experts to review exam items and make judgments about the performance of the minimally proficient examinee. Unfortunately, at times content experts may have gaps in their understanding of specific exam content. These gaps are particularly likely to occur when the content domain is broad and/or…

Descriptors: Scores, Item Analysis, Classification, Decision Making

ResidPlots-2: Computer Software for IRT Graphical Residual Analyses

Peer reviewed

Liang, Tie; Han, Kyung T.; Hambleton, Ronald K. – Applied Psychological Measurement, 2009

This article discusses ResidPlots-2, a computer program that provides a powerful tool for IRT graphical residual analyses. ResidPlots-2 consists of two components: one for computing residual statistics and another for communicating with users and plotting the residual graphs. The features of the ResidPlots-2 software are…

Descriptors: Computer Software, Statistics, Item Response Theory, Graphs

DIF Detection and Interpretation in Large-Scale Science Assessments: Informing Item Writing Practices

Peer reviewed

Zenisky, April L.; Hambleton, Ronald K.; Robin, Frederic – Educational Assessment, 2004

Differential item functioning (DIF) analyses are a routine part of the development of large-scale assessments. Less common are studies to understand the potential sources of DIF. The goals of this study were (a) to identify gender DIF in a large-scale science assessment and (b) to look for trends in the DIF and non-DIF items due to content,…

Descriptors: Program Effectiveness, Test Format, Science Tests, Test Items

On the Use of Content Specialists in the Assessment of Criterion-Referenced Test Item Validity.

Rovinelli, Richard J.; Hambleton, Ronald K. – 1976

Essential for an effective criterion-referenced testing program is a set of test items that are "valid" indicators of the objectives they have been designed to measure. Unfortunately, the complex matter of assessing item validity has received only limited attention from educational measurement specialists. One promising approach to the item…

Descriptors: Content Analysis, Criterion Referenced Tests, Data Collection, Evaluation Methods

Analysis of Empirical Data Using Two Logistic Latent Trait Models.

Hambleton, Ronald K.; Traub, Ross E. – 1970

Georg Rasch has developed a new one-parameter latent trait model to explain the performance of examinees on achievement tests. The model can be viewed as a special case of Birnbaum's two-parameter logistic model where all items are assumed to have equal discriminating power. Birnbaum's model permits items to vary in discriminating power. Both…

Descriptors: Academic Ability, Achievement Tests, Aptitude Tests, Item Analysis

Using Microcomputers to Develop Tests.

Peer reviewed

Hambleton, Ronald K. – Educational Measurement: Issues and Practice, 1984

The purpose of this paper is to describe some of the current changes in test development that are taking place because of the availability and capabilities of computers, especially microcomputers. Item banking and test assembly are discussed, and a comprehensive testing system is described. (BW)

Descriptors: Computer Assisted Testing, Computer Software, Educational Testing, Elementary Secondary Education

Detecting Biased Test Items: Comparison of the IRT Area and Mantel-Haenszel Methods.

Hambleton, Ronald K.; Rogers, H. Jane – 1988

The agreement between item response theory-based and Mantel-Haenszel (MH) methods in identifying biased items on tests was studied. Data came from item responses of four spaced samples of 1,000 examinees each: two samples of 1,000 Anglo-American and two samples of 1,000 Native American students taking the New Mexico High School Proficiency…

Descriptors: Comparative Analysis, High School Students, High Schools, Item Analysis

Using Residual Analyses to Assess Item Response Model-Test Data Fit. Laboratory of Psychometric and Evaluative Research Report No. 140.

Murray, Linda N.; Hambleton, Ronald K. – 1983

The purpose of this research study was to assess item response model-test data fit using residuals. First, a comparison of raw and standardized residuals for describing model-test data fit was carried out. Second, hypotheses concerning the relationship between residual sizes and several item characteristics were studied. The analyses with…

Descriptors: Educational Assessment, Goodness of Fit, Item Analysis, Latent Trait Theory

Assessing the Dimensionality of a Set of Test Items.

Hambleton, Ronald K.; Rovinelli, Richard J. – 1986

Four methods for determining the dimensionality of a set of test items were compared: (1) linear factor analysis; (2) residual analysis; (3) nonlinear factor analysis; and (4) Bejar's method. Five artificial test data sets (for 40 items and 1500 examinees) were generated, consistent with the three-parameter logistic model and the assumption of…

Descriptors: Comparative Analysis, Computer Simulation, Correlation, Factor Analysis

Evaluation of Computer Simulated Baseline Statistics for Use in Item Bias Studies. [Revised].

Rogers, H. Jane; Hambleton, Ronald K. – 1987

Although item bias statistics are widely recommended for use in test development and test analysis work, problems arise in their interpretation. The purpose of the present research was to evaluate the validity of logistic test models and computer simulation methods for providing a frame of reference for item bias statistic interpretations.…

Descriptors: Computer Simulation, Evaluation Methods, Item Analysis, Latent Trait Theory

Some Results on the Robustness of Latent Trait Models.

Hambleton, Ronald K.; Cook, Linda L. – 1978

The purpose of the present research was to study, systematically, the "goodness-of-fit" of the one-, two-, and three-parameter logistic models. We studied, using computer-simulated test data, the effects of four variables: variation in item discrimination parameters, the average value of the pseudo-chance level parameters, test length,…

Descriptors: Career Development, Difficulty Level, Goodness of Fit, Item Analysis

A Look at Psychometrics in the Netherlands.

Hambleton, Ronald K.; Swaminathan, H. – 1985

Comments are made on the review papers presented by six Dutch psychometricians: Ivo Molenaar, Wim van der Linden, Ed Roskam, Arnold Van den Wollenberg, Gideon Mellenbergh, and Dato de Gruijter. Molenaar has embraced a pragmatic viewpoint on Bayesian methods, using both empirical and pure approaches to solve educational research problems. Molenaar…

Descriptors: Bayesian Statistics, Decision Making, Elementary Secondary Education, Foreign Countries

Application of Latent Trait Models to the Development of Norm-Referenced and Criterion-Referenced Tests.

Cook, Linda L.; Hambleton, Ronald K. – 1978

Latent trait models may offer considerable potential for the improvement of educational measurement practices, but until recently, they have received only limited attention from measurement specialists. This paper provides a brief introduction to latent trait models, and provides test practitioners with a non-technical introduction to the…

Descriptors: Career Development, Criterion Referenced Tests, Difficulty Level, Item Analysis

Optimal Item Selection with Credentialing Examinations.

Hambleton, Ronald K.; And Others – 1987

The study compared two promising item response theory (IRT) item-selection methods, optimal and content-optimal, with two non-IRT item selection methods, random and classical, for use in fixed-length certification exams. The four methods were used to construct 20-item exams from a pool of approximately 250 items taken from a 1985 certification…

Descriptors: Comparative Analysis, Content Validity, Cutting Scores, Difficulty Level
