Showing 1 to 15 of 32 results
Peer reviewed
Harold Doran; Tetsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025
Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms
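The generalized objective function in Doran et al. is not spelled out in the abstract, so the sketch below shows only the classic baseline such methods generalize: selecting each next item to maximize Fisher information at the provisional ability estimate under a 2PL model. The item pool, its parameters, and the crude one-step ability update are illustrative assumptions, not the paper's algorithm.

```python
# Minimal CAT sketch: maximum-Fisher-information item selection (2PL).
import numpy as np

def p_2pl(theta, a, b):
    """Probability of a correct response under the 2PL model."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def fisher_info(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = p_2pl(theta, a, b)
    return a ** 2 * p * (1.0 - p)

def select_next_item(theta_hat, a, b, administered):
    """Pick the unused item with maximum information at theta_hat."""
    info = fisher_info(theta_hat, a, b)
    info[list(administered)] = -np.inf   # mask items already given
    return int(np.argmax(info))

rng = np.random.default_rng(0)
a = rng.uniform(0.8, 2.0, size=200)      # hypothetical pool: discriminations
b = rng.normal(0.0, 1.0, size=200)       # hypothetical pool: difficulties
true_theta, theta_hat = 0.5, 0.0
administered = set()
for _ in range(10):
    j = select_next_item(theta_hat, a, b, administered)
    administered.add(j)
    x = float(rng.random() < p_2pl(true_theta, a[j], b[j]))  # simulated response
    # crude one-step score update standing in for a full MLE/EAP estimate
    theta_hat += (x - p_2pl(theta_hat, a[j], b[j])) / max(fisher_info(theta_hat, a[j], b[j]), 0.1)
print(f"administered {len(administered)} items, theta_hat = {theta_hat:.2f}")
```

A production selector would add the kinds of constraints the abstract alludes to (content balancing, exposure control); this sketch shows only the information-maximization core.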
Peer reviewed
Strachan, Tyler; Cho, Uk Hyun; Kim, Kyung Yong; Willse, John T.; Chen, Shyh-Huei; Ip, Edward H.; Ackerman, Terry A.; Weeks, Jonathan P. – Journal of Educational Measurement, 2021
In vertical scaling, results of tests from several different grade levels are placed on a common scale. Most vertical scaling methodologies rely heavily on the assumption that the construct being measured is unidimensional. In many testing situations, however, such an assumption could be problematic. For instance, the construct measured at one…
Descriptors: Item Response Theory, Scaling, Tests, Construct Validity
Peer reviewed
Bolt, Daniel M.; Deng, Sien; Lee, Sora – Journal of Educational Measurement, 2014
Functional form misfit is frequently a concern in item response theory (IRT), although the practical implications of misfit are often difficult to evaluate. In this article, we illustrate how seemingly negligible amounts of functional form misfit, when systematic, can be associated with significant distortions of the score metric in vertical…
Descriptors: Item Response Theory, Scaling, Goodness of Fit, Models
Peer reviewed
Shu, Lianghua; Schwarz, Richard D. – Journal of Educational Measurement, 2014
As a global measure of precision, item response theory (IRT) estimated reliability is derived for four coefficients (Cronbach's α, Feldt-Raju, stratified α, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…
Descriptors: Item Response Theory, Reliability, Models, Computation
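As a rough companion to this abstract, the sketch below computes two of the four coefficients on simulated data: classical Cronbach's α from an item-score matrix, and a marginal reliability from ability estimates and their standard errors. The Rasch-type data generation and the constant SE of 0.45 are assumptions for illustration, not the paper's computational example.

```python
import numpy as np

def cronbach_alpha(X):
    """Cronbach's alpha for an (examinees x items) 0/1 score matrix."""
    k = X.shape[1]
    return (k / (k - 1)) * (1.0 - X.var(axis=0, ddof=1).sum()
                            / X.sum(axis=1).var(ddof=1))

def marginal_reliability(theta_hat, se):
    """IRT marginal reliability: 1 - mean error variance / score variance."""
    return 1.0 - np.mean(se ** 2) / theta_hat.var(ddof=1)

rng = np.random.default_rng(1)
theta = rng.normal(size=500)                                # true abilities
b = rng.normal(size=20)                                     # item difficulties
p = 1.0 / (1.0 + np.exp(-(theta[:, None] - b[None, :])))    # Rasch-type P(correct)
X = (rng.random(p.shape) < p).astype(float)                 # dichotomous responses
se = np.full(500, 0.45)                                     # assumed constant SE
theta_hat = theta + rng.normal(0.0, 0.45, size=500)         # noisy estimates
print(f"Cronbach's alpha     ~ {cronbach_alpha(X):.3f}")
print(f"marginal reliability ~ {marginal_reliability(theta_hat, se):.3f}")
```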
Peer reviewed
Keller, Lisa A.; Hambleton, Ronald K. – Journal of Educational Measurement, 2013
Because recent research on equating methodologies indicates that some methods may be more susceptible to the accumulation of equating error over multiple administrations, the sustainability of several item response theory methods of equating over time was investigated. In particular, the paper focuses on two equating methodologies: fixed common…
Descriptors: Item Response Theory, Scaling, Test Format, Equated Scores
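A toy simulation can show why accumulation matters even when each individual linking is sound: if every administration is linked to the previous one with a small random error, the errors compound. The error standard deviation of 0.05 on the theta scale is an arbitrary illustrative value, not a figure from the study.

```python
# Toy illustration of scale drift under chained linking.
import numpy as np

rng = np.random.default_rng(2)
n_admins, n_chains = 20, 1000
# each linking step adds an independent error to the scale constant
link_errors = rng.normal(0.0, 0.05, size=(n_chains, n_admins))
cumulative_shift = link_errors.cumsum(axis=1)
drift_sd = cumulative_shift.std(axis=0)
for t in (1, 5, 10, 20):
    print(f"after {t:2d} administrations, sd of scale drift ~ {drift_sd[t - 1]:.3f}")
# the sd grows roughly like 0.05 * sqrt(t): drift accumulates even
# though each individual linking is unbiased.
```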
Peer reviewed
Briggs, Derek C. – Journal of Educational Measurement, 2013
A vertical score scale is needed to measure growth across multiple tests in terms of absolute changes in magnitude. Since the warrant for subsequent growth interpretations depends upon the assumption that the scale has interval properties, the validation of a vertical scale would seem to require methods for distinguishing interval scales from…
Descriptors: Measurement, Scaling, Validity, Test Interpretation
Peer reviewed
Klockars, Alan J.; Yamagishi, Midori – Journal of Educational Measurement, 1988
The influence of a verbal label and of its position on the scale in defining the meaning of the labeled point on a rating scale was studied using three forms of the scale in which the labels FAIR and GOOD were systematically moved. When label and position differed in meaning, college students rated the labeled position as a compromise between the two. (SLD)
Descriptors: College Students, Rating Scales, Scaling
Peer reviewed
Burket, George R.; Yen, Wendy M. – Journal of Educational Measurement, 1997
Using simulated data modeled after real tests, a Thurstone method (L. Thurstone, 1925 and later) and three-parameter item response theory were compared for vertical scaling. Neither procedure produced artificial scale shrinkage, and both produced modest scale expansion for one simulated condition. (SLD)
Descriptors: Comparative Analysis, Item Response Theory, Scaling, Simulation
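For readers unfamiliar with the older method, the sketch below implements the core of Thurstone's absolute scaling on invented p-values for six common items: within-group difficulties become normal deviates, and the line relating the two grades' deviates recovers the lower grade's mean and standard deviation on the upper grade's scale. Burket and Yen's simulation design was far more elaborate than this.

```python
import numpy as np
from scipy.stats import norm

# proportion correct on six common items in grades 3 and 4 (invented)
p_g3 = np.array([0.35, 0.48, 0.55, 0.62, 0.71, 0.80])
p_g4 = np.array([0.50, 0.61, 0.66, 0.74, 0.81, 0.88])

# within-group item difficulty as a normal deviate (higher = harder)
z_g3 = norm.ppf(1.0 - p_g3)
z_g4 = norm.ppf(1.0 - p_g4)

# on a common scale, difficulty d satisfies d = mu_g + sigma_g * z_g in
# each group; taking grade 4 as the reference (mu = 0, sigma = 1) and
# fitting z_g4 = sigma_3 * z_g3 + mu_3 recovers grade 3's parameters
sigma_3, mu_3 = np.polyfit(z_g3, z_g4, 1)
print(f"grade-3 sd on the grade-4 scale   ~ {sigma_3:.2f}")
print(f"grade-3 mean on the grade-4 scale ~ {mu_3:.2f}")
```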
Peer reviewed
Quereshi, M. Y. – Journal of Educational Measurement, 1971
The study investigated the degree to which errors of scaling and selection depress the linear relationship and whether the reduction in the magnitude of r differs with the type of error. Results indicated that various scaling errors caused considerable discrepancy in the measurement of underlying relations, but the effect of non-normality was…
Descriptors: Correlation, Error Patterns, Factor Analysis, Scaling
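A small simulation in the same spirit: starting from a known population correlation, coarsening one variable's scale and selecting on it both attenuate the observed r. The population r of .70, the 3-point coarsening, and the selection rule are illustrative choices, not the study's design.

```python
import numpy as np

rng = np.random.default_rng(3)
n, r = 100_000, 0.70
x = rng.normal(size=n)
y = r * x + np.sqrt(1 - r ** 2) * rng.normal(size=n)

def corr(a, b):
    return np.corrcoef(a, b)[0, 1]

x_coarse = np.digitize(x, [-0.5, 0.5])   # crush x to a 3-point scale
keep = x > 0.0                           # select on x (range restriction)
print(f"full scale, full range : r ~ {corr(x, y):.3f}")
print(f"3-point scale          : r ~ {corr(x_coarse, y):.3f}")
print(f"selected (x > 0) only  : r ~ {corr(x[keep], y[keep]):.3f}")
```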
Peer reviewed
Spray, Judith; Huang, Chi-Yu – Journal of Educational Measurement, 2000
Presents a method for combining multiple scale responses from job or task surveys based on a hierarchical rating scheme. Provides the rationale for placing the resulting ordinal information on an interval scale of measurement using the Rasch model. Also suggests a method for linking two or more surveys using the Rasch model and the BIGSTEPS…
Descriptors: Item Response Theory, Job Analysis, Responses, Scaling
Peer reviewed
Shaw, Dale G.; And Others – Journal of Educational Measurement, 1987
Information loss occurs when continuous data are grouped in discrete intervals. After calculating the squared correlation coefficients between continuous data and corresponding grouped data for four population distributions, the effects of population distribution, number of intervals, and interval width on information loss and recovery were…
Descriptors: Intervals, Rating Scales, Sampling, Scaling
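The quantity at issue is easy to reproduce: the squared correlation between continuous scores and their interval-grouped versions, tracked as the number of equal-width intervals grows. The normal and uniform populations below are stand-ins for the four distributions studied, which the abstract does not name.

```python
import numpy as np

rng = np.random.default_rng(4)
for name, x in [("normal", rng.normal(size=50_000)),
                ("uniform", rng.uniform(-3, 3, size=50_000))]:
    for k in (3, 5, 9, 15):
        edges = np.linspace(x.min(), x.max(), k + 1)
        mids = (edges[:-1] + edges[1:]) / 2
        # replace each value by the midpoint of its interval
        grouped = mids[np.clip(np.digitize(x, edges) - 1, 0, k - 1)]
        r2 = np.corrcoef(x, grouped)[0, 1] ** 2
        print(f"{name:7s} {k:2d} intervals: r^2 ~ {r2:.4f}")
```

As expected, r² climbs toward 1 as the number of intervals grows, with the rate depending on the population shape.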
Peer reviewed
Kolen, Michael J. – Journal of Educational Measurement, 1988
Linear and nonlinear methods for incorporating score precision information when the score scale is established for educational tests are compared. Examples illustrate the methods, which discourage overinterpretation of small score differences and enhance score interpretability by equalizing error variance along the score scale. Measurement error…
Descriptors: Error of Measurement, Measures (Individuals), Scaling, Scoring
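One nonlinear method of this kind (assumed here to be in the spirit of Kolen's approach, since the abstract does not name it) is the arcsine transformation: under a binomial error model it makes measurement-error variance roughly constant along the score scale, so equal scale-score differences carry similar precision. The 40-item test length is an assumed example.

```python
import numpy as np

rng = np.random.default_rng(5)
n_items = 40
for true_p in (0.3, 0.5, 0.7, 0.9):
    scores = rng.binomial(n_items, true_p, size=200_000)
    raw_sd = scores.std()
    arcsine_sd = np.arcsin(np.sqrt(scores / n_items)).std()
    print(f"true p = {true_p}: raw-score error sd = {raw_sd:5.2f}, "
          f"arcsine-scale error sd = {arcsine_sd:.3f}")
# raw-score error sd varies with p; the arcsine-scale sd stays near
# 1 / (2 * sqrt(n_items)) ~ 0.079 across the score range.
```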
Peer reviewed
Camilli, Gregory – Journal of Educational Measurement, 1999
Yen and Burket suggested that shrinkage in vertical equating cannot be understood apart from multidimensionality. Reviews research on reliability, multidimensionality, and scale shrinkage, and explores issues of practical importance to educators. (SLD)
Descriptors: Equated Scores, Error of Measurement, Item Response Theory, Reliability
Peer reviewed
Burket, George R. – Journal of Educational Measurement, 1987
This response to the Baglin paper (1986) points out the fallacy in inferring that inappropriate scaling procedures cause apparent discrepancies between medians and means and between means calculated using different units. (LMO)
Descriptors: Norm Referenced Tests, Scaling, Scoring, Statistical Distributions
Peer reviewed
Schulz, E. Matthew; Nicewander, W. Alan – Journal of Educational Measurement, 1997
The arbitrary nature of growth trends in cognitive variables is illustrated. Two metrics, grade equivalent and item-response theory representations, both of which preserve the order of performance levels in test data, produced different pictures of cognitive growth, and differences were seen to arise from differences in the scaling models. (SLD)
Descriptors: Cognitive Development, Comparative Analysis, Grade Equivalent Scores, Item Response Theory
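The arbitrariness is easy to demonstrate: any strictly increasing transformation preserves the order of performance levels, yet it can turn decelerating growth into steady growth. The theta means by grade and the grade-equivalent-like transform below are invented for illustration, not the article's data.

```python
import numpy as np

grades = np.arange(2, 9)
# invented theta means by grade: growth decelerates on the IRT metric
theta_mean = np.array([-1.2, -0.6, -0.15, 0.2, 0.45, 0.65, 0.8])
# a strictly increasing (order-preserving) transform to a GE-like metric
ge_like = 2.0 + 3.1 * (np.exp(theta_mean) - np.exp(theta_mean[0]))
print("grade   theta gain   GE-like gain")
for g, dt, dg in zip(grades[1:], np.diff(theta_mean), np.diff(ge_like)):
    print(f"{g:5d}   {dt:10.2f}   {dg:12.2f}")
# the theta metric shows decelerating growth, while the GE-like metric
# shows a steady gain of about one unit per grade: same data, same
# ordering, different growth story.
```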
Previous Page | Next Page »
Pages: 1  |  2  |  3