Showing all 12 results
Peer reviewed
Joo, Seang-Hwane; Lee, Philseok; Stark, Stephen – Journal of Educational Measurement, 2018
This research derived information functions and proposed new scalar information indices to examine the quality of multidimensional forced choice (MFC) items based on the RANK model. We also explored how GGUM-RANK information, latent trait recovery, and reliability varied across three MFC formats: pairs (two response alternatives), triplets (three…
Descriptors: Item Response Theory, Models, Item Analysis, Reliability
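The abstract above concerns information functions for multidimensional forced choice (MFC) items under the GGUM-RANK model. As a much simpler illustration of what an item information function is, the sketch below computes standard dichotomous 2PL item and test information, I(θ) = a²P(θ)(1 − P(θ)); it is not the MFC RANK derivation from the paper, and the (a, b) values are hypothetical.

```python
import numpy as np

def item_information_2pl(theta, a, b):
    """Fisher information of a 2PL item: I(theta) = a^2 * P * (1 - P)."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a**2 * p * (1.0 - p)

theta = np.linspace(-3, 3, 121)
items = [(1.2, -0.5), (0.8, 0.0), (1.5, 1.0)]   # hypothetical (a, b) pairs
# Test information is the sum of the item information functions.
test_info = sum(item_information_2pl(theta, a, b) for a, b in items)
print("peak test information:", test_info.max().round(3),
      "at theta =", theta[test_info.argmax()].round(2))
```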
Peer reviewed
Shu, Lianghua; Schwarz, Richard D. – Journal of Educational Measurement, 2014
As a global measure of precision, item response theory (IRT) estimated reliability is derived for four coefficients (Cronbach's α, Feldt-Raju, stratified α, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…
Descriptors: Item Response Theory, Reliability, Models, Computation
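One of the coefficients named in the abstract above is marginal reliability. As a hedged sketch (not the paper's derivation), it can be approximated by integrating the expected error variance 1/I(θ) over a standard normal trait distribution and subtracting it from the trait variance of 1; the 2PL item parameters below are hypothetical.

```python
import numpy as np

def test_information(theta, items):
    """Sum of 2PL item informations: I(theta) = a^2 * P * (1 - P)."""
    info = np.zeros_like(theta)
    for a, b in items:
        p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
        info += a**2 * p * (1 - p)
    return info

items = [(1.0, d) for d in np.linspace(-2, 2, 20)]   # hypothetical item parameters
theta = np.linspace(-4, 4, 801)
w = np.exp(-0.5 * theta**2)                           # N(0,1) weights on the grid
w /= w.sum()

expected_error_var = np.sum(w / test_information(theta, items))
marginal_reliability = 1.0 - expected_error_var       # trait variance fixed at 1
print(round(marginal_reliability, 3))
```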
Peer reviewed
Lee, Won-Chan – Journal of Educational Measurement, 2010
In this article, procedures are described for estimating single-administration classification consistency and accuracy indices for complex assessments using item response theory (IRT). This IRT approach was applied to real test data comprising dichotomous and polytomous items. Several different IRT model combinations were considered. Comparisons…
Descriptors: Classification, Item Response Theory, Comparative Analysis, Models
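The abstract above describes IRT-based single-administration classification consistency and accuracy indices. The sketch below is a simulation-based stand-in, not Lee's analytic procedure: simulate true abilities, generate two independent number-correct scores per examinee under a 2PL, then take accuracy as agreement with the true-score classification and consistency as agreement between the two replications. All item parameters and the cut score are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
a = rng.uniform(0.8, 1.6, 30)        # hypothetical 2PL discriminations
b = rng.uniform(-2.0, 2.0, 30)       # hypothetical difficulties
cut = 18                             # hypothetical number-correct cut score
theta = rng.normal(size=20000)

p = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b)))   # examinees x items
form1 = (rng.random(p.shape) < p).sum(axis=1)
form2 = (rng.random(p.shape) < p).sum(axis=1)

true_pass = p.sum(axis=1) >= cut          # classification of the expected true score
accuracy = np.mean((form1 >= cut) == true_pass)
consistency = np.mean((form1 >= cut) == (form2 >= cut))
print(f"accuracy ~ {accuracy:.3f}, consistency ~ {consistency:.3f}")
```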
Peer reviewed
Smith, Philip L. – Journal of Educational Measurement, 1981
This study explores a strategy for improving the stability of variance component estimates when only small samples are available, using a series of small, less complex generalizability (G) study designs as a surrogate for a single large design. (Author/BW)
Descriptors: Models, Reliability, Research Design, Sampling
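The abstract above is about stabilizing variance component estimates from small generalizability studies. As a sketch of the basic ingredient (not Smith's pooling strategy), these are the standard ANOVA estimators for a crossed persons × items design: σ̂²_p = (MS_p − MS_pi)/n_i, σ̂²_i = (MS_i − MS_pi)/n_p, σ̂²_pi = MS_pi. The data matrix is simulated.

```python
import numpy as np

def variance_components(X):
    """ANOVA estimators for a persons x items (p x i) G-study design."""
    n_p, n_i = X.shape
    grand = X.mean()
    ss_p = n_i * ((X.mean(axis=1) - grand) ** 2).sum()
    ss_i = n_p * ((X.mean(axis=0) - grand) ** 2).sum()
    ss_pi = ((X - grand) ** 2).sum() - ss_p - ss_i
    ms_p, ms_i = ss_p / (n_p - 1), ss_i / (n_i - 1)
    ms_pi = ss_pi / ((n_p - 1) * (n_i - 1))
    return {"p": (ms_p - ms_pi) / n_i, "i": (ms_i - ms_pi) / n_p, "pi": ms_pi}

rng = np.random.default_rng(1)
scores = rng.normal(size=(50, 10))        # hypothetical small-sample data matrix
print(variance_components(scores))
```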
Peer reviewed
Lee, Guemin – Journal of Educational Measurement, 2002
Studied the effects of items, passages, contents, themes, and types of passages on the reliability and standard errors of measurement for complex reading comprehension tests using seven different generalizability theory models. Results suggest that passages and themes should be taken into account when evaluating the reliability of test scores for…
Descriptors: Error of Measurement, Generalizability Theory, Models, Reading Comprehension
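Once variance components are in hand (as in the sketch after the Smith entry above), reliability-like coefficients follow directly. The sketch below assembles a generalizability coefficient Eρ² and an index of dependability Φ for a simple p × i design with hypothetical component values; the passage and theme facets discussed in the abstract would add components, but the arithmetic has the same shape.

```python
# Hypothetical variance components (persons, items, residual) and the number
# of items in the decision study; a sketch for a p x i design only.
var_p, var_i, var_pi = 0.30, 0.05, 0.45
n_items = 40

rel_error = var_pi / n_items                      # relative error variance
abs_error = var_i / n_items + var_pi / n_items    # absolute error variance
g_coefficient = var_p / (var_p + rel_error)       # E rho^2 (norm-referenced)
dependability = var_p / (var_p + abs_error)       # Phi (criterion-referenced)
print(round(g_coefficient, 3), round(dependability, 3))
```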
Peer reviewed
DeMars, Christine E. – Journal of Educational Measurement, 2006
Four item response theory (IRT) models were compared using data from tests where multiple items were grouped into testlets focused on a common stimulus. In the bi-factor model each item was treated as a function of a primary trait plus a nuisance trait due to the testlet; in the testlet-effects model the slopes in the direction of the testlet…
Descriptors: Item Response Theory, Reliability, Item Analysis, Factor Analysis
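The abstract above contrasts bi-factor and testlet-effects formulations for testlet-based items. As a sketch of the relationship it alludes to, a testlet-effects model can be written as a bi-factor model in which the testlet-specific slope is constrained to equal the item's primary slope; the parameter values below are hypothetical.

```python
import numpy as np

def p_bifactor(theta_g, theta_s, a_g, a_s, b):
    """Bi-factor 2PL: separate slopes on the general and testlet-specific traits."""
    return 1.0 / (1.0 + np.exp(-(a_g * theta_g + a_s * theta_s - b)))

def p_testlet(theta_g, gamma, a_g, b):
    """Testlet-effects model: the testlet effect gamma enters with the item's own
    slope, i.e. a bi-factor model constrained so that a_s equals a_g."""
    return p_bifactor(theta_g, gamma, a_g, a_g, b)

print(p_bifactor(0.5, -0.3, a_g=1.2, a_s=0.6, b=0.0))
print(p_testlet(0.5, -0.3, a_g=1.2, b=0.0))
```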
Peer reviewed
Hanson, Bradley A.; Brennan, Robert L. – Journal of Educational Measurement, 1990
Using several data sets, the relative performance of the beta binomial model and two more general strong true score models in estimating several indices of classification consistency is examined. It appears that the beta binomial model can provide inadequate fits to raw score distributions compared to more general models. (TJH)
Descriptors: Classification, Comparative Analysis, Equations (Mathematics), Estimation (Mathematics)
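The abstract above evaluates how well the beta-binomial strong true score model reproduces raw score distributions. As a sketch with hypothetical parameters (not the paper's data), the model-implied raw-score distribution is Pr(X = k) = C(n, k) B(k + α, n − k + β) / B(α, β), which can then be compared with observed frequencies to judge fit.

```python
from math import comb, lgamma, exp

def log_beta(a, b):
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def beta_binomial_pmf(k, n, alpha, beta):
    """Pr(X = k) = C(n, k) * B(k + alpha, n - k + beta) / B(alpha, beta)."""
    return comb(n, k) * exp(log_beta(k + alpha, n - k + beta) - log_beta(alpha, beta))

n_items, alpha, beta = 20, 6.0, 4.0          # hypothetical test length and parameters
fitted = [beta_binomial_pmf(k, n_items, alpha, beta) for k in range(n_items + 1)]
print(round(sum(fitted), 6))                  # pmf sums to 1
print([round(p, 3) for p in fitted[:5]])
```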
Peer reviewed
Sirotnik, Kenneth; Wellington, Roger – Journal of Educational Measurement, 1977
A single conceptual and theoretical framework for sampling any configuration of data from one or more population matrices is presented, integrating past designs and discussing implications for more general designs. The theory is based upon a generalization of the generalized symmetric mean approach for single matrix samples. (Author/CTM)
Descriptors: Analysis of Variance, Data Analysis, Item Sampling, Mathematical Models
Peer reviewed
Van der Linden, Wim J. – Journal of Educational Measurement, 1982
An ignored aspect of standard setting is explored: the possibility that Angoff or Nedelsky judges specify inconsistent probabilities (e.g., low probabilities for easy items but high probabilities for hard items). A latent trait method is proposed to estimate such misspecifications, and an index of consistency is defined. (Author/PN)
Descriptors: Cutting Scores, Latent Trait Theory, Mastery Tests, Mathematical Models
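The abstract above proposes a latent trait check on judged probabilities. The sketch below is an illustrative stand-in, not van der Linden's estimator: find the θ that best reproduces a judge's Angoff probabilities under a 2PL and treat the residual misfit as a rough consistency index. A judge whose probabilities rise with item difficulty, as in the hypothetical values below, produces a large misfit.

```python
import numpy as np

a = np.array([1.0, 1.0, 1.0, 1.0])
b = np.array([-1.5, -0.5, 0.5, 1.5])              # easy -> hard (hypothetical items)
judged = np.array([0.45, 0.55, 0.70, 0.80])       # judge's Angoff probabilities

theta_grid = np.linspace(-4, 4, 801)
p = 1.0 / (1.0 + np.exp(-a * (theta_grid[:, None] - b)))   # grid x items
misfit = ((p - judged) ** 2).sum(axis=1)
best = misfit.argmin()
print("best-fitting theta:", round(theta_grid[best], 2),
      "residual misfit index:", round(misfit[best], 4))
```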
Peer reviewed
Sykes, Robert C.; Fitzpatrick, Anne R. – Journal of Educational Measurement, 1992
Explanations for an observed change in Rasch item parameters ("b" values) from consecutive administrations of a professional licensing examination were investigated. Analysis of covariance indicated that the change was not related to item position or type. It is hypothesized that the change is attributable to shifts in curriculum…
Descriptors: Analysis of Covariance, Change, Curriculum, Higher Education
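The abstract above reports an analysis of covariance testing whether the change in Rasch b values relates to item position or item type. The sketch below runs that kind of ANCOVA on simulated data with hypothetical column names, using statsmodels; it is not the authors' analysis.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf
from statsmodels.stats.anova import anova_lm

rng = np.random.default_rng(2)
n_items = 120
df = pd.DataFrame({
    "b_change": rng.normal(0.0, 0.15, n_items),             # change in Rasch b values
    "position": rng.integers(1, 200, n_items),               # item position (covariate)
    "item_type": rng.choice(["recall", "application"], n_items),
})

# ANCOVA: does b_change depend on item type after adjusting for item position?
model = smf.ols("b_change ~ position + C(item_type)", data=df).fit()
print(anova_lm(model, typ=2))
```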
Peer reviewed
Kolen, Michael J.; And Others – Journal of Educational Measurement, 1992
A procedure is described for estimating the reliability and conditional standard errors of measurement of scale scores incorporating the discrete transformation of raw scores to scale scores. The method is illustrated using a strong true score model, and practical applications are described. (SLD)
Descriptors: College Entrance Examinations, Equations (Mathematics), Error of Measurement, Estimation (Mathematics)
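The abstract above describes estimating scale score reliability and conditional standard errors by pushing a conditional raw-score error distribution through the discrete raw-to-scale conversion. The sketch below uses a simple binomial error model in place of the paper's strong true score model, with a hypothetical conversion table; the conditional SEM of the scale score is the standard deviation of the converted scores given a true proportion correct.

```python
import numpy as np
from math import comb

n_items = 10
# Hypothetical raw-to-scale conversion table (index = raw score 0..n_items).
scale_of_raw = np.array([100, 105, 112, 120, 130, 142, 155, 168, 180, 190, 200])

def conditional_scale_sem(true_prop):
    """SD of the scale score given a true proportion correct, binomial error model."""
    k = np.arange(n_items + 1)
    pmf = np.array([comb(n_items, int(j)) * true_prop**j * (1 - true_prop)**(n_items - j)
                    for j in k])
    mean = np.sum(pmf * scale_of_raw)
    return np.sqrt(np.sum(pmf * (scale_of_raw - mean) ** 2))

for prop in (0.3, 0.5, 0.7, 0.9):
    print(prop, round(conditional_scale_sem(prop), 2))
```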
Peer reviewed
Wolfle, Lee M.; Robertshaw, Dianne – Journal of Educational Measurement, 1983
Racial differences in the reporting accuracy of parental status characteristics by White and Black high school seniors were investigated using Joreskog's general framework for simultaneous covariance structure analyses of multiple populations. Reliability estimates for Whites were significantly higher than for Blacks due to differences in true…
Descriptors: Academic Achievement, Black Students, Educational Research, Error of Measurement