Brossman, Bradley G.; Lee, Won-Chan – Applied Psychological Measurement, 2013
The purpose of this research was to develop observed score and true score equating procedures to be used in conjunction with the multidimensional item response theory (MIRT) framework. Three equating procedures--two observed score procedures and one true score procedure--were created and described in detail. One observed score procedure was…
Descriptors: Equated Scores, True Scores, Item Response Theory, Mathematics Tests
Jurich, Daniel P.; DeMars, Christine E.; Goodman, Joshua T. – Applied Psychological Measurement, 2012
The prevalence of high-stakes test scores as a basis for significant decisions necessitates the dissemination of accurate and fair scores. However, the magnitude of these decisions has created an environment in which examinees may be prone to resort to cheating. To reduce the risk of cheating, multiple test forms are commonly administered. When…
Descriptors: High Stakes Tests, Scores, Prevention, Cheating
Han, Kyung T. – Applied Psychological Measurement, 2009
This article provides a brief description of a Windows application called IRTEQ. IRTEQ employs an intuitive, user-friendly graphic user interface that can rescale one test form to another by using various item response theory (IRT) scaling methods. It supports various IRT models for test forms. It can also equate test scores on the scale of one…
Descriptors: Item Response Theory, Scaling, True Scores, Equated Scores
Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert; Vangeneugden, Tony; Mallinckrodt, Craig H. – Applied Psychological Measurement, 2010
Longitudinal studies are permeating clinical trials in psychiatry. Therefore, it is of utmost importance to study the psychometric properties of rating scales, frequently used in these trials, within a longitudinal framework. However, intrasubject serial correlation and memory effects are problematic issues often encountered in longitudinal data.…
Descriptors: Psychiatry, Rating Scales, Memory, Psychometrics
Hoshino, Takahiro; Shigemasu, Kazuo – Applied Psychological Measurement, 2008
The authors propose a concise formula to evaluate the standard error of the estimated latent variable score when the true values of the structural parameters are not known and must be estimated. The formula can be applied to factor scores in factor analysis or ability parameters in item response theory, without bootstrap or Markov chain Monte…
Descriptors: Monte Carlo Methods, Markov Processes, Factor Analysis, Computation
von Davier, Alina A.; Wilson, Christine – Applied Psychological Measurement, 2008
Dorans and Holland (2000) and von Davier, Holland, and Thayer (2003) introduced measures of the degree to which an observed-score equating function is sensitive to the population on which it is computed. This article extends the findings of Dorans and Holland and of von Davier et al. to item response theory (IRT) true-score equating methods that…
Descriptors: Advanced Placement, Advanced Placement Programs, Equated Scores, Calculus

Baker, Frank B. – Applied Psychological Measurement, 1997
Describes an idiosyncrasy of the MULTILOG (D. Thissen, 1991) parameter estimation process discovered during a simulation study involving the graded response model. A misordering reflected in boundary function location parameter estimates resulted in a large negative contribution to the true score followed by a large positive contribution. These…
Descriptors: Estimation (Mathematics), Simulation, True Scores

Komaroff, Eugene – Applied Psychological Measurement, 1997
Evaluated coefficient alpha through simulation under violations of two classical test theory assumptions: essential tau-equivalence and uncorrelated errors. Discusses the interactive effects of both violations with true and error scores. Provides empirical evidence of the derivation of M. Novick and C. Lewis (1993). (SLD)
Descriptors: Correlation, Reliability, Simulation, Test Theory
Biswas, Ajoy Kumar – Applied Psychological Measurement, 2006
This article studies the ordinal reliability of (total) test scores. This study is based on a classical-type linear model of observed score (X), true score (T), and random error (E). Based on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small-examinee populations is developed. This measure is extended to large…
Descriptors: True Scores, Test Theory, Test Reliability, Scores

Wilcox, Rand R. – Applied Psychological Measurement, 1979
Using a new coefficient, a rescaling of the Bayes risk is examined and a modification of this coefficient is described which yields an index that always has a value between zero and one. (Author/MH)
Descriptors: Bayesian Statistics, Measurement Techniques, Scoring, Technical Reports

Tisak, John; Tisak, Marie S. – Applied Psychological Measurement, 1996
Dynamic generalizations of reliability and validity that will incorporate longitudinal or developmental models, using latent curve analysis, are discussed. A latent curve model formulated to depict change is incorporated into the classical definitions of reliability and validity. The approach is illustrated with sociological and psychological…
Descriptors: Definitions, Development, Longitudinal Studies, Models

Baker, Frank B. – Applied Psychological Measurement, 1992
The procedure of M.L. Stocking and F.M. Lord (1983) for computing equating coefficients for tests having dichotomously scored items is extended to the case of graded response items. A system of equations for obtaining the equating coefficients under the graded response model is derived. (SLD)
Descriptors: Equated Scores, Equations (Mathematics), Item Response Theory, Mathematical Models
Lee, Won-Chan; Hanson, Bradley A.; Brennan, Robert L. – Applied Psychological Measurement, 2002
This article describes procedures for estimating various indices of classification consistency and accuracy for multiple category classifications using data from a single test administration. The estimates of the classification consistency and accuracy indices are compared under three different psychometric models: the two-parameter beta binomial,…
Descriptors: Classification, True Scores, Psychometrics, Item Response Theory

Vander Linden, Wim J.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 1978
A general coefficient for tests, delta, is derived from a decision theoretic point of view. The situations are considered in which a true score is estimated by a function of the observed score, observed scores are split into more than two categories, and observed scores are split into only two categories. (Author/CTM)
Descriptors: Criterion Referenced Tests, Decision Making, Mathematical Models, Raw Scores

Hanson, Bradley A. – Applied Psychological Measurement, 1991
Log-linear model bivariate smoothing and a bivariate smoothing model based on the four-parameter beta binomial model were compared for usefulness in frequency estimation common-item equipercentile equating using two datasets. The performance of smoothed equipercentile methods was also compared to that of linear methods of common-item equating.…
Descriptors: Comparative Analysis, Equated Scores, Equations (Mathematics), Estimation (Mathematics)