Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 2 |
Descriptor
Error of Measurement | 15 |
Item Response Theory | 15 |
Mathematical Models | 15 |
Estimation (Mathematics) | 7 |
Equations (Mathematics) | 5 |
Goodness of Fit | 4 |
Comparative Analysis | 3 |
Computer Simulation | 3 |
Item Bias | 3 |
Mathematics Tests | 3 |
Maximum Likelihood Statistics | 3 |
More ▼ |
Source
Educational and Psychological… | 2 |
Applied Psychological… | 1 |
Creativity Research Journal | 1 |
Journal of Educational… | 1 |
Psychometrika | 1 |
Author
Chang, Yu-Wen | 2 |
Davison, Mark L. | 2 |
Baldwin, Beatrice | 1 |
Berger, Martjin P. F. | 1 |
Brown, William L. | 1 |
Chen, Hsueh-Chih | 1 |
Chen, Po-Hsi | 1 |
De Ayala, R. J. | 1 |
Haertel, Edward H. | 1 |
Hung, Su-Pin | 1 |
Linacre, John M. | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 9 |
Speeches/Meeting Papers | 7 |
Journal Articles | 6 |
Reports - Research | 4 |
Opinion Papers | 2 |
Education Level
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Location
Taiwan (Taipei) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Woodcock Johnson Psycho… | 1 |
What Works Clearinghouse Rating
Hung, Su-Pin; Chen, Po-Hsi; Chen, Hsueh-Chih – Creativity Research Journal, 2012
Product assessment is widely applied in creative studies, typically as an important dependent measure. Within this context, this study had 2 purposes. First, the focus of this research was on methods for investigating possible rater effects, an issue that has not received a great deal of attention in past creativity studies. Second, the…
Descriptors: Item Response Theory, Creativity, Interrater Reliability, Undergraduate Students
Liu, Yan; Zumbo, Bruno D. – Educational and Psychological Measurement, 2007
The impact of outliers on Cronbach's coefficient [alpha] has not been documented in the psychometric or statistical literature. This is an important gap because coefficient [alpha] is the most widely used measurement statistic in all of the social, educational, and health sciences. The impact of outliers on coefficient [alpha] is investigated for…
Descriptors: Psychometrics, Computation, Reliability, Monte Carlo Methods

Reiser, Mark – Psychometrika, 1996
Using the item response model as developed on the multinomial distribution, asymptotic variances are obtained for residuals with response patterns and first- and second-order marginal frequencies of manifest variables. A limited-information test of fit is developed by using residuals defined for the first- and second-order marginals. (Author/SLD)
Descriptors: Error of Measurement, Factor Analysis, Goodness of Fit, Item Response Theory
Baldwin, Beatrice; Lomax, Richard – 1990
This LISREL study examines the robustness of the maximum likelihood estimates under varying degrees of measurement model misspecification. A true model containing five latent variables (two endogenous and three exogenous) and two indicator variables per latent variable was used. Measurement model misspecification considered included errors of…
Descriptors: Computer Software, Error of Measurement, Item Response Theory, Mathematical Models

Zwick, Rebecca – Journal of Educational Statistics, 1990
Use of the Mantel-Haenszel procedure as a test for differential item functioning under the Rasch model of item-response theory is examined. Results of the procedure cannot be generalized to the class of items for which item-response functions are monotonic and local independence holds. (TJH)
Descriptors: Demography, Equations (Mathematics), Error of Measurement, Item Bias

Berger, Martjin P. F. – Applied Psychological Measurement, 1991
A generalized variance criterion is proposed to measure efficiency in item-response-theory (IRT) models. Heuristic arguments are given to formulate the efficiency of a design in terms of an asymptotic generalized variance criterion. Efficiencies of designs for one-, two-, and three-parameter models are compared. (SLD)
Descriptors: Comparative Analysis, Efficiency, Equations (Mathematics), Error of Measurement
Chang, Yu-Wen; Davison, Mark L. – 1992
Standard errors and bias of unidimensional and multidimensional ability estimates were compared in a factorial, simulation design with two item response theory (IRT) approaches, two levels of test correlation (0.42 and 0.63), two sample sizes (500 and 1,000), and a hierarchical test content structure. Bias and standard errors of subtest scores…
Descriptors: Comparative Testing, Computer Simulation, Correlation, Error of Measurement
Samejima, Fumiko – 1990
Because the test information function and its two modified formulas provide useful information, the reliability coefficient of a test is no longer necessary in modern mental test theory. Yet it is interesting to know how to predict the coefficient using the test information function and its modifications, tailored for each separate population of…
Descriptors: Ability Identification, Elementary Secondary Education, Equations (Mathematics), Error of Measurement
Linacre, John M. – 1990
Rank ordering examinees is an easier task for judges than is awarding numerical ratings. A measurement model for rankings based on Rasch's objectivity axioms provides linear, sample-independent and judge-independent measures. Estimates of examinee measures are obtained from the data set of rankings, along with standard errors and fit statistics.…
Descriptors: Comparative Analysis, Error of Measurement, Essay Tests, Evaluators
Scheuneman, Janice Dowd – 1990
The current status of item response theory (IRT) is discussed. Several IRT methods exist for assessing whether an item is biased. Focus is on methods proposed by L. M. Rudner (1975), F. M. Lord (1977), D. Thissen et al. (1988) and R. L. Linn and D. Harnisch (1981). Rudner suggested a measure of the area lying between the two item characteristic…
Descriptors: Chi Square, Error of Measurement, Estimation (Mathematics), Goodness of Fit

De Ayala, R. J. – Educational and Psychological Measurement, 1992
Effects of dimensionality on ability estimation of an adaptive test were examined using generated data in Bayesian computerized adaptive testing (CAT) simulations. Generally, increasing interdimensional difficulty association produced a slight decrease in test length and an increase in accuracy of ability estimation as assessed by root mean square…
Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Computer Simulation
Haertel, Edward H. – 1992
Classical test theory, item response theory, and generalizability theory all treat the abilities to be measured as continuous variables, and the items of a test as independent probes of underlying continua. These models are well-suited to measuring the broad, diffuse traits of traditional differential psychology, but not for measuring the outcomes…
Descriptors: Ability, Data Analysis, Error of Measurement, Generalizability Theory
Davison, Mark L.; Chang, Yu-Wen – 1992
A two-dimensional, compensatory item response model and a unidimensional model were fitted to the reading and mathematics items in the Woodcock-Johnson Psycho-Educational Battery-Revised for a sample of 1,000 adults aged 20-39 years. Multidimensional information theory predicts that if the unidimensional abilities can be represented as vectors in…
Descriptors: Achievement Tests, Adults, Equations (Mathematics), Error of Measurement
Miller, Timothy R. – 1991
Two studies were carried out to evaluate the quality of multidimensional item response theory (MIRT) model parameter estimates obtained from the computer program NOHARM. The purpose of the first study was to compute empirical estimates of the standard errors of the parameters. In addition, the parameter estimates were evaluated for bias and the…
Descriptors: College Entrance Examinations, Comparative Analysis, Computer Simulation, Equations (Mathematics)
Brown, William L. – 1992
The partial credit model of G. N. Masters (1982), a one-parameter unidimensional polychotomous Rasch model, was used to reduce the error of measurement, particularly for students near the cut score, and to permit measurement to reflect the actual ability of a student more accurately by reducing the degree of misfit for students near the cut…
Descriptors: Ability, Computer Assisted Testing, Cutting Scores, Error of Measurement