Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 5 |
Descriptor
Test Length | 10 |
Item Response Theory | 5 |
Test Reliability | 4 |
Error of Measurement | 3 |
Comparative Analysis | 2 |
Computation | 2 |
Foreign Countries | 2 |
Identification | 2 |
Intervals | 2 |
Mastery Tests | 2 |
Maximum Likelihood Statistics | 2 |
More ▼ |
Source
Psychometrika | 10 |
Author
Allison, Paul A. | 1 |
Chang, Yuan-chin Ivan | 1 |
Chiu, Chia-Yi | 1 |
Doebler, Anna | 1 |
Doebler, Philipp | 1 |
Douglas, Jeffrey A. | 1 |
Eggen, Theo J. H. M. | 1 |
Holling, Heinz | 1 |
Huynh, Huynh | 1 |
Kim, Seock-Ho | 1 |
Kristof, Walter | 1 |
More ▼ |
Publication Type
Journal Articles | 8 |
Reports - Research | 3 |
Reports - Evaluative | 2 |
Opinion Papers | 1 |
Reports - Descriptive | 1 |
Education Level
Higher Education | 1 |
Audience
Location
Germany | 1 |
Netherlands | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Doebler, Anna; Doebler, Philipp; Holling, Heinz – Psychometrika, 2013
The common way to calculate confidence intervals for item response theory models is to assume that the standardized maximum likelihood estimator for the person parameter [theta] is normally distributed. However, this approximation is often inadequate for short and medium test lengths. As a result, the coverage probabilities fall below the given…
Descriptors: Foreign Countries, Item Response Theory, Computation, Hypothesis Testing
Sijtsma, Klaas – Psychometrika, 2012
I address two issues that were inspired by my work on the Dutch Committee on Tests and Testing (COTAN). The first issue is the understanding of problems test constructors and researchers using tests have of psychometric knowledge. I argue that this understanding is important for a field, like psychometrics, for which the dissemination of…
Descriptors: Foreign Countries, Psychometrics, Knowledge Level, Test Construction
Yao, Lihua – Psychometrika, 2012
Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…
Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing
Chiu, Chia-Yi; Douglas, Jeffrey A.; Li, Xiaodong – Psychometrika, 2009
Latent class models for cognitive diagnosis often begin with specification of a matrix that indicates which attributes or skills are needed for each item. Then by imposing restrictions that take this into account, along with a theory governing how subjects interact with items, parametric formulations of item response functions are derived and…
Descriptors: Test Length, Identification, Multivariate Analysis, Item Response Theory
Chang, Yuan-chin Ivan – Psychometrika, 2005
In this paper, we apply sequential one-sided confidence interval estimation procedures with beta-protection to adaptive mastery testing. The procedures of fixed-width and fixed proportional accuracy confidence interval estimation can be viewed as extensions of one-sided confidence interval procedures. It can be shown that the adaptive mastery…
Descriptors: Mastery Tests, Probability, Intervals, Testing

Huynh, Huynh – Psychometrika, 1978
The use of Cohen's kappa index as a measure of the reliability of multiple classifications is developed. Special cases of the index as well as the effects of test length on the index are also explored. (JKS)
Descriptors: Career Development, Classification, Mastery Tests, Test Length

Allison, Paul A. – Psychometrika, 1976
A direct proof is given for the generalized Spearman-Brown formula for any real multiple of test length. (Author)
Descriptors: Correlation, Error of Measurement, Raw Scores, Test Length
Eggen, Theo J. H. M.; Verelst, Norman D. – Psychometrika, 2006
In this paper, the efficiency of conditional maximum likelihood (CML) and marginal maximum likelihood (MML) estimation of the item parameters of the Rasch model in incomplete designs is investigated. The use of the concept of F-information (Eggen, 2000) is generalized to incomplete testing designs. The scaled determinant of the F-information…
Descriptors: Test Length, Computation, Maximum Likelihood Statistics, Models

Kim, Seock-Ho; And Others – Psychometrika, 1994
Hierarchical Bayes procedures for the two-parameter logistic item response model were compared for estimating item and ability parameters through two joint and two marginal Bayesian procedures. Marginal procedures yielded smaller root mean square differences for item and ability, but results for larger sample size and test length were similar.…
Descriptors: Ability, Bayesian Statistics, Computer Simulation, Estimation (Mathematics)

Kristof, Walter – Psychometrika, 1971
Descriptors: Cognitive Measurement, Error of Measurement, Mathematical Models, Psychological Testing