Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 12 |
Descriptor
Models | 12 |
Evaluation Methods | 5 |
Psychometrics | 5 |
Comparative Analysis | 4 |
Educational Assessment | 4 |
Global Approach | 4 |
Computation | 3 |
Computer Software | 3 |
Foreign Countries | 3 |
Item Response Theory | 3 |
Scores | 3 |
More ▼ |
Source
International Journal of… | 15 |
Author
Bartram, Dave | 2 |
Beland, Sebastien | 1 |
Buckendahl, Chad W. | 1 |
Byrne, Barbara M. | 1 |
Cheong, Yuk Fai | 1 |
Chiu, Chia-Yi | 1 |
DeMars, Christine E. | 1 |
Evers, Arne | 1 |
Foster, Jeff L. | 1 |
Geranpayeh, Ardeshir | 1 |
Gerard, Paul | 1 |
More ▼ |
Publication Type
Journal Articles | 15 |
Reports - Descriptive | 15 |
Guides - Non-Classroom | 2 |
Education Level
Higher Education | 3 |
Postsecondary Education | 2 |
Adult Education | 1 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 4 | 1 |
Intermediate Grades | 1 |
Audience
Practitioners | 1 |
Researchers | 1 |
Location
Canada | 2 |
United Kingdom | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
International English… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Chiu, Chia-Yi; Köhn, Hans-Friedrich; Wu, Huey-Min – International Journal of Testing, 2016
The Reduced Reparameterized Unified Model (Reduced RUM) is a diagnostic classification model for educational assessment that has received considerable attention among psychometricians. However, the computational options for researchers and practitioners who wish to use the Reduced RUM in their work, but do not feel comfortable writing their own…
Descriptors: Educational Diagnosis, Classification, Models, Educational Assessment
Lim, Gad S.; Geranpayeh, Ardeshir; Khalifa, Hanan; Buckendahl, Chad W. – International Journal of Testing, 2013
Standard setting theory has largely developed with reference to a typical situation, determining a level or levels of performance for one exam for one context. However, standard setting is now being used with international reference frameworks, where some parameters and assumptions of classical standard setting do not hold. We consider the…
Descriptors: Standard Setting (Scoring), Validity, Models, Language Tests
DeMars, Christine E. – International Journal of Testing, 2013
This tutorial addresses possible sources of confusion in interpreting trait scores from the bifactor model. The bifactor model may be used when subscores are desired, either for formative feedback on an achievement test or for theoretically different constructs on a psychological test. The bifactor model is often chosen because it requires fewer…
Descriptors: Test Interpretation, Scores, Models, Correlation
Lindley, Patricia A.; Bartram, Dave – International Journal of Testing, 2012
In this article, we present the background to the development of test reviewing by the British Psychological Society (BPS) in the United Kingdom. We also describe the role played by the BPS in the development of the EFPA test review model and its adaptation for use in test reviewing in the United Kingdom. We conclude with a discussion of lessons…
Descriptors: Test Reviews, Professional Associations, Psychology, Global Approach
Gierl, Mark J.; Lai, Hollis – International Journal of Testing, 2012
Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…
Descriptors: Foreign Countries, Psychometrics, Test Construction, Test Items
Magis, David; Raiche, Gilles; Beland, Sebastien; Gerard, Paul – International Journal of Testing, 2011
We present an extension of the logistic regression procedure to identify dichotomous differential item functioning (DIF) in the presence of more than two groups of respondents. Starting from the usual framework of a single focal group, we propose a general approach to estimate the item response functions in each group and to test for the presence…
Descriptors: Language Skills, Identification, Foreign Countries, Evaluation Methods
Evers, Arne – International Journal of Testing, 2012
In this article, the characteristics of five test review models are described. The five models are the US review system at the Buros Center for Testing, the German Test Review System of the Committee on Tests, the Brazilian System for the Evaluation of Psychological Tests, the European EFPA Review Model, and the Dutch COTAN Evaluation System for…
Descriptors: Program Evaluation, Test Reviews, Trend Analysis, International Education
Rupp, Andre A. – International Journal of Testing, 2007
One of the most revolutionary advances in psychometric research during the last decades has been the systematic development of statistical models that allow for cognitive psychometric research (CPR) to be conducted. Many of the models currently available for such purposes are extensions of basic latent variable models in item response theory…
Descriptors: Psychometrics, Research, Models, Item Response Theory

Zimmerman, Donald W.; Zumbo, Bruno D. – International Journal of Testing, 2001
Presents a model of tests and measurement that identifies test scores with Hilbert space vectors and true and error components of scores with linear operators. This geometric point of view brings to light relations among elementary concepts in test theory, including reliability, validity, and parallel tests. (Author/SLD)
Descriptors: Models, Probability, Reliability, Scores
Bartram, Dave – International Journal of Testing, 2006
The Internet has opened up a whole new set of opportunities for advancing the science of psychometrics and the technology of testing. It has also created some new challenges for those of us involved in test design and testing. In particular, we are seeing impacts from internationalization of testing and new models for test delivery. These are…
Descriptors: Internet, Testing, Computer Security, Confidentiality

Ruhe, Valerie – International Journal of Testing, 2002
Demonstrates how the framework provided by S. Messick (1988) provides a set of lenses with which to explore issues in the validation of small-scale assessments in new technology-mediated environments. In technology-based distributed learning, the conception of validity will not change, but validation practice will be different. (SLD)
Descriptors: Distance Education, Educational Assessment, Educational Technology, Models
Raykov, Tenko; Marcoulides, George A. – International Journal of Testing, 2006
A structural equation modeling approach to scale reliability evaluation can be employed to estimate generalizability theory indexes in settings where sampling of subjects and conditions is carried out. In one- and two-facet crossed designs, it is demonstrated how this method can be used to obtain estimates of relative generalizability…
Descriptors: Computation, Generalizability Theory, Structural Equation Models, Reliability

Byrne, Barbara M. – International Journal of Testing, 2001
Uses a confirmatory factor analytic (CFA) model as a paradigmatic basis for the comparison of three widely used structural equation modeling computer programs: (1) AMOS 4.0; (2) EQS 6; and (3) LISREL 8. Comparisons focus on aspects of programs that bear on the specification and testing of CFA models and the treatment of incomplete, nonnormally…
Descriptors: Comparative Analysis, Computer Software, Data Analysis, Statistical Distributions
Cheong, Yuk Fai – International Journal of Testing, 2006
This article considers and illustrates a strategy to study effects of school context on differential item functioning (DIF) in large-scale assessment. The approach employs a hierarchical generalized linear modeling framework to (a) detect DIF, and (b) identify school-level correlates of the between-group differences in item performance. To…
Descriptors: Context Effect, Test Bias, Causal Models, Educational Assessment
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources