ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	12

Source

International Journal of…

Publication Type

Journal Articles	15
Reports - Descriptive	15
Guides - Non-Classroom	2

Education Level

Higher Education	3
Postsecondary Education	2
Adult Education	1
Elementary Education	1
Elementary Secondary Education	1
Grade 4	1
Intermediate Grades	1

Audience

Practitioners	1
Researchers	1

Location

Canada	2
United Kingdom	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

International English…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Fitting the Reduced RUM with Mplus: A Tutorial

Peer reviewed

Direct link

Chiu, Chia-Yi; Köhn, Hans-Friedrich; Wu, Huey-Min – International Journal of Testing, 2016

The Reduced Reparameterized Unified Model (Reduced RUM) is a diagnostic classification model for educational assessment that has received considerable attention among psychometricians. However, the computational options for researchers and practitioners who wish to use the Reduced RUM in their work, but do not feel comfortable writing their own…

Descriptors: Educational Diagnosis, Classification, Models, Educational Assessment

Standard Setting to an International Reference Framework: Implications for Theory and Practice

Peer reviewed

Direct link

Lim, Gad S.; Geranpayeh, Ardeshir; Khalifa, Hanan; Buckendahl, Chad W. – International Journal of Testing, 2013

Standard setting theory has largely developed with reference to a typical situation, determining a level or levels of performance for one exam for one context. However, standard setting is now being used with international reference frameworks, where some parameters and assumptions of classical standard setting do not hold. We consider the…

Descriptors: Standard Setting (Scoring), Validity, Models, Language Tests

A Tutorial on Interpreting Bifactor Model Scores

Peer reviewed

Direct link

DeMars, Christine E. – International Journal of Testing, 2013

This tutorial addresses possible sources of confusion in interpreting trait scores from the bifactor model. The bifactor model may be used when subscores are desired, either for formative feedback on an achievement test or for theoretically different constructs on a psychological test. The bifactor model is often chosen because it requires fewer…

Descriptors: Test Interpretation, Scores, Models, Correlation

Use of the EFPA Test Review Model by the UK and Issues Relating to the Internationalization of Test Standards

Peer reviewed

Direct link

Lindley, Patricia A.; Bartram, Dave – International Journal of Testing, 2012

In this article, we present the background to the development of test reviewing by the British Psychological Society (BPS) in the United Kingdom. We also describe the role played by the BPS in the development of the EFPA test review model and its adaptation for use in test reviewing in the United Kingdom. We conclude with a discussion of lessons…

Descriptors: Test Reviews, Professional Associations, Psychology, Global Approach

The Role of Item Models in Automatic Item Generation

Peer reviewed

Direct link

Gierl, Mark J.; Lai, Hollis – International Journal of Testing, 2012

Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…

Descriptors: Foreign Countries, Psychometrics, Test Construction, Test Items

A Generalized Logistic Regression Procedure to Detect Differential Item Functioning among Multiple Groups

Peer reviewed

Direct link

Magis, David; Raiche, Gilles; Beland, Sebastien; Gerard, Paul – International Journal of Testing, 2011

We present an extension of the logistic regression procedure to identify dichotomous differential item functioning (DIF) in the presence of more than two groups of respondents. Starting from the usual framework of a single focal group, we propose a general approach to estimate the item response functions in each group and to test for the presence…

Descriptors: Language Skills, Identification, Foreign Countries, Evaluation Methods

The Internationalization of Test Reviewing: Trends, Differences, and Results

Peer reviewed

Direct link

Evers, Arne – International Journal of Testing, 2012

In this article, the characteristics of five test review models are described. The five models are the US review system at the Buros Center for Testing, the German Test Review System of the Committee on Tests, the Brazilian System for the Evaluation of Psychological Tests, the European EFPA Review Model, and the Dutch COTAN Evaluation System for…

Descriptors: Program Evaluation, Test Reviews, Trend Analysis, International Education

The Answer Is in the Question: A Guide for Describing and Investigating the Conceptual Foundations and Statistical Properties of Cognitive Psychometric Models

Peer reviewed

Direct link

Rupp, Andre A. – International Journal of Testing, 2007

One of the most revolutionary advances in psychometric research during the last decades has been the systematic development of statistical models that allow for cognitive psychometric research (CPR) to be conducted. Many of the models currently available for such purposes are extensions of basic latent variable models in item response theory…

Descriptors: Psychometrics, Research, Models, Item Response Theory

The Geometry of Probability, Statistics, and Test Theory.

Peer reviewed

Zimmerman, Donald W.; Zumbo, Bruno D. – International Journal of Testing, 2001

Presents a model of tests and measurement that identifies test scores with Hilbert space vectors and true and error components of scores with linear operators. This geometric point of view brings to light relations among elementary concepts in test theory, including reliability, validity, and parallel tests. (Author/SLD)

Descriptors: Models, Probability, Reliability, Scores

The Internationalization of Testing and New Models of Test Delivery on the Internet

Peer reviewed

Direct link

Bartram, Dave – International Journal of Testing, 2006

The Internet has opened up a whole new set of opportunities for advancing the science of psychometrics and the technology of testing. It has also created some new challenges for those of us involved in test design and testing. In particular, we are seeing impacts from internationalization of testing and new models for test delivery. These are…

Descriptors: Internet, Testing, Computer Security, Confidentiality

Issues in the Validation of Assessment in Technology-Based Distance and Distributed Learning: What Can We Learn from Messick's Framework?

Peer reviewed

Ruhe, Valerie – International Journal of Testing, 2002

Demonstrates how the framework provided by S. Messick (1988) provides a set of lenses with which to explore issues in the validation of small-scale assessments in new technology-mediated environments. In technology-based distributed learning, the conception of validity will not change, but validation practice will be different. (SLD)

Descriptors: Distance Education, Educational Assessment, Educational Technology, Models

Estimation of Generalizability Coefficients via a Structural Equation Modeling Approach to Scale Reliability Evaluation

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – International Journal of Testing, 2006

A structural equation modeling approach to scale reliability evaluation can be employed to estimate generalizability theory indexes in settings where sampling of subjects and conditions is carried out. In one- and two-facet crossed designs, it is demonstrated how this method can be used to obtain estimates of relative generalizability…

Descriptors: Computation, Generalizability Theory, Structural Equation Models, Reliability

Structural Equation Modeling with AMOS, EQS, and LISREL: Comparative Approaches to Testing for the Factorial Validity of a Measuring Instrument.

Peer reviewed

Byrne, Barbara M. – International Journal of Testing, 2001

Uses a confirmatory factor analytic (CFA) model as a paradigmatic basis for the comparison of three widely used structural equation modeling computer programs: (1) AMOS 4.0; (2) EQS 6; and (3) LISREL 8. Comparisons focus on aspects of programs that bear on the specification and testing of CFA models and the treatment of incomplete, nonnormally…

Descriptors: Comparative Analysis, Computer Software, Data Analysis, Statistical Distributions

Analysis of School Context Effects on Differential Item Functioning Using Hierarchical Generalized Linear Models

Peer reviewed

Direct link

Cheong, Yuk Fai – International Journal of Testing, 2006

This article considers and illustrates a strategy to study effects of school context on differential item functioning (DIF) in large-scale assessment. The approach employs a hierarchical generalized linear modeling framework to (a) detect DIF, and (b) identify school-level correlates of the between-group differences in item performance. To…

Descriptors: Context Effect, Test Bias, Causal Models, Educational Assessment

Considerations for Creating Multi-Language Personality Norms: A Three-Component Model of Error

Peer reviewed

Direct link

Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008

With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…

Descriptors: Global Approach, Cultural Differences, Norms, Human Resources

Models	12
Evaluation Methods	5
Psychometrics	5
Comparative Analysis	4
Educational Assessment	4
Global Approach	4
Computation	3
Computer Software	3
Foreign Countries	3
Item Response Theory	3
Scores	3
Validity	3
Classification	2
College Students	2
English (Second Language)	2
Evaluation Criteria	2
Generalizability Theory	2
Internet	2
Reliability	2
Sampling	2
Statistical Analysis	2
Structural Equation Models	2
Test Bias	2
Test Construction	2
Test Interpretation	2
More ▼

Bartram, Dave	2
Beland, Sebastien	1
Buckendahl, Chad W.	1
Byrne, Barbara M.	1
Cheong, Yuk Fai	1
Chiu, Chia-Yi	1
DeMars, Christine E.	1
Evers, Arne	1
Foster, Jeff L.	1
Geranpayeh, Ardeshir	1
Gerard, Paul	1
Gierl, Mark J.	1
Khalifa, Hanan	1
Köhn, Hans-Friedrich	1
Lai, Hollis	1
Lim, Gad S.	1
Lindley, Patricia A.	1
Magis, David	1
Marcoulides, George A.	1
Meyer, Kevin D.	1
Raiche, Gilles	1
Raykov, Tenko	1
Ruhe, Valerie	1
Rupp, Andre A.	1
Wu, Huey-Min	1
More ▼