Publication Date
In 2025 | 2 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 14 |
Descriptor
Comparative Analysis | 15 |
Evaluation Methods | 15 |
Models | 7 |
Psychometrics | 7 |
Cultural Differences | 6 |
Foreign Countries | 6 |
Global Approach | 6 |
Measurement | 6 |
Business Administration | 5 |
Cultural Context | 5 |
Evaluation Research | 5 |
More ▼ |
Source
International Journal of… | 15 |
Author
Badham, Louise | 1 |
Bank, Jurgen | 1 |
Bartram, Dave | 1 |
Beland, Sebastien | 1 |
Ercikan, Kadriye | 1 |
Evers, Arne | 1 |
Foster, Jeff L. | 1 |
Furlong, Antony | 1 |
Gerard, Paul | 1 |
Glas, Cees A. W. | 1 |
Guo, Xiuyan | 1 |
More ▼ |
Publication Type
Journal Articles | 15 |
Reports - Research | 10 |
Reports - Descriptive | 4 |
Reports - Evaluative | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 2 |
Secondary Education | 2 |
Adult Education | 1 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 4 | 1 |
High Schools | 1 |
Audience
Location
Canada | 2 |
United States | 2 |
Australia | 1 |
China | 1 |
Colombia | 1 |
Denmark | 1 |
Germany | 1 |
Poland | 1 |
South Africa | 1 |
Sweden | 1 |
United Kingdom | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 2 |
Progress in International… | 1 |
What Works Clearinghouse Rating
Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025
This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…
Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis
Badham, Louise; Furlong, Antony – International Journal of Testing, 2023
Multilingual summative assessments face significant challenges due to tensions that exist between multiple language provision and comparability. Yet, conventional approaches for investigating comparability in multilingual assessments fail to accommodate assessments that comprise extended responses that target complex constructs. This article…
Descriptors: Summative Evaluation, Multilingualism, Comparative Analysis, Literature
Maritza Casas; Stephen G. Sireci – International Journal of Testing, 2025
In this study, we take a critical look at the degree to which the measurement of bullying and sense of belonging at school is invariant across groups of students defined by immigrant status. Our study focuses on the invariance of these constructs as measured on a recent PISA administration and includes a discussion of two statistical methods for…
Descriptors: Error of Measurement, Immigrants, Peer Groups, Bullying
Guo, Xiuyan; Lei, Pui-Wa – International Journal of Testing, 2020
Little research has been done on the effects of peer raters' quality characteristics on peer rating qualities. This study aims to address this gap and investigate the effects of key variables related to peer raters' qualities, including content knowledge, previous rating experience, training on rating tasks, and rating motivation. In an experiment…
Descriptors: Peer Evaluation, Error Patterns, Correlation, Knowledge Level
Magis, David; Raiche, Gilles; Beland, Sebastien; Gerard, Paul – International Journal of Testing, 2011
We present an extension of the logistic regression procedure to identify dichotomous differential item functioning (DIF) in the presence of more than two groups of respondents. Starting from the usual framework of a single focal group, we propose a general approach to estimate the item response functions in each group and to test for the presence…
Descriptors: Language Skills, Identification, Foreign Countries, Evaluation Methods
Sandilands, Debra; Oliveri, Maria Elena; Zumbo, Bruno D.; Ercikan, Kadriye – International Journal of Testing, 2013
International large-scale assessments of achievement often have a large degree of differential item functioning (DIF) between countries, which can threaten score equivalence and reduce the validity of inferences based on comparisons of group performances. It is important to understand potential sources of DIF to improve the validity of future…
Descriptors: Validity, Measures (Individuals), International Studies, Foreign Countries
Makransky, Guido; Glas, Cees A. W. – International Journal of Testing, 2013
Cognitive ability tests are widely used in organizations around the world because they have high predictive validity in selection contexts. Although these tests typically measure several subdomains, testing is usually carried out for a single subdomain at a time. This can be ineffective when the subdomains assessed are highly correlated. This…
Descriptors: Foreign Countries, Cognitive Ability, Adaptive Testing, Feedback (Response)
Evers, Arne – International Journal of Testing, 2012
In this article, the characteristics of five test review models are described. The five models are the US review system at the Buros Center for Testing, the German Test Review System of the Committee on Tests, the Brazilian System for the Evaluation of Psychological Tests, the European EFPA Review Model, and the Dutch COTAN Evaluation System for…
Descriptors: Program Evaluation, Test Reviews, Trend Analysis, International Education
Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009
Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…
Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment
Sese, Albert; Palmer, Alfonso L.; Montano, Juan J. – International Journal of Testing, 2004
The study of measurement models in psychometrics by means of dimensionality reduction techniques such as Principal Components Analysis (PCA) is a very common practice. In recent times, an upsurge of interest in the study of artificial neural networks apt to computing a principal component extraction has been observed. Despite this interest, the…
Descriptors: Psychometrics, Computer Simulation, Models, Comparative Analysis
Bartram, Dave – International Journal of Testing, 2008
The article discusses issues relating to the international use of personality inventories, especially those in which organizations make comparisons between people from differing cultures or countries or those with different languages. The focus is on the issue of norming and the use of national versus multinational norms. It is noted that…
Descriptors: Guidelines, Norms, Cultural Differences, Global Approach
Hedricks, Cynthia A.; Robie, Chet; Harnisher, John V. – International Journal of Testing, 2008
Personality scores were used to construct three databases of global norms. The composition of the three databases varied according to percentage of cases by global region, occupational group, applicant status, and gender of the job candidate. Comparison of personality scores across the three norms databases revealed that the magnitude of the…
Descriptors: Norms, Databases, Case Studies, Cultural Differences
Kabacoff, Robert I. – International Journal of Testing, 2008
With the growth of globalization, organizations are facing an increased challenge to manage talent, improve employee engagement, develop effective teams, create succession pipelines, and increase organizational effectiveness in multinational settings. Given the need to evaluate and compare individuals from more than one country using standardized…
Descriptors: Motivation, Local Norms, Foreign Countries, Organizational Development
Ramesh, Anuradha; Hazucha, Joy F.; Bank, Jurgen – International Journal of Testing, 2008
A major challenge that decisions makers face in multi-national organizations is how to compare managers from different parts of the globe. This challenge is both psychometric and practical. We draw on the cross-cultural psychology literature to propose a three-step framework to compare personality data from different countries. The first step…
Descriptors: Personality, Norms, Psychometrics, International Organizations
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources