Maritza Casas; Stephen G. Sireci – International Journal of Testing, 2025
In this study, we take a critical look at the degree to which the measurement of bullying and sense of belonging at school is invariant across groups of students defined by immigrant status. Our study focuses on the invariance of these constructs as measured on a recent PISA administration and includes a discussion of two statistical methods for…
Descriptors: Error of Measurement, Immigrants, Peer Groups, Bullying
Rujun Xu; James Soland – International Journal of Testing, 2024
International surveys are increasingly being used to understand nonacademic outcomes like math and science motivation, and to inform education policy changes within countries. Such instruments assume that the measure works consistently across countries, ethnicities, and languages--that is, they assume measurement invariance. While studies have…
Descriptors: Surveys, Statistical Bias, Achievement Tests, Foreign Countries
Guo, Xiuyan; Lei, Pui-Wa – International Journal of Testing, 2020
Little research has been done on the effects of peer raters' quality characteristics on peer rating qualities. This study aims to address this gap and investigate the effects of key variables related to peer raters' qualities, including content knowledge, previous rating experience, training on rating tasks, and rating motivation. In an experiment…
Descriptors: Peer Evaluation, Error Patterns, Correlation, Knowledge Level
Evers, Arne; McCormick, Carina M.; Hawley, Leslie R.; Muñiz, José; Balboni, Giulia; Bartram, Dave; Boben, Dusica; Egeland, Jens; El-Hassan, Karma; Fernández-Hermida, José R.; Fine, Saul; Frans, Örjan; Gintiliené, Grazina; Hagemeister, Carmen; Halama, Peter; Iliescu, Dragos; Jaworowska, Aleksandra; Jiménez, Paul; Manthouli, Marina; Matesic, Krunoslav; Michaelsen, Lars; Mogaji, Andrew; Morley-Kirk, James; Rózsa, Sándor; Rowlands, Lorraine; Schittekatte, Mark; Sümer, H. Canan; Suwartono, Tono; Urbánek, Tomáš; Wechsler, Solange; Zelenevska, Tamara; Zanev, Svetoslav; Zhang, Jianxin – International Journal of Testing, 2017
On behalf of the International Test Commission and the European Federation of Psychologists' Associations a world-wide survey on the opinions of professional psychologists on testing practices was carried out. The main objective of this study was to collect data for a better understanding of the state of psychological testing worldwide. These data…
Descriptors: Testing, Attitudes, Surveys, Psychologists
Prasad, Joshua J.; Showler, Morgan B.; Schmitt, Neal; Ryan, Ann Marie; Nye, Christopher D. – International Journal of Testing, 2017
The present research compares the operation of situational judgement and biodata measures between Chinese and U.S. respondents. We describe the development and past research on both measures, followed by hypothesized differences across the two groups of respondents. We base hypotheses on the nature of the Chinese and U.S. educational systems and…
Descriptors: Measures (Individuals), Hypothesis Testing, Cross Cultural Studies, Comparative Analysis
Sen, Sedat – International Journal of Testing, 2018
Recent research has shown that over-extraction of latent classes can be observed in the Bayesian estimation of the mixed Rasch model when the distribution of ability is non-normal. This study examined the effect of non-normal ability distributions on the number of latent classes in the mixed Rasch model when estimated with maximum likelihood…
Descriptors: Item Response Theory, Comparative Analysis, Computation, Maximum Likelihood Statistics
Ercikan, Kadriye; Chen, Michelle Y.; Lyons-Thomas, Juliette; Goodrich, Shawna; Sandilands, Debra; Roth, Wolff-Michael; Simon, Marielle – International Journal of Testing, 2015
The purpose of this research is to examine the comparability of mathematics and science scores for students from English language backgrounds (ELB) and non-English language backgrounds (NELB). We examine the relationship between English reading proficiency and performance on mathematics and science assessments in Australia, Canada, the United…
Descriptors: Scores, Mathematics Tests, Science Tests, Native Speakers
Oliveri, María Elena; Ercikan, Kadriye; Zumbo, Bruno D.; Lawless, René – International Journal of Testing, 2014
In this study, we contrast results from two differential item functioning (DIF) approaches (manifest and latent class) by the number of items and sources of items identified as DIF using data from an international reading assessment. The latter approach yielded three latent classes, presenting evidence of heterogeneity in examinee response…
Descriptors: Test Bias, Comparative Analysis, Reading Tests, Effect Size
Moshinsky, Avital; Ziegler, David; Gafni, Naomi – International Journal of Testing, 2017
Many medical schools have adopted multiple mini-interviews (MMI) as an advanced selection tool. MMIs are expensive and used to test only a few dozen candidates per day, making it infeasible to develop a different test version for each test administration. Therefore, some items are reused both within and across years. This study investigated the…
Descriptors: Interviews, Medical Schools, Test Validity, Test Reliability
Choi, Youn-Jeng; Alexeev, Natalia; Cohen, Allan S. – International Journal of Testing, 2015
The purpose of this study was to explore what may be contributing to differences in performance in mathematics on the Trends in International Mathematics and Science Study 2007. This was done by using a mixture item response theory modeling approach to first detect latent classes in the data and then to examine differences in performance on items…
Descriptors: Test Bias, Mathematics Achievement, Mathematics Tests, Item Response Theory
Oliveri, Maria Elena; von Davier, Matthias – International Journal of Testing, 2014
In this article, we investigate the creation of comparable score scales across countries in international assessments. We examine potential improvements to current score scale calibration procedures used in international large-scale assessments. Our approach seeks to improve fairness in scoring international large-scale assessments, which often…
Descriptors: Test Bias, Scores, International Programs, Educational Assessment
Rogers, W. Todd; Radwan, Nizam – International Journal of Testing, 2015
Restricted equating samples are often used to equate test results. Previously eligible students may be excluded because this group of students is not stable from year to year and their inclusion may bias the results. The present study evaluated the impact of including previously eligible students in the equating samples, where the percentage of…
Descriptors: Eligibility, Equated Scores, Foreign Countries, Public Schools
Cui, Ying; Mousavi, Amin – International Journal of Testing, 2015
The current study applied the person-fit statistic, l_z, to data from a Canadian provincial achievement test to explore the usefulness of conducting person-fit analysis on large-scale assessments. Item parameter estimates were compared before and after the misfitting student responses, as identified by l_z, were removed. The…
Descriptors: Measurement, Achievement Tests, Comparative Analysis, Test Items
Asil, Mustafa; Brown, Gavin T. L. – International Journal of Testing, 2016
The use of the Programme for International Student Assessment (PISA) across nations, cultures, and languages has been criticized. The key criticisms point to the linguistic and cultural biases potentially underlying the design of reading comprehension tests, raising doubts about the legitimacy of comparisons across economies. Our research focused…
Descriptors: Comparative Analysis, Reading Achievement, Achievement Tests, Secondary School Students
Dodeen, Hamzeh; Abdelfattah, Faisal; Shumrani, Saleh; Hilal, Maher Abu – International Journal of Testing, 2012
This study focused on comparing mathematics teachers' qualifications, practices, and perceptions between Saudi and Taiwanese schools. Data analyzed in this study were the responses of mathematics teachers to the Teacher Background Questionnaire--8th Grade from the Trends in International Mathematics and Science Study (TIMSS) in 2007. The Saudi…
Descriptors: Grade 8, Teacher Background, Mathematics Teachers, Educational Environment