NotesFAQContact Us
Collection
Advanced
Search Tips
Source
International Journal of…20
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 20 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
George, Ann Cathrice; Robitzsch, Alexander – International Journal of Testing, 2021
Modern large-scale studies such as the Progress in International Reading Literacy Study (PIRLS) do not only report reading competence of students on a global reading scale but also report reading on the level of reading subskills. However, the number of and the dependencies between the subskills are frequently discussed. In this study, different…
Descriptors: Foreign Countries, Grade 4, Achievement Tests, International Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Wu, Amery D.; Stone, Jake E.; Liu, Yan – International Journal of Testing, 2016
This article proposes and demonstrates a methodology for test score validation through abductive reasoning. It describes how abductive reasoning can be utilized in support of the claims made about test score validity. This methodology is demonstrated with a real data example of the Canadian English Language Proficiency Index Program…
Descriptors: Test Validity, Scores, Logical Thinking, Multivariate Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Kunina-Habenicht, Olga; Rupp, André A.; Wilhelm, Oliver – International Journal of Testing, 2017
Diagnostic classification models (DCMs) hold great potential for applications in summative and formative assessment by providing discrete multivariate proficiency scores that yield statistically driven classifications of students. Using data from a newly developed diagnostic arithmetic assessment that was administered to 2032 fourth-grade students…
Descriptors: Grade 4, Foreign Countries, Classification, Mathematics Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Kajonius, Petri J. – International Journal of Testing, 2017
Research is currently testing how the new maladaptive personality inventory for DSM (PID-5) and the well-established common Five-Factor Model (FFM) together can serve as an empirical and theoretical foundation for clinical psychology. The present study investigated the official short version of the PID-5 together with a common short version of…
Descriptors: Foreign Countries, Personality Measures, Personality Traits, Clinical Diagnosis
Peer reviewed Peer reviewed
Direct linkDirect link
Baghaei, Purya; Aryadoust, Vahid – International Journal of Testing, 2015
Research shows that test method can exert a significant impact on test takers' performance and thereby contaminate test scores. We argue that common test method can exert the same effect as common stimuli and violate the conditional independence assumption of item response theory models because, in general, subsets of items which have a shared…
Descriptors: Test Format, Item Response Theory, Models, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, HyeSun; Geisinger, Kurt F. – International Journal of Testing, 2014
Differential item functioning (DIF) analysis is important in terms of test fairness. While DIF analyses have mainly been conducted with manifest grouping variables, such as gender or race/ethnicity, it has been recently claimed that not only the grouping variables but also contextual variables pertaining to examinees should be considered in DIF…
Descriptors: Test Bias, Gender Differences, Regression (Statistics), Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Chu, Man-Wai; Babenko, Oksana; Cui, Ying; Leighton, Jacqueline P. – International Journal of Testing, 2014
The study examines the role that perceptions or impressions of learning environments and assessments play in students' performance on a large-scale standardized test. Hierarchical linear modeling (HLM) was used to test aspects of the Learning Errors and Formative Feedback model to determine how much variation in students' performance was explained…
Descriptors: Hierarchical Linear Modeling, Secondary School Students, Student Attitudes, Educational Environment
Peer reviewed Peer reviewed
Direct linkDirect link
Byrne, Barbara M.; van de Vijver, Fons J. R. – International Journal of Testing, 2014
In cross-cultural research, there is a tendency for researchers to draw inferences at the country level based on individual-level data. Such action implicitly and often mistakenly assumes that both the measuring instrument and its underlying construct(s) are operating equivalently across both levels. Based on responses from 5,482 college students…
Descriptors: Factor Structure, Measures (Individuals), Cross Cultural Studies, Structural Equation Models
Peer reviewed Peer reviewed
Direct linkDirect link
Skaggs, Gary; Wilkins, Jesse L. M.; Hein, Serge F. – International Journal of Testing, 2016
The purpose of this study was to explore the degree of grain size of the attributes and the sample sizes that can support accurate parameter recovery with the General Diagnostic Model (GDM) for a large-scale international assessment. In this resampling study, bootstrap samples were obtained from the 2003 Grade 8 TIMSS in Mathematics at varying…
Descriptors: Achievement Tests, Foreign Countries, Elementary Secondary Education, Science Achievement
Peer reviewed Peer reviewed
Direct linkDirect link
Ong, Yoke Mooi; Williams, Julian; Lamprianou, Iasonas – International Journal of Testing, 2015
The purpose of this article is to explore crossing differential item functioning (DIF) in a test drawn from a national examination of mathematics for 11-year-old pupils in England. An empirical dataset was analyzed to explore DIF by gender in a mathematics assessment. A two-step process involving the logistic regression (LR) procedure for…
Descriptors: Mathematics Tests, Gender Differences, Test Bias, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Lindley, Patricia A.; Bartram, Dave – International Journal of Testing, 2012
In this article, we present the background to the development of test reviewing by the British Psychological Society (BPS) in the United Kingdom. We also describe the role played by the BPS in the development of the EFPA test review model and its adaptation for use in test reviewing in the United Kingdom. We conclude with a discussion of lessons…
Descriptors: Test Reviews, Professional Associations, Psychology, Global Approach
Peer reviewed Peer reviewed
Direct linkDirect link
Fukuda, Eriko; Saklofske, Donald H.; Tamaoka, Katsuo; Fung, Tak Shing; Miyaoka, Yayoi; Kiyama, Sachiko – International Journal of Testing, 2011
This article reports the psychometric properties of two emotional intelligence measures translated into Japanese. Confirmatory factor analysis (CFA) was conducted to examine the factor structure of a Japanese version of the Wong and Law Emotional Intelligence Scale (WLEIS) completed by 310 Japanese university students. A second study employed CFA…
Descriptors: Emotional Intelligence, Japanese, Factor Structure, Measures (Individuals)
Peer reviewed Peer reviewed
Direct linkDirect link
Gierl, Mark J.; Lai, Hollis – International Journal of Testing, 2012
Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…
Descriptors: Foreign Countries, Psychometrics, Test Construction, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Magis, David; Raiche, Gilles; Beland, Sebastien; Gerard, Paul – International Journal of Testing, 2011
We present an extension of the logistic regression procedure to identify dichotomous differential item functioning (DIF) in the presence of more than two groups of respondents. Starting from the usual framework of a single focal group, we propose a general approach to estimate the item response functions in each group and to test for the presence…
Descriptors: Language Skills, Identification, Foreign Countries, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Byrne, Barbara M.; van de Vijver, Fons J. R. – International Journal of Testing, 2010
A critical assumption in cross-cultural comparative research is that the instrument measures the same construct(s) in exactly the same way across all groups (i.e., the instrument is measurement and structurally equivalent). Structural equation modeling (SEM) procedures are commonly used in testing these assumptions of multigroup equivalence.…
Descriptors: Measures (Individuals), Cross Cultural Studies, Measurement, Comparative Analysis
Previous Page | Next Page »
Pages: 1  |  2