Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 13 |
Descriptor
Test Validity | 14 |
Test Reliability | 8 |
Test Construction | 6 |
Testing | 5 |
Foreign Countries | 4 |
Psychometrics | 4 |
Scoring | 4 |
Test Bias | 4 |
Guidelines | 3 |
Scores | 3 |
Test Items | 3 |
More ▼ |
Source
International Journal of… | 14 |
Author
Sireci, Stephen G. | 2 |
Anderson, Robin D. | 1 |
Beck, Klaus | 1 |
Bonner, Cavan V. | 1 |
Camilla Rjosk | 1 |
Elosua, Paula | 1 |
Emons, Wilco H. M. | 1 |
Faulkner-Bond, Molly | 1 |
Finney, Sara J. | 1 |
Geisinger, Kurt F. | 1 |
Iliescu, Dragos | 1 |
More ▼ |
Publication Type
Journal Articles | 14 |
Reports - Research | 8 |
Reports - Descriptive | 4 |
Tests/Questionnaires | 2 |
Guides - General | 1 |
Guides - Non-Classroom | 1 |
Information Analyses | 1 |
Education Level
Higher Education | 4 |
Elementary Secondary Education | 2 |
Postsecondary Education | 2 |
Audience
Practitioners | 1 |
Researchers | 1 |
Location
Germany | 2 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Liou, Gloria; Bonner, Cavan V.; Tay, Louis – International Journal of Testing, 2022
With the advent of big data and advances in technology, psychological assessments have become increasingly sophisticated and complex. Nevertheless, traditional psychometric issues concerning the validity, reliability, and measurement bias of such assessments remain fundamental in determining whether score inferences of human attributes are…
Descriptors: Psychometrics, Computer Assisted Testing, Adaptive Testing, Data
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
International Journal of Testing, 2018
The second edition of the International Test Commission Guidelines for Translating and Adapting Tests was prepared between 2005 and 2015 to improve upon the first edition, and to respond to advances in testing technology and practices. The 18 guidelines are organized into six categories to facilitate their use: pre-condition (3), test development…
Descriptors: Translation, Test Construction, Testing, Scoring
Shavelson, Richard J.; Zlatkin-Troitschanskaia, Olga; Beck, Klaus; Schmidt, Susanne; Marino, Julian P. – International Journal of Testing, 2019
Following employers' criticisms and recent societal developments, policymakers and educators have called for students to develop a range of generic skills such as critical thinking ("twenty-first century skills"). So far, such skills have typically been assessed by student self-reports or with multiple-choice tests. An alternative…
Descriptors: Critical Thinking, Cognitive Tests, Performance Based Assessment, Student Evaluation
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Roschmann, Sarina; Witmer, Sara E.; Volker, Martin A. – International Journal of Testing, 2021
Accommodations are commonly provided to address language-related barriers students may experience during testing. Research on the validity of scores from accommodated test administrations remains somewhat inconclusive. The current study investigated item response patterns to understand whether accommodations, as used in practice among English…
Descriptors: Testing Accommodations, English Language Learners, Scores, Item Response Theory
Faulkner-Bond, Molly; Sireci, Stephen G. – International Journal of Testing, 2015
Throughout the world, tests are administered to some examinees who are not fully proficient in the language in which they are being tested. It has long been acknowledged that proficiency in the language in which a test is administered often affects examinees' performance on a test. Depending on the context and intended uses for a particular…
Descriptors: Language Minorities, Test Validity, Language Proficiency, Test Construction
Oliveri, María Elena; von Davier, Alina A. – International Journal of Testing, 2016
In this study, we propose that the unique needs and characteristics of linguistic minorities should be considered throughout the test development process. Unlike most measurement invariance investigations in the assessment of linguistic minorities, which typically are conducted after test administration, we propose strategies that focus on the…
Descriptors: Psychometrics, Linguistics, Test Construction, Testing
Zilberberg, Anna; Finney, Sara J.; Marsh, Kimberly R.; Anderson, Robin D. – International Journal of Testing, 2014
Given worldwide prevalence of low-stakes testing for monitoring educational quality and students' progress through school (e.g., Trends in International Mathematics and Science Study, Program for International Student Assessment), interpretability of resulting test scores is of global concern. The nonconsequential nature of low-stakes tests…
Descriptors: Student Attitudes, Student Motivation, Test Validity, Accountability
Geisinger, Kurt F. – International Journal of Testing, 2012
This article sets the stage for the description of a variety of approaches to test reviewing worldwide. It describes the importance of test reviewing as a protection of the public and of society and also the benefits of this activity for test users, who must choose measures to use in particular situations with particular clients at a particular…
Descriptors: Test Reviews, Evaluation Methods, Evaluation Criteria, Global Approach
Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2013
To efficiently assess multiple psychological constructs and to minimize the burden on respondents, psychologists increasingly use shortened versions of existing tests. However, compared to the longer test, a shorter test version may have a substantial impact on the reliability and the validity of the test scores in psychological research and…
Descriptors: Test Length, Psychological Testing, Test Use, Test Validity
Rios, Joseph A.; Sireci, Stephen G. – International Journal of Testing, 2014
The International Test Commission's "Guidelines for Translating and Adapting Tests" (2010) provide important guidance on developing and evaluating tests for use across languages. These guidelines are widely applauded, but the degree to which they are followed in practice is unknown. The objective of this study was to perform a…
Descriptors: Guidelines, Translation, Adaptive Testing, Second Languages
Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012
Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…
Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries
Li, Yuan H.; Tompkins, Leroy J. – International Journal of Testing, 2004
The primary objective of this study was to examine the construct validity for the 2 multiple-content testing programs-the multiple-choice Comprehensive Tests of Basic Skills (CTBS/5) together with the performance-based Maryland School Performance Assessment Program (MSPAP)-by evaluating the true-score longitudinal associations among…
Descriptors: Testing Programs, Structural Equation Models, Performance Based Assessment, Multitrait Multimethod Techniques