Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 8 |
Descriptor
Foreign Countries | 9 |
Validity | 5 |
Achievement Tests | 4 |
International Assessment | 4 |
Test Items | 3 |
Test Validity | 3 |
Computer Assisted Testing | 2 |
Psychometrics | 2 |
Reading Comprehension | 2 |
Secondary School Students | 2 |
Test Use | 2 |
More ▼ |
Source
Educational Measurement:… | 9 |
Author
Beller, Michal | 1 |
Buerger, Sarah | 1 |
Cui, Ying | 1 |
Goldhammer, Frank | 1 |
Grammatikopoulos, Vasilis | 1 |
Gregoriadis, Athanasios | 1 |
Hahnel, Carolin | 1 |
Ji, Xuejun Ryan | 1 |
Khorramdel, Lale | 1 |
Koch, Martha J. | 1 |
Koller, Olaf | 1 |
More ▼ |
Publication Type
Journal Articles | 9 |
Reports - Research | 6 |
Reports - Descriptive | 2 |
Reports - Evaluative | 1 |
Education Level
Secondary Education | 5 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 4 | 1 |
Grade 9 | 1 |
High Schools | 1 |
Higher Education | 1 |
Intermediate Grades | 1 |
Postsecondary Education | 1 |
Preschool Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 3 |
Progress in International… | 1 |
What Works Clearinghouse Rating
Tsigilis, Nikolaos; Krousorati, Katerina; Gregoriadis, Athanasios; Grammatikopoulos, Vasilis – Educational Measurement: Issues and Practice, 2023
The Preschool Early Numeracy Skills Test--Brief Version (PENS-B) is a measure of early numeracy skills, developed and mainly used in the United States. The purpose of this study was to examine the factorial validity and measurement invariance across gender of PENS-B in the Greek educational context. PENS-B was administered to 906 preschool…
Descriptors: Psychometrics, Preschool Education, Numeracy, Item Response Theory
Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023
The Cross-Classified Mixed Effects Model (CCMEM) has been demonstrated to be a flexible framework for evaluating reliability by measurement specialists. Reliability can be estimated based on the variance components of the test scores. Built upon their accomplishment, this study extends the CCMEM to be used for evaluating validity evidence.…
Descriptors: Measurement, Validity, Reliability, Models
Pepper, David – Educational Measurement: Issues and Practice, 2020
The Standards for Educational and Psychological Testing identify several strands of validity evidence that may be needed as support for particular interpretations and uses of assessments. Yet assessment validation often does not seem guided by these Standards, with validations lacking a particular strand even when it appears relevant to an…
Descriptors: Validity, Foreign Countries, Achievement Tests, International Assessment
Kroehne, Ulf; Buerger, Sarah; Hahnel, Carolin; Goldhammer, Frank – Educational Measurement: Issues and Practice, 2019
For many years, reading comprehension in the Programme for International Student Assessment (PISA) was measured via paper-based assessment (PBA). In the 2015 cycle, computer-based assessment (CBA) was introduced, raising the question of whether central equivalence criteria required for a valid interpretation of the results are fulfilled. As an…
Descriptors: Reading Comprehension, Computer Assisted Testing, Achievement Tests, Foreign Countries
Yamamoto, Kentaro; Shin, Hyo Jeong; Khorramdel, Lale – Educational Measurement: Issues and Practice, 2018
A multistage adaptive testing (MST) design was implemented for the Programme for the International Assessment of Adult Competencies (PIAAC) starting in 2012 for about 40 countries and has been implemented for the 2018 cycle of the Programme for International Student Assessment (PISA) for more than 80 countries. Using examples from PISA and PIAAC,…
Descriptors: International Assessment, Foreign Countries, Achievement Tests, Test Validity
Koch, Martha J. – Educational Measurement: Issues and Practice, 2014
Implications of the multiple-use of accountability assessments for the process of validation are examined. Multiple-use refers to the simultaneous use of results from a single administration of an assessment for its intended use and for one or more additional uses. A theoretical discussion of the issues for validation which emerge from…
Descriptors: Foreign Countries, Test Use, Accountability, Validity
Cui, Ying; Roberts, Mary Roduta – Educational Measurement: Issues and Practice, 2013
The goal of this study was to investigate the usefulness of person-fit analysis in validating student score inferences in a cognitive diagnostic assessment. In this study, a two-stage procedure was used to evaluate person fit for a diagnostic test in the domain of statistical hypothesis testing. In the first stage, the person-fit statistic, the…
Descriptors: Scores, Validity, Cognitive Tests, Diagnostic Tests
Tiffin-Richards, Simon P.; Pant, Hans Anand; Koller, Olaf – Educational Measurement: Issues and Practice, 2013
Cut-scores were set by expert judges on assessments of reading and listening comprehension of English as a foreign language (EFL), using the bookmark standard-setting method to differentiate proficiency levels defined by the Common European Framework of Reference (CEFR). Assessments contained stratified item samples drawn from extensive item…
Descriptors: Foreign Countries, English (Second Language), Language Tests, Standard Setting (Scoring)

Beller, Michal – Educational Measurement: Issues and Practice, 1994
A broad description is given of admissions procedures to Israeli universities. In Israel, a single unified test, the Psychometric Entrance Test, is used for admission to the various universities. Issues of validity and reliability and problems of ensuring fairness for non-Hebrew speakers are considered. (SLD)
Descriptors: Admission (School), College Bound Students, College Entrance Examinations, Equal Education