ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	8

Descriptor

Foreign Countries	9
Validity	5
Achievement Tests	4
International Assessment	4
Test Items	3
Test Validity	3
Computer Assisted Testing	2
Psychometrics	2
Reading Comprehension	2
Secondary School Students	2
Test Use	2
Accountability	1
Adaptive Testing	1
Admission (School)	1
Adults	1
Case Studies	1
Cognitive Tests	1
College Bound Students	1
College Entrance Examinations	1
Competence	1
Construct Validity	1
Cultural Influences	1
Cutting Scores	1
Diagnostic Tests	1
Difficulty Level	1
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	9
Reports - Research	6
Reports - Descriptive	2
Reports - Evaluative	1

Education Level

Secondary Education	5
Early Childhood Education	1
Elementary Education	1
Grade 4	1
Grade 9	1
High Schools	1
Higher Education	1
Intermediate Grades	1
Postsecondary Education	1
Preschool Education	1

Audience

Location

Canada	2
Germany	2
Greece	1
Israel	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	3
Progress in International…	1

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Psychometric Evaluation of the Preschool Early Numeracy Skills Test--Brief Version within the Item Response Theory Framework

Peer reviewed

Direct link

Tsigilis, Nikolaos; Krousorati, Katerina; Gregoriadis, Athanasios; Grammatikopoulos, Vasilis – Educational Measurement: Issues and Practice, 2023

The Preschool Early Numeracy Skills Test--Brief Version (PENS-B) is a measure of early numeracy skills, developed and mainly used in the United States. The purpose of this study was to examine the factorial validity and measurement invariance across gender of PENS-B in the Greek educational context. PENS-B was administered to 906 preschool…

Descriptors: Psychometrics, Preschool Education, Numeracy, Item Response Theory

Validation as Evaluating Desired and Undesired Effects: Insights from Cross-Classified Mixed Effects Model

Peer reviewed

Direct link

Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023

The Cross-Classified Mixed Effects Model (CCMEM) has been demonstrated to be a flexible framework for evaluating reliability by measurement specialists. Reliability can be estimated based on the variance components of the test scores. Built upon their accomplishment, this study extends the CCMEM to be used for evaluating validity evidence.…

Descriptors: Measurement, Validity, Reliability, Models

When Assessment Validation Neglects Any Strand of Validity Evidence: An Instructive Example from PISA

Peer reviewed

Direct link

Pepper, David – Educational Measurement: Issues and Practice, 2020

The Standards for Educational and Psychological Testing identify several strands of validity evidence that may be needed as support for particular interpretations and uses of assessments. Yet assessment validation often does not seem guided by these Standards, with validations lacking a particular strand even when it appears relevant to an…

Descriptors: Validity, Foreign Countries, Achievement Tests, International Assessment

Construct Equivalence of PISA Reading Comprehension Measured with Paper-Based and Computer-Based Assessments

Peer reviewed

Direct link

Kroehne, Ulf; Buerger, Sarah; Hahnel, Carolin; Goldhammer, Frank – Educational Measurement: Issues and Practice, 2019

For many years, reading comprehension in the Programme for International Student Assessment (PISA) was measured via paper-based assessment (PBA). In the 2015 cycle, computer-based assessment (CBA) was introduced, raising the question of whether central equivalence criteria required for a valid interpretation of the results are fulfilled. As an…

Descriptors: Reading Comprehension, Computer Assisted Testing, Achievement Tests, Foreign Countries

Multistage Adaptive Testing Design in International Large-Scale Assessments

Peer reviewed

Direct link

Yamamoto, Kentaro; Shin, Hyo Jeong; Khorramdel, Lale – Educational Measurement: Issues and Practice, 2018

A multistage adaptive testing (MST) design was implemented for the Programme for the International Assessment of Adult Competencies (PIAAC) starting in 2012 for about 40 countries and has been implemented for the 2018 cycle of the Programme for International Student Assessment (PISA) for more than 80 countries. Using examples from PISA and PIAAC,…

Descriptors: International Assessment, Foreign Countries, Achievement Tests, Test Validity

The Multiple-Use of Accountability Assessments: Implications for the Process of Validation

Peer reviewed

Direct link

Koch, Martha J. – Educational Measurement: Issues and Practice, 2014

Implications of the multiple-use of accountability assessments for the process of validation are examined. Multiple-use refers to the simultaneous use of results from a single administration of an assessment for its intended use and for one or more additional uses. A theoretical discussion of the issues for validation which emerge from…

Descriptors: Foreign Countries, Test Use, Accountability, Validity

Validating Student Score Inferences with Person-Fit Statistic and Verbal Reports: A Person-Fit Study for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Roberts, Mary Roduta – Educational Measurement: Issues and Practice, 2013

The goal of this study was to investigate the usefulness of person-fit analysis in validating student score inferences in a cognitive diagnostic assessment. In this study, a two-stage procedure was used to evaluate person fit for a diagnostic test in the domain of statistical hypothesis testing. In the first stage, the person-fit statistic, the…

Descriptors: Scores, Validity, Cognitive Tests, Diagnostic Tests

Setting Standards for English Foreign Language Assessment: Methodology, Validation, and a Degree of Arbitrariness

Peer reviewed

Direct link

Tiffin-Richards, Simon P.; Pant, Hans Anand; Koller, Olaf – Educational Measurement: Issues and Practice, 2013

Cut-scores were set by expert judges on assessments of reading and listening comprehension of English as a foreign language (EFL), using the bookmark standard-setting method to differentiate proficiency levels defined by the Common European Framework of Reference (CEFR). Assessments contained stratified item samples drawn from extensive item…

Descriptors: Foreign Countries, English (Second Language), Language Tests, Standard Setting (Scoring)

Psychometric and Social Issues in Admissions to Israeli Universities.

Peer reviewed

Beller, Michal – Educational Measurement: Issues and Practice, 1994

A broad description is given of admissions procedures to Israeli universities. In Israel, a single unified test, the Psychometric Entrance Test, is used for admission to the various universities. Issues of validity and reliability and problems of ensuring fairness for non-Hebrew speakers are considered. (SLD)

Descriptors: Admission (School), College Bound Students, College Entrance Examinations, Equal Education

Beller, Michal	1
Buerger, Sarah	1
Cui, Ying	1
Goldhammer, Frank	1
Grammatikopoulos, Vasilis	1
Gregoriadis, Athanasios	1
Hahnel, Carolin	1
Ji, Xuejun Ryan	1
Khorramdel, Lale	1
Koch, Martha J.	1
Koller, Olaf	1
Kroehne, Ulf	1
Krousorati, Katerina	1
Pant, Hans Anand	1
Pepper, David	1
Roberts, Mary Roduta	1
Shin, Hyo Jeong	1
Tiffin-Richards, Simon P.	1
Tsigilis, Nikolaos	1
Wu, Amery D.	1
Yamamoto, Kentaro	1
More ▼