Showing 1 to 15 of 16 results
Peer reviewed
Roelofs, Erik C.; Emons, Wilco H. M.; Verschoor, Angela J. – International Journal of Testing, 2021
This study reports on an Evidence Centered Design (ECD) project in the Netherlands, involving the theory exam for prospective car drivers. In particular, we illustrate how cognitive load theory, task-analysis, response process models, and explanatory item-response theory can be used to systematically develop and refine task models. Based on a…
Descriptors: Foreign Countries, Psychometrics, Test Items, Evidence Based Practice
Peer reviewed
Moon, Jung Aa; Sinharay, Sandip; Keehner, Madeleine; Katz, Irvin R. – International Journal of Testing, 2020
The current study examined the relationship between test-taker cognition and psychometric item properties in multiple-selection multiple-choice and grid items. In a study with content-equivalent mathematics items in alternative item formats, adult participants' tendency to respond to an item was affected by the presence of a grid and variations of…
Descriptors: Computer Assisted Testing, Multiple Choice Tests, Test Wiseness, Psychometrics
Peer reviewed
Luo, Xiao; Wang, Xinrui – International Journal of Testing, 2019
This study introduced dynamic multistage testing (dy-MST) as an improvement to existing adaptive testing methods. dy-MST combines the advantages of computerized adaptive testing (CAT) and computerized adaptive multistage testing (ca-MST) to create a highly efficient and regulated adaptive testing method. In the test construction phase, multistage…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Construction, Psychometrics
Peer reviewed
Tsaousis, Ioannis; Sideridis, Georgios; Al-Saawi, Fahad – International Journal of Testing, 2018
The aim of the present study was to examine Differential Distractor Functioning (DDF) as a means of improving the quality of a measure through understanding biased responses across groups. A DDF analysis could shed light on the potential sources of construct-irrelevant variance by examining whether the differential selection of incorrect choices…
Descriptors: Foreign Countries, College Entrance Examinations, Test Bias, Chemistry
Peer reviewed
Guenole, Nigel; Chernyshenko, Oleksandr S.; Weekley, Jeff – International Journal of Testing, 2017
Situational judgment tests (SJTs) are widely agreed to be a measurement technique. It is also widely agreed that SJTs are a questionable methodological choice for measurement of psychological constructs, such as behavioral competencies, due to a lack of evidence supporting appropriate factor structures and high internal consistencies.…
Descriptors: Situational Tests, Psychological Evaluation, Test Construction, Industrial Psychology
Peer reviewed
Papageorgiou, Spiros; Choi, Ikkyu – International Journal of Testing, 2018
This study examined whether reporting subscores for groups of items within a test section assessing a second-language modality (specifically reading or listening comprehension) added value from a measurement perspective to the information already provided by the section scores. We analyzed the responses of 116,489 test takers to reading and…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Language Tests
Peer reviewed
Maddox, Bryan; Zumbo, Bruno D.; Tay-Lim, Brenda; Qu, Demin – International Journal of Testing, 2015
This article explores the potential for ethnographic observations to inform the analysis of test item performance. In 2010, a standardized, large-scale adult literacy assessment took place in Mongolia as part of the United Nations Educational, Scientific and Cultural Organization Literacy Assessment and Monitoring Programme (LAMP). In a novel form…
Descriptors: Anthropology, Psychometrics, Ethnography, Adult Literacy
Peer reviewed
Kim, Sooyeon; Moses, Tim – International Journal of Testing, 2013
The major purpose of this study is to assess the conditions under which single scoring for constructed-response (CR) items is as effective as double scoring in the licensure testing context. We used both empirical datasets of five mixed-format licensure tests collected in actual operational settings and simulated datasets that allowed for the…
Descriptors: Scoring, Test Format, Licensing Examinations (Professions), Test Items
Peer reviewed
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Peer reviewed
Gierl, Mark J.; Lai, Hollis – International Journal of Testing, 2012
Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…
Descriptors: Foreign Countries, Psychometrics, Test Construction, Test Items
Peer reviewed
Gattamorta, Karina A.; Penfield, Randall D.; Myers, Nicholas D. – International Journal of Testing, 2012
Measurement invariance is a common consideration in the evaluation of the validity and fairness of test scores when the tested population contains distinct groups of examinees, such as examinees receiving different forms of a translated test. Measurement invariance in polytomous items has traditionally been evaluated at the item level,…
Descriptors: Foreign Countries, Psychometrics, Test Bias, Test Items
Peer reviewed
Svetina, Dubravka; Gorin, Joanna S.; Tatsuoka, Kikumi K. – International Journal of Testing, 2011
As a construct definition, the current study develops a cognitive model describing the knowledge, skills, and abilities measured by critical reading test items on a high-stakes assessment used for selection decisions in the United States. Additionally, in order to establish generalizability of the construct meaning to other similarly structured…
Descriptors: Reading Tests, Reading Comprehension, Critical Reading, Test Items
Peer reviewed
Gierl, Mark J.; Alves, Cecilia; Majeau, Renate Taylor – International Journal of Testing, 2010
The purpose of this study is to apply the attribute hierarchy method in an operational diagnostic mathematics program at Grades 3 and 6 to promote cognitive inferences about students' problem-solving skills. The attribute hierarchy method is a psychometric procedure for classifying examinees' test item responses into a set of structured attribute…
Descriptors: Test Items, Student Reaction, Diagnostic Tests, Psychometrics
Peer reviewed
Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009
Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…
Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment
Peer reviewed
Allalouf, Avi; Rapp, Joel; Stoller, Reuven – International Journal of Testing, 2009
When a test is adapted from a source language (SL) into a target language (TL), the two forms are usually not psychometrically equivalent. If linking between test forms is necessary, those items that have had their psychometric characteristics altered by the translation (differential item functioning [DIF] items) should be eliminated from the…
Descriptors: Test Items, Test Format, Verbal Tests, Psychometrics