Showing 1 to 15 of 19 results
Peer reviewed
Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025
This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple-choice (MC) and mixed-format tests within the common-item nonequivalent groups design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…
Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis
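To make the linking step concrete: below is a minimal Python sketch of mean/sigma linking, the simplest member of this family, using made-up unidimensional 2PL estimates for a set of common items. The multidimensional bifactor methods compared in the study generalize this one-dimensional transformation; nothing here reproduces the paper's procedure.

```python
import numpy as np

# Made-up 2PL estimates for four common items, calibrated separately
# on the reference (old) form and the new form.
a_old = np.array([1.10, 0.85, 1.40, 0.95])
b_old = np.array([-0.50, 0.20, 1.10, -1.00])
a_new = np.array([1.00, 0.80, 1.30, 0.90])
b_new = np.array([-0.30, 0.45, 1.35, -0.75])

# Mean/sigma linking: choose A and B so that theta_old = A * theta_new + B
# places the new calibration on the old form's scale.
A = b_old.std(ddof=1) / b_new.std(ddof=1)
B = b_old.mean() - A * b_new.mean()

# Apply the transformation to the new-form item parameters.
b_linked = A * b_new + B   # difficulties shift and rescale
a_linked = a_new / A       # discriminations rescale inversely
print(A, B, b_linked, a_linked)
```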
Peer reviewed
Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025
Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…
Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment
Kim, Dong-In; Julian, Marc; Hermann, Pam – Online Submission, 2022
In test equating, one critical equating property is group invariance, which requires that the equating function used to convert performance on each alternate form to the reporting scale be the same across subgroups. To mitigate the impact of disrupted learning on the item parameters during the COVID-19 pandemic, a…
Descriptors: COVID-19, Pandemics, Test Format, Equated Scores
Wolf, Raffaela – ProQuest LLC, 2013
Preservation of equity properties was examined using four equating methods--IRT True Score, IRT Observed Score, Frequency Estimation, and Chained Equipercentile--in a mixed-format test under a common-item nonequivalent groups (CINEG) design. Equating of mixed-format tests under a CINEG design can be influenced by factors such as attributes of the…
Descriptors: Testing, Item Response Theory, Equated Scores, Test Items
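For readers unfamiliar with the equipercentile family named above, here is a hedged Python sketch of basic equipercentile equating on synthetic number-correct scores: a form-X score is mapped to the form-Y score holding the same percentile rank. The data, continuity correction, and interpolation are illustrative choices, not the dissertation's procedure.

```python
import numpy as np

def percentile_rank(scores, x):
    """Percentile rank of score x, with the usual half-count correction."""
    scores = np.asarray(scores)
    return np.mean(scores < x) + 0.5 * np.mean(scores == x)

def equipercentile(x, scores_x, scores_y):
    """Map a form-X score to the form-Y score with the same percentile
    rank, interpolating linearly over the integer score range."""
    p = percentile_rank(scores_x, x)
    grid = np.arange(scores_y.min(), scores_y.max() + 1)
    ranks = np.array([percentile_rank(scores_y, y) for y in grid])
    return float(np.interp(p, ranks, grid))

# Synthetic number-correct scores for two 40-item forms.
rng = np.random.default_rng(0)
scores_x = rng.binomial(40, 0.55, 2000)
scores_y = rng.binomial(40, 0.60, 2000)
print(equipercentile(25, scores_x, scores_y))  # roughly 27 on form Y
```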
Peer reviewed
Dampier, Graham A. – South African Journal of Education, 2014
Presently, a plethora of instruments designed to assess mathematical skill, disposition, or competence prevails in South Africa, yet few adhere to the basic requirements of unidimensionality and invariance of measures. The Marko-D is a mathematical instrument designed to test learners between the ages of 4 and 8. The instrument, thus…
Descriptors: Foreign Countries, Student Evaluation, Mathematics Skills, Measurement Techniques
Peer reviewed
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving the equity property in mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under the common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
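The IRT true-score equating (TSE) step itself is compact enough to sketch: invert form X's test characteristic curve (TCC) to recover theta, then evaluate form Y's TCC at that theta. The 2PL parameters below are made up and assumed already linked; the study's mixed-format application is more elaborate.

```python
import numpy as np
from scipy.optimize import brentq

def tcc(theta, a, b):
    """Test characteristic curve: expected number-correct score under the 2PL."""
    return (1.0 / (1.0 + np.exp(-a * (theta - b)))).sum()

# Made-up, already-linked 2PL parameters for forms X and Y.
a_x, b_x = np.array([1.0, 1.2, 0.8]), np.array([-0.5, 0.0, 0.7])
a_y, b_y = np.array([0.9, 1.1, 1.0]), np.array([-0.2, 0.3, 0.5])

def tse_equate(score_x):
    """True-score equating: solve TCC_X(theta) = score_x for theta,
    then read the equivalent true score off form Y's TCC."""
    theta = brentq(lambda t: tcc(t, a_x, b_x) - score_x, -8.0, 8.0)
    return tcc(theta, a_y, b_y)

print(tse_equate(1.7))  # form-Y true score equivalent to 1.7 on form X
```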
Peer reviewed
Socha, Alan; DeMars, Christine E. – Educational and Psychological Measurement, 2013
Modeling multidimensional test data with a unidimensional model can result in serious statistical errors, such as bias in item parameter estimates. Many methods exist for assessing the dimensionality of a test. The current study focused on DIMTEST. Using simulated data, the effects of sample size splitting for use with the ATFIND procedure for…
Descriptors: Sample Size, Test Length, Correlation, Test Format
Peer reviewed
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques
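One common way to formalize such a position effect is to let an item's effective difficulty drift with its serial position. The toy Rasch-style model below is an assumption-laden illustration of that idea, not the paper's exact specification.

```python
import math

def p_rasch_position(theta, b, delta, position):
    """Rasch model with a linear item-position effect: effective
    difficulty grows by delta logits per position index."""
    return 1.0 / (1.0 + math.exp(-(theta - (b + delta * position))))

# With delta > 0 the same item looks harder late in the test,
# which biases difficulty estimates if the effect is ignored.
for pos in (0, 20, 40):
    print(pos, round(p_rasch_position(0.0, 0.0, 0.01, pos), 3))
```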
Peer reviewed
Kirschner, Sophie; Borowski, Andreas; Fischer, Hans E.; Gess-Newsome, Julie; von Aufschnaiter, Claudia – International Journal of Science Education, 2016
Teachers' professional knowledge is assumed to be a key variable for effective teaching. As teacher education aims to enhance the professional knowledge of current and future teachers, this knowledge should be described and assessed. Nevertheless, only a limited number of studies quantitatively measure physics teachers' professional…
Descriptors: Evaluation Methods, Tests, Test Format, Science Instruction
Mbella, Kinge Keka – ProQuest LLC, 2012
Mixed-format assessments are increasingly being used in large-scale standardized assessments to measure a continuum of skills ranging from basic recall to higher-order thinking skills. These assessments are usually composed of a combination of (a) multiple-choice items, which can be efficiently scored, have stable psychometric properties, and…
Descriptors: Educational Assessment, Test Format, Evaluation Methods, Multiple Choice Tests
Peer reviewed
Miyazaki, Kei; Hoshino, Takahiro; Mayekawa, Shin-ichi; Shigemasu, Kazuo – Psychometrika, 2009
This study proposes a new item parameter linking method for the common-item nonequivalent groups design in item response theory (IRT). Previous studies assumed that examinees are randomly assigned to either test form; in practice, however, examinees can frequently select their own test forms, and tests often differ according to examinees' abilities. In such…
Descriptors: Test Format, Item Response Theory, Test Items, Test Bias
Peer reviewed
Dorans, Neil J.; Liu, Jinghua; Hammond, Shelby – Applied Psychological Measurement, 2008
This exploratory study was built on research spanning three decades. Petersen, Marco, and Stewart (1982) conducted a major empirical investigation of the efficacy of different equating methods. The studies reported in Dorans (1990) examined how different equating methods performed across samples selected in different ways. Recent population…
Descriptors: Test Format, Equated Scores, Sampling, Evaluation Methods
Peer reviewed
Sun, Koun-Tem; Chen, Yu-Jen; Tsai, Shu-Yen; Cheng, Chien-Fen – Applied Measurement in Education, 2008
In educational measurement, the construction of parallel test forms is often a combinatorial optimization problem that involves the time-consuming selection of items so that the assembled forms have approximately the same test information functions (TIFs) while satisfying the same constraints. This article proposes a novel method, a genetic algorithm (GA), to construct parallel…
Descriptors: Test Format, Measurement Techniques, Equations (Mathematics), Item Response Theory
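A deliberately tiny GA of the kind described can be sketched: evolve a 20-item form whose TIF tracks a reference form's TIF over a theta grid. The item bank, operators, and fitness below are assumptions made for the sketch, not the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(1)

# Made-up 2PL item bank; the first FORM_LEN items serve as the reference form.
N_BANK, FORM_LEN = 60, 20
a = rng.uniform(0.6, 1.8, N_BANK)
b = rng.normal(0.0, 1.0, N_BANK)
grid = np.linspace(-3.0, 3.0, 13)

def tif(idx):
    """Test information function of the chosen items on the theta grid."""
    p = 1.0 / (1.0 + np.exp(-a[idx] * (grid[:, None] - b[idx])))
    return (a[idx] ** 2 * p * (1.0 - p)).sum(axis=1)

target = tif(np.arange(FORM_LEN))      # the reference form's TIF
bank = np.arange(FORM_LEN, N_BANK)     # items available for the new form

def fitness(idx):
    return -np.abs(tif(idx) - target).max()  # minimize the worst TIF gap

# Elitism + set-union crossover + single-item swap mutation.
pop = [rng.choice(bank, FORM_LEN, replace=False) for _ in range(40)]
for _ in range(200):
    pop.sort(key=fitness, reverse=True)
    pop = pop[:10]                                    # keep the ten fittest
    while len(pop) < 40:
        i, j = rng.choice(10, 2, replace=False)
        child = rng.choice(np.union1d(pop[i], pop[j]), FORM_LEN, replace=False)
        if rng.random() < 0.3:                        # swap in a bank item
            child[rng.integers(FORM_LEN)] = rng.choice(np.setdiff1d(bank, child))
        pop.append(child)
best = max(pop, key=fitness)
print(sorted(best.tolist()), round(-fitness(best), 4))
```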
Peer reviewed
Kamata, Akihito; Tate, Richard – Journal of Educational Measurement, 2005
The goal of this study was to develop a procedure for predicting the equating error associated with the long-term equating method of Tate (2003) for mixed-format tests. An expression for the error of an equating based on multiple links, in terms of the errors of the component links, was derived and illustrated with simulated data.…
Descriptors: Computer Simulation, Item Response Theory, Test Format, Evaluation Methods
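The standard building block for such a derivation is that approximately independent component-link errors combine in quadrature; a made-up numerical illustration:

```python
import math

# Hypothetical standard errors of equating for three component links
# in a chain A -> B -> C -> D; assuming approximate independence, the
# chained equating's standard error combines them in quadrature.
link_ses = [0.12, 0.09, 0.15]
se_chain = math.sqrt(sum(se ** 2 for se in link_ses))
print(round(se_chain, 3))  # ~0.212
```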
Peer reviewed
Wang, Wen-Chung; Wilson, Mark – Educational and Psychological Measurement, 2005
This study presents a procedure for detecting differential item functioning (DIF) for dichotomous and polytomous items in testlet-based tests, whereby DIF is taken into account by adding DIF parameters into the Rasch testlet model. Simulations were conducted to assess recovery of the DIF and other parameters. Two independent variables, test type…
Descriptors: Test Format, Test Bias, Item Response Theory, Item Analysis
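The core move, adding a group-specific difficulty shift inside the Rasch testlet model, can be sketched directly; parameter names and values below are illustrative only.

```python
import math

def p_correct(theta, gamma, b, delta, group):
    """Rasch testlet model with an added DIF parameter: the item is
    delta logits harder for the focal group (group = 1).
    theta: ability; gamma: person-specific testlet effect; b: difficulty."""
    return 1.0 / (1.0 + math.exp(-(theta + gamma - b - delta * group)))

# Same person and item, 0.4 logits of uniform DIF against the focal group.
print(p_correct(0.5, 0.2, 0.0, 0.4, group=0))  # reference group
print(p_correct(0.5, 0.2, 0.0, 0.4, group=1))  # focal group
```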