ERIC - Search Results

Showing all 10 results

An Algorithm to Improve Test Answer Copying Detection Using the Omega Statistic

Peer reviewed

Maeda, Hotaka; Zhang, Bo – International Journal of Testing, 2017

The omega (ω) statistic is reputed to be one of the best indices for detecting answer copying on multiple choice tests, but its performance relies on the accurate estimation of copier ability, which is challenging because responses from the copiers may have been contaminated. We propose an algorithm that aims to identify and delete the suspected…

Descriptors: Cheating, Test Items, Mathematics, Statistics

Psychometrics in Support of a Valid Assessment of Linguistic Minorities: Implications for the Test and Sampling Designs

Peer reviewed

Oliveri, María Elena; von Davier, Alina A. – International Journal of Testing, 2016

In this study, we propose that the unique needs and characteristics of linguistic minorities should be considered throughout the test development process. Unlike most measurement invariance investigations in the assessment of linguistic minorities, which typically are conducted after test administration, we propose strategies that focus on the…

Descriptors: Psychometrics, Linguistics, Test Construction, Testing

Multiple-Group Noncompensatory Differential Item Functioning in Raju's Differential Functioning of Items and Tests

Peer reviewed

Oshima, T. C.; Wright, Keith; White, Nick – International Journal of Testing, 2015

Raju, van der Linden, and Fleer (1995) introduced a framework for differential functioning of items and tests (DFIT) for unidimensional dichotomous models. Since then, DFIT has been shown to be a quite versatile framework as it can handle polytomous as well as multidimensional models both at the item and test levels. However, DFIT is still limited…

Descriptors: Test Bias, Item Response Theory, Test Items, Simulation

Grain Size and Parameter Recovery with TIMSS and the General Diagnostic Model

Peer reviewed

Skaggs, Gary; Wilkins, Jesse L. M.; Hein, Serge F. – International Journal of Testing, 2016

The purpose of this study was to explore the degree of grain size of the attributes and the sample sizes that can support accurate parameter recovery with the General Diagnostic Model (GDM) for a large-scale international assessment. In this resampling study, bootstrap samples were obtained from the 2003 Grade 8 TIMSS in Mathematics at varying…

Descriptors: Achievement Tests, Foreign Countries, Elementary Secondary Education, Science Achievement

Review of Sample Size for Structural Equation Models in Second Language Testing and Learning Research: A Monte Carlo Approach

Peer reviewed

In'nami, Yo; Koizumi, Rie – International Journal of Testing, 2013

The importance of sample size, although widely discussed in the literature on structural equation modeling (SEM), has not been widely recognized among applied SEM researchers. To narrow this gap, we focus on second language testing and learning studies and examine the following: (a) Is the sample size sufficient in terms of precision and power of…

Descriptors: Structural Equation Models, Sample Size, Second Language Instruction, Monte Carlo Methods

Observed-Score Equating with a Heterogeneous Target Population

Peer reviewed

Duong, Minh Q.; von Davier, Alina A. – International Journal of Testing, 2012

Test equating is a statistical procedure for adjusting for test form differences in difficulty in a standardized assessment. Equating results are supposed to hold for a specified target population (Kolen & Brennan, 2004; von Davier, Holland, & Thayer, 2004) and to be (relatively) independent of the subpopulations from the target population (see…

Descriptors: Ability Grouping, Difficulty Level, Psychometrics, Statistical Analysis

Impact of Inclusion or Exclusion of Repeaters on Test Equating

Peer reviewed

Puhan, Gautam – International Journal of Testing, 2011

This study examined the effect of including or excluding repeaters on the equating process and results. New forms of two tests were equated to their respective old forms using either all examinees or only the first timer examinees in the new form sample. Results showed that for both tests used in this study, including or excluding repeaters in the…

Descriptors: Equated Scores, Educational Testing, Student Evaluation, Sample Size

Evaluating the Invariance of Cognitive Profile Patterns Derived from Profile Analysis via Multidimensional Scaling (PAMS): A Bootstrapping Approach

Peer reviewed

Kim, Se-Kang – International Journal of Testing, 2010

The aim of the current study is to validate the invariance of major profile patterns derived from multidimensional scaling (MDS) by bootstrapping. Profile Analysis via Multidimensional Scaling (PAMS) was employed to obtain profiles and bootstrapping was used to construct the sampling distributions of the profile coordinates and the empirical…

Descriptors: Intervals, Multidimensional Scaling, Profiles, Evaluation

Estimation of Generalizability Coefficients via a Structural Equation Modeling Approach to Scale Reliability Evaluation

Peer reviewed

Raykov, Tenko; Marcoulides, George A. – International Journal of Testing, 2006

A structural equation modeling approach to scale reliability evaluation can be employed to estimate generalizability theory indexes in settings where sampling of subjects and conditions is carried out. In one- and two-facet crossed designs, it is demonstrated how this method can be used to obtain estimates of relative generalizability…

Descriptors: Computation, Generalizability Theory, Structural Equation Models, Reliability

Considerations for Creating Multi-Language Personality Norms: A Three-Component Model of Error

Peer reviewed

Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008

With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…

Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
