ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	16

Descriptor

Comparative Analysis	21
Item Response Theory	21
Sampling	20
Test Items	11
Computation	7
Sample Size	6
Difficulty Level	5
Equated Scores	5
Error of Measurement	5
Simulation	5
College Entrance Examinations	4
Equations (Mathematics)	4
Evaluation Methods	4
Statistical Inference	4
Ability	3
Accuracy	3
Foreign Countries	3
High School Students	3
Item Analysis	3
Item Banks	3
Models	3
Monte Carlo Methods	3
Statistical Analysis	3
Classification	2
Efficiency	2
More ▼

Source

ETS Research Report Series	3
ProQuest LLC	3
Applied Psychological…	2
Educational and Psychological…	2
International Journal of…	2
ACT, Inc.	1
College Board	1
Eurasian Journal of…	1
Grantee Submission	1
Ministerial Council on…	1
Research Papers in Education	1
Sociological Methods &…	1
More ▼

Publication Type

Journal Articles	12
Reports - Research	12
Reports - Evaluative	4
Dissertations/Theses -…	3
Numerical/Quantitative Data	3
Reports - Descriptive	2
Speeches/Meeting Papers	1

Education Level

Elementary Education	3
Secondary Education	3
High Schools	2
Higher Education	2
Postsecondary Education	2
Elementary Secondary Education	1
Grade 11	1
Grade 12	1
Grade 6	1
Grade 8	1
Junior High Schools	1
Middle Schools	1
More ▼

Audience

Location

Australia	1
Turkey	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	2
National Assessment of…	1
National Merit Scholarship…	1
Preliminary Scholastic…	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

A Note on Standard Errors for Multidimensional Two-Parameter Logistic Models Using Gaussian Variational Estimation

Peer reviewed

Direct link

Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…

Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis

An Efficient Alternative Mixed Randomized Response Procedure

Peer reviewed

Direct link

Singh, Housila P.; Tarray, Tanveer A. – Sociological Methods & Research, 2015

In this article, we have suggested a new modified mixed randomized response (RR) model and studied its properties. It is shown that the proposed mixed RR model is always more efficient than the Kim and Warde's mixed RR model. The proposed mixed RR model has also been extended to stratified sampling. Numerical illustrations and graphical…

Descriptors: Item Response Theory, Models, Efficiency, Comparative Analysis

An Algorithm to Improve Test Answer Copying Detection Using the Omega Statistic

Peer reviewed

Direct link

Maeda, Hotaka; Zhang, Bo – International Journal of Testing, 2017

The omega (?) statistic is reputed to be one of the best indices for detecting answer copying on multiple choice tests, but its performance relies on the accurate estimation of copier ability, which is challenging because responses from the copiers may have been contaminated. We propose an algorithm that aims to identify and delete the suspected…

Descriptors: Cheating, Test Items, Mathematics, Statistics

Evaluating Equity at the Local Level Using Bootstrap Tests. Research Report 2016-4

Download full text

Kim, YoungKoung; DeCarlo, Lawrence T. – College Board, 2016

Because of concerns about test security, different test forms are typically used across different testing occasions. As a result, equating is necessary in order to get scores from the different test forms that can be used interchangeably. In order to assure the quality of equating, multiple equating methods are often examined. Various equity…

Descriptors: Equated Scores, Evaluation Methods, Sampling, Statistical Inference

Multiple-Group Noncompensatory Differential Item Functioning in Raju's Differential Functioning of Items and Tests

Peer reviewed

Direct link

Oshima, T. C.; Wright, Keith; White, Nick – International Journal of Testing, 2015

Raju, van der Linden, and Fleer (1995) introduced a framework for differential functioning of items and tests (DFIT) for unidimensional dichotomous models. Since then, DFIT has been shown to be a quite versatile framework as it can handle polytomous as well as multidimensional models both at the item and test levels. However, DFIT is still limited…

Descriptors: Test Bias, Item Response Theory, Test Items, Simulation

Bootstrap Standard Errors for Maximum Likelihood Ability Estimates When Item Parameters Are Unknown

Peer reviewed

Direct link

Patton, Jeffrey M.; Cheng, Ying; Yuan, Ke-Hai; Diao, Qi – Educational and Psychological Measurement, 2014

When item parameter estimates are used to estimate the ability parameter in item response models, the standard error (SE) of the ability estimate must be corrected to reflect the error carried over from item calibration. For maximum likelihood (ML) ability estimates, a corrected asymptotic SE is available, but it requires a long test and the…

Descriptors: Sampling, Statistical Inference, Maximum Likelihood Statistics, Computation

Accuracy and Variability of Item Parameter Estimates from Marginal Maximum a Posteriori Estimation and Bayesian Inference via Gibbs Samplers

Direct link

Wu, Yi-Fang – ProQuest LLC, 2015

Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…

Descriptors: Item Response Theory, Test Items, Accuracy, Computation

A Comparison of Four Linear Equating Methods for the Common-Item Nonequivalent Groups Design Using Simulation Methods. ACT Research Report Series, 2013 (2)

Download full text

Topczewski, Anna; Cui, Zhongmin; Woodruff, David; Chen, Hanwei; Fang, Yu – ACT, Inc., 2013

This paper investigates four methods of linear equating under the common item nonequivalent groups design. Three of the methods are well known: Tucker, Angoff-Levine, and Congeneric-Levine. A fourth method is presented as a variant of the Congeneric-Levine method. Using simulation data generated from the three-parameter logistic IRT model we…

Descriptors: Comparative Analysis, Equated Scores, Methods, Simulation

Item Selection for the Development of Parallel Forms from an IRT-Based Seed Test Using a Sampling and Classification Approach

Peer reviewed

Direct link

Chen, Pei-Hua; Chang, Hua-Hua; Wu, Haiyan – Educational and Psychological Measurement, 2012

Two sampling-and-classification-based procedures were developed for automated test assembly: the Cell Only and the Cell and Cube methods. A simulation study based on a 540-item bank was conducted to compare the performance of the procedures with the performance of a mixed-integer programming (MIP) method for assembling multiple parallel test…

Descriptors: Test Items, Selection, Test Construction, Item Response Theory

Weighting Test Samples in IRT Linking and Equating: Toward an Improved Sampling Design for Complex Equating. Research Report. ETS RR-13-39

Peer reviewed
PDF on ERIC

Download full text

Qian, Jiahe; Jiang, Yanming; von Davier, Alina A. – ETS Research Report Series, 2013

Several factors could cause variability in item response theory (IRT) linking and equating procedures, such as the variability across examinee samples and/or test items, seasonality, regional differences, native language diversity, gender, and other demographic variables. Hence, the following question arises: Is it possible to select optimal…

Descriptors: Item Response Theory, Test Items, Sampling, True Scores

Ability Level Estimation of Students on Probability Unit via Computerized Adaptive Testing

Peer reviewed
PDF on ERIC

Download full text

Özyurt, Hacer; Özyurt, Özcan – Eurasian Journal of Educational Research, 2015

Problem Statement: Learning-teaching activities bring along the need to determine whether they achieve their goals. Thus, multiple choice tests addressing the same set of questions to all are frequently used. However, this traditional assessment and evaluation form contrasts with modern education, where individual learning characteristics are…

Descriptors: Probability, Adaptive Testing, Computer Assisted Testing, Item Response Theory

An Investigation of Measurement Invariance of the Key Stage 2 National Curriculum Science Sampling Test in England

Peer reviewed

Direct link

He, Qingping; Anwyll, Steve; Glanville, Matthew; Opposs, Dennis – Research Papers in Education, 2014

Since 2010, the whole national cohort Key Stage 2 (KS2) National Curriculum test in science in England has been replaced with a sampling test taken by pupils at the age of 11 from a nationally representative sample of schools annually. The study reported in this paper compares the performance of different subgroups of the samples (classified by…

Descriptors: National Curriculum, Sampling, Foreign Countries, Factor Analysis

Conditions Affecting the Accuracy of Classical Equating Methods for Small Samples under the NEAT Design: A Simulation Study

Direct link

Sunnassee, Devdass – ProQuest LLC, 2011

Small sample equating remains a largely unexplored area of research. This study attempts to fill in some of the research gaps via a large-scale, IRT-based simulation study that evaluates the performance of seven small-sample equating methods under various test characteristic and sampling conditions. The equating methods considered are typically…

Descriptors: Test Length, Test Format, Sample Size, Simulation

A Comparison of Kernel Equating and Traditional Equipercentile Equating Methods and the Parametric Bootstrap Methods for Estimating Standard Errors in Equipercentile Equating

Direct link

Choi, Sae Il – ProQuest LLC, 2009

This study used simulation (a) to compare the kernel equating method to traditional equipercentile equating methods under the equivalent-groups (EG) design and the nonequivalent-groups with anchor test (NEAT) design and (b) to apply the parametric bootstrap method for estimating standard errors of equating. A two-parameter logistic item response…

Descriptors: Item Response Theory, Comparative Analysis, Sampling, Statistical Inference

DIF Detection with Small Samples: Applying Smoothing Techniques to Frequency Distributions in the Mantel-Haenszel Procedure. Research Report. ETS RR-08-44

Peer reviewed
PDF on ERIC

Download full text

Yu, Lei; Moses, Tim; Puhan, Gautam; Dorans, Neil – ETS Research Report Series, 2008

All differential item functioning (DIF) methods require at least a moderate sample size for effective DIF detection. Samples that are less than 200 pose a challenge for DIF analysis. Smoothing can improve upon the estimation of the population distribution by preserving major features of an observed frequency distribution while eliminating the…

Descriptors: Test Bias, Item Response Theory, Sample Size, Evaluation Criteria

Previous Page | Next Page »

Pages: 1 | 2

Anwyll, Steve	1
Berger, Martjin P. F.	1
Chang, Hua-Hua	1
Chen, Hanwei	1
Chen, Pei-Hua	1
Cheng, Ying	1
Choi, Sae Il	1
Chun Wang	1
Cui, Zhongmin	1
DeCarlo, Lawrence T.	1
Diao, Qi	1
Donovan, Jenny	1
Dorans, Neil	1
Dorans, Neil J.	1
Fang, Yu	1
Geisinger, Kurt F.	1
Glanville, Matthew	1
Gongjun Xu	1
He, Qingping	1
Hutton, Penny	1
Jenkins, Frank	1
Jiang, Yanming	1
Jiaying Xiao	1
Johnson, Matthew S.	1
Kim, YoungKoung	1
More ▼