ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	17

Descriptor

Evaluation Methods	21
Item Response Theory	21
Sampling	21
Equated Scores	8
Computation	6
College Entrance Examinations	5
Sample Size	5
Comparative Analysis	4
Data Analysis	4
Research Methodology	4
Achievement Tests	3
Bayesian Statistics	3
Correlation	3
Cutting Scores	3
Error of Measurement	3
Gender Differences	3
High School Students	3
Item Analysis	3
Measurement	3
Models	3
Monte Carlo Methods	3
Science Achievement	3
Simulation	3
Standardized Tests	3
Statistical Analysis	3
More ▼

Source

Applied Psychological…	4
Applied Measurement in…	3
Educational Measurement:…	2
Educational and Psychological…	2
College Board	1
Grantee Submission	1
Journal of Educational…	1
Journal of Educational and…	1
Measurement:…	1
Ministerial Council on…	1
ProQuest LLC	1
Psicologica: International…	1
Psychological Review	1
More ▼

Publication Type

Journal Articles	16
Reports - Research	11
Reports - Evaluative	4
Reports - Descriptive	3
Information Analyses	2
Numerical/Quantitative Data	2
Dissertations/Theses -…	1
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Higher Education	3
Elementary Education	1
Elementary Secondary Education	1
Grade 6	1
High Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Australia

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	3
National Merit Scholarship…	1
Preliminary Scholastic…	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

A Note on Standard Errors for Multidimensional Two-Parameter Logistic Models Using Gaussian Variational Estimation

Peer reviewed

Direct link

Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…

Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis

Computational Strategies and Estimation Performance with Bayesian Semiparametric Item Response Theory Models

Peer reviewed

Direct link

Paganin, Sally; Paciorek, Christopher J.; Wehrhahn, Claudia; Rodríguez, Abel; Rabe-Hesketh, Sophia; de Valpine, Perry – Journal of Educational and Behavioral Statistics, 2023

Item response theory (IRT) models typically rely on a normality assumption for subject-specific latent traits, which is often unrealistic in practice. Semiparametric extensions based on Dirichlet process mixtures (DPMs) offer a more flexible representation of the unknown distribution of the latent trait. However, the use of such models in the IRT…

Descriptors: Bayesian Statistics, Item Response Theory, Guidance, Evaluation Methods

A New Bayesian Person-Fit Analysis Method Using Pivotal Discrepancy Measures

Peer reviewed

Direct link

Combs, Adam – Journal of Educational Measurement, 2023

A common method of checking person-fit in Bayesian item response theory (IRT) is the posterior-predictive (PP) method. In recent years, more powerful approaches have been proposed that are based on resampling methods using the popular L*[subscript z] statistic. There has also been proposed a new Bayesian model checking method based on pivotal…

Descriptors: Bayesian Statistics, Goodness of Fit, Evaluation Methods, Monte Carlo Methods

Applying a Mixture Rasch Model-Based Approach to Standard Setting

Peer reviewed

Direct link

Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023

The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…

Descriptors: Item Response Theory, Standard Setting, Testing, Sampling

An Improved Estimation Using Polya-Gamma Augmentation for Bayesian Structural Equation Models with Dichotomous Variables

Peer reviewed

Direct link

Kim, Seohyun; Lu, Zhenqiu; Cohen, Allan S. – Measurement: Interdisciplinary Research and Perspectives, 2018

Bayesian algorithms have been used successfully in the social and behavioral sciences to analyze dichotomous data particularly with complex structural equation models. In this study, we investigate the use of the Polya-Gamma data augmentation method with Gibbs sampling to improve estimation of structural equation models with dichotomous variables.…

Descriptors: Bayesian Statistics, Structural Equation Models, Computation, Social Science Research

Evaluating Equity at the Local Level Using Bootstrap Tests. Research Report 2016-4

Download full text

Kim, YoungKoung; DeCarlo, Lawrence T. – College Board, 2016

Because of concerns about test security, different test forms are typically used across different testing occasions. As a result, equating is necessary in order to get scores from the different test forms that can be used interchangeably. In order to assure the quality of equating, multiple equating methods are often examined. Various equity…

Descriptors: Equated Scores, Evaluation Methods, Sampling, Statistical Inference

Detection of Q-Matrix Misspecification Using Two Criteria for Validation of Cognitive Structures under the Least Squares Distance Model

Peer reviewed
PDF on ERIC

Download full text

Romero, Sonia J.; Ordoñez, Xavier G.; Ponsoda, Vincente; Revuelta, Javier – Psicologica: International Journal of Methodology and Experimental Psychology, 2014

Cognitive Diagnostic Models (CDMs) aim to provide information about the degree to which individuals have mastered specific attributes that underlie the success of these individuals on test items. The Q-matrix is a key element in the application of CDMs, because contains links item-attributes representing the cognitive structure proposed for solve…

Descriptors: Evaluation Methods, Q Methodology, Matrices, Sampling

Parsimonious Testing of Transitive or Intransitive Preferences: Reply to Birnbaum (2011)

Peer reviewed

Direct link

Regenwetter, Michel; Dana, Jason; Davis-Stober, Clintin P.; Guo, Ying – Psychological Review, 2011

Birnbaum raised important challenges to testing transitivity. We summarize why an approach based on counting response patterns does not solve these challenges. Foremost, we show why parsimonious tests of transitivity require at least 5 choice alternatives. While the approach of Regenwetter, Dana, and Davis-Stober achieves high power with modest…

Descriptors: Testing, Item Response Theory, Responses, Evaluation Methods

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Standard Errors and Confidence Intervals from Bootstrapping for Ramsay-Curve Item Response Theory Model Item Parameters

Peer reviewed

Direct link

Gu, Fei; Skorupski, William P.; Hoyle, Larry; Kingston, Neal M. – Applied Psychological Measurement, 2011

Ramsay-curve item response theory (RC-IRT) is a nonparametric procedure that estimates the latent trait using splines, and no distributional assumption about the latent trait is required. For item parameters of the two-parameter logistic (2-PL), three-parameter logistic (3-PL), and polytomous IRT models, RC-IRT can provide more accurate estimates…

Descriptors: Intervals, Item Response Theory, Models, Evaluation Methods

Improving IRT Parameter Estimates with Small Sample Sizes: Evaluating the Efficacy of a New Data Augmentation Technique

Direct link

Foley, Brett Patrick – ProQuest LLC, 2010

The 3PL model is a flexible and widely used tool in assessment. However, it suffers from limitations due to its need for large sample sizes. This study introduces and evaluates the efficacy of a new sample size augmentation technique called Duplicate, Erase, and Replace (DupER) Augmentation through a simulation study. Data are augmented using…

Descriptors: Test Length, Sample Size, Simulation, Item Response Theory

Anchor Test Type and Population Invariance: An Exploration across Subpopulations and Test Administrations

Peer reviewed

Direct link

Dorans, Neil J.; Liu, Jinghua; Hammond, Shelby – Applied Psychological Measurement, 2008

This exploratory study was built on research spanning three decades. Petersen, Marco, and Stewart (1982) conducted a major empirical investigation of the efficacy of different equating methods. The studies reported in Dorans (1990) examined how different equating methods performed across samples selected in different ways. Recent population…

Descriptors: Test Format, Equated Scores, Sampling, Evaluation Methods

Evaluating Item Fit for Multidimensional Item Response Models

Peer reviewed

Direct link

Zhang, Bo; Stone, Clement A. – Educational and Psychological Measurement, 2008

This research examines the utility of the s-x[superscript 2] statistic proposed by Orlando and Thissen (2000) in evaluating item fit for multidimensional item response models. Monte Carlo simulation was conducted to investigate both the Type I error and statistical power of this fit statistic in analyzing two kinds of multidimensional test…

Descriptors: Monte Carlo Methods, Sampling, Goodness of Fit, Evaluation Methods

A Discussion of Population Invariance

Peer reviewed

Direct link

Brennan, Robert L. – Applied Psychological Measurement, 2008

The discussion here covers five articles that are linked in the sense that they all treat population invariance. This discussion of population invariance is a somewhat broader treatment of the subject than simply a discussion of these five articles. In particular, occasional reference is made to publications other than those in this issue. The…

Descriptors: Advanced Placement, Law Schools, Science Achievement, Achievement Tests

An NCME Instructional Module on Estimating Item Response Theory Models Using Markov Chain Monte Carlo Methods

Peer reviewed

Direct link

Kim, Jee-Seon; Bolt, Daniel M. – Educational Measurement: Issues and Practice, 2007

The purpose of this ITEMS module is to provide an introduction to Markov chain Monte Carlo (MCMC) estimation for item response models. A brief description of Bayesian inference is followed by an overview of the various facets of MCMC algorithms, including discussion of prior specification, sampling procedures, and methods for evaluating chain…

Descriptors: Placement, Monte Carlo Methods, Markov Processes, Measurement

Previous Page | Next Page »

Pages: 1 | 2

Dorans, Neil J.	2
Bolt, Daniel M.	1
Brennan, Robert L.	1
Chun Wang	1
Cohen, Allan S.	1
Combs, Adam	1
Dana, Jason	1
Davis-Stober, Clintin P.	1
DeCarlo, Lawrence T.	1
Donovan, Jenny	1
Foley, Brett Patrick	1
Geisinger, Kurt F.	1
Gongjun Xu	1
Gu, Fei	1
Guo, Ying	1
Hammond, Shelby	1
Hoyle, Larry	1
Hutton, Penny	1
Jiaying Xiao	1
Kim, Jee-Seon	1
Kim, Seohyun	1
Kim, YoungKoung	1
Kingston, Neal M.	1
Lennon, Melissa	1
More ▼