ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	18

Descriptor

Models	24
Test Items	24
Item Response Theory	18
Scaling	15
Multidimensional Scaling	9
Difficulty Level	6
Test Construction	6
Computation	5
Scores	5
Simulation	5
Foreign Countries	4
Goodness of Fit	4
Measurement	4
Comparative Analysis	3
Correlation	3
Data Analysis	3
Error of Measurement	3
Factor Analysis	3
Item Analysis	3
Mathematics	3
Measurement Techniques	3
Regression (Statistics)	3
Scoring	3
Achievement Tests	2
Classification	2
More ▼

Source

Applied Psychological…	4
Educational and Psychological…	3
ProQuest LLC	3
Journal of Educational…	2
Educational Measurement:…	1
Educational Technology…	1
European Educational Research…	1
International Journal of…	1
Journal of Early Intervention	1
Large-scale Assessments in…	1
Measurement and Evaluation in…	1
Measurement:…	1
Multivariate Behavioral…	1
National Center for Research…	1
Scandinavian Journal of…	1
More ▼

Publication Type

Journal Articles	19
Reports - Research	10
Reports - Evaluative	8
Dissertations/Theses -…	3
Reports - Descriptive	3
Information Analyses	1
Speeches/Meeting Papers	1

Education Level

Secondary Education	3
Postsecondary Education	2
High Schools	1
Higher Education	1

Audience

Researchers

Location

Germany	2
Arizona	1
Europe	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

Detecting Multidimensional DIF in Polytomous Items with IRT Methods and Estimation Approaches

Peer reviewed

Direct link

Güler Yavuz Temel – Journal of Educational Measurement, 2024

The purpose of this study was to investigate multidimensional DIF with a simple and nonsimple structure in the context of multidimensional Graded Response Model (MGRM). This study examined and compared the performance of the IRT-LR and Wald test using MML-EM and MHRM estimation approaches with different test factors and test structures in…

Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models

Statistical Estimation and Inference for Large-Scale Categorical Data

Direct link

Chengcheng Li – ProQuest LLC, 2022

Categorical data become increasingly ubiquitous in the modern big data era. In this dissertation, we propose novel statistical learning and inference methods for large-scale categorical data, focusing on latent variable models and their applications to psychometrics. In psychometric assessments, the subjects' underlying aptitude often cannot be…

Descriptors: Statistical Inference, Data Analysis, Psychometrics, Raw Scores

Comparing Different Trend Estimation Approaches in Country Means and Standard Deviations in International Large-Scale Assessment Studies

Peer reviewed

Direct link

Robitzsch, Alexander; Lüdtke, Oliver – Large-scale Assessments in Education, 2023

One major aim of international large-scale assessments (ILSA) like PISA is to monitor changes in student performance over time. To accomplish this task, a set of common items (i.e., link items) is repeatedly administered in each assessment. Linking methods based on item response theory (IRT) models are used to align the results from the different…

Descriptors: Educational Trends, Trend Analysis, International Assessment, Achievement Tests

Polytomous Rasch Models in Counseling Assessment

Peer reviewed

Direct link

Willse, John T. – Measurement and Evaluation in Counseling and Development, 2017

This article provides a brief introduction to the Rasch model. Motivation for using Rasch analyses is provided. Important Rasch model concepts and key aspects of result interpretation are introduced, with major points reinforced using a simulation demonstration. Concrete guidelines are provided regarding sample size and the evaluation of items.

Descriptors: Item Response Theory, Test Results, Test Interpretation, Simulation

Unidimensional Interpretations for Multidimensional Test Items

Peer reviewed

Direct link

Kahraman, Nilufer – Journal of Educational Measurement, 2013

This article considers potential problems that can arise in estimating a unidimensional item response theory (IRT) model when some test items are multidimensional (i.e., show a complex factorial structure). More specifically, this study examines (1) the consequences of model misfit on IRT item parameter estimates due to unintended minor item-level…

Descriptors: Test Items, Item Response Theory, Computation, Models

Maximum Likelihood Item Easiness Models for Test Theory without an Answer Key

Peer reviewed

Direct link

France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015

Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…

Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory

Modeling Rater Effects and Complex Learning Progressions Using Item Response Models

Direct link

Shin, Hyo Jeong – ProQuest LLC, 2015

This dissertation is comprised of three papers that propose and apply psychometric models to deal with complexities and challenges in large-scale assessments, focusing on modeling rater effects and complex learning progressions. In particular, three papers investigate extensions and applications of multilevel and multidimensional item response…

Descriptors: Item Response Theory, Psychometrics, Models, Measurement

Assessment of Computer and Information Literacy in ICILS 2013: Do Different Item Types Measure the Same Construct?

Peer reviewed

Direct link

Ihme, Jan Marten; Senkbeil, Martin; Goldhammer, Frank; Gerick, Julia – European Educational Research Journal, 2017

The combination of different item formats is found quite often in large scale assessments, and analyses on the dimensionality often indicate multi-dimensionality of tests regarding the task format. In ICILS 2013, three different item types (information-based response tasks, simulation tasks, and authoring tasks) were used to measure computer and…

Descriptors: Foreign Countries, Computer Literacy, Information Literacy, International Assessment

Lord-Wingersky Algorithm Version 2.0 for Hierarchical Item Factor Models with Applications in Test Scoring, Scale Alignment, and Model Fit Testing. CRESST Report 830

Download full text

Cai, Li – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2013

Lord and Wingersky's (1984) recursive algorithm for creating summed score based likelihoods and posteriors has a proven track record in unidimensional item response theory (IRT) applications. Extending the recursive algorithm to handle multidimensionality is relatively simple, especially with fixed quadrature because the recursions can be defined…

Descriptors: Mathematics, Scores, Item Response Theory, Computation

Minimum Sample Size Requirements for Mokken Scale Analysis

Peer reviewed

Direct link

Straat, J. Hendrik; van der Ark, L. Andries; Sijtsma, Klaas – Educational and Psychological Measurement, 2014

An automated item selection procedure in Mokken scale analysis partitions a set of items into one or more Mokken scales, if the data allow. Two algorithms are available that pursue the same goal of selecting Mokken scales of maximum length: Mokken's original automated item selection procedure (AISP) and a genetic algorithm (GA). Minimum…

Descriptors: Sampling, Test Items, Effect Size, Scaling

Profile Analysis: A Closer Look at the PISA 2000 Reading Data

Peer reviewed

Direct link

Verhelst, Norman D. – Scandinavian Journal of Educational Research, 2012

When using IRT models in Educational Achievement Testing, the model is as a rule too simple to catch all the relevant dimensions in the test. It is argued that a simple model may nevertheless be useful but that it can be complemented with additional analyses. Such an analysis, called profile analysis, is proposed and applied to the reading data of…

Descriptors: Multidimensional Scaling, Profiles, Item Response Theory, Achievement Tests

An Application of Explanatory Item Response Modeling for Model-Based Proficiency Scaling

Peer reviewed

Direct link

Hartig, Johannes; Frey, Andreas; Nold, Gunter; Klieme, Eckhard – Educational and Psychological Measurement, 2012

The article compares three different methods to estimate effects of task characteristics and to use these estimates for model-based proficiency scaling: prediction of item difficulties from the Rasch model, the linear logistic test model (LLTM), and an LLTM including random item effects (LLTM+e). The methods are applied to empirical data from a…

Descriptors: Item Response Theory, Models, Methods, Computation

Exploring Unidimensional Proficiency Classification Accuracy from Multidimensional Data in a Vertical Scaling Context

Direct link

Kroopnick, Marc Howard – ProQuest LLC, 2010

When Item Response Theory (IRT) is operationally applied for large scale assessments, unidimensionality is typically assumed. This assumption requires that the test measures a single latent trait. Furthermore, when tests are vertically scaled using IRT, the assumption of unidimensionality would require that the battery of tests across grades…

Descriptors: Simulation, Scaling, Standard Setting, Item Response Theory

Linear Equating for the NEAT Design: Parameter Substitution Models and Chained Linear Relationship Models

Peer reviewed

Direct link

Kane, Michael T.; Mroch, Andrew A.; Suh, Youngsuk; Ripkey, Douglas R. – Measurement: Interdisciplinary Research and Perspectives, 2009

This paper analyzes five linear equating models for the "nonequivalent groups with anchor test" (NEAT) design with internal anchors (i.e., the anchor test is part of the full test). The analysis employs a two-dimensional framework. The first dimension contrasts two general approaches to developing the equating relationship. Under a "parameter…

Descriptors: Scaling, Equated Scores, Methods, Test Items

A Method to Examine Content Domain Structures

Peer reviewed

Direct link

D'Agostino, Jerome; Karpinski, Aryn; Welsh, Megan – International Journal of Testing, 2011

After a test is developed, most content validation analyses shift from ascertaining domain definition to studying domain representation and relevance because the domain is assumed to be set once a test exists. We present an approach that allows for the examination of alternative domain structures based on extant test items. In our example based on…

Descriptors: Expertise, Test Items, Mathematics Tests, Factor Analysis

Previous Page | Next Page »

Pages: 1 | 2

Abdel-fattah, Abdel-fattah A.	1
Batchelder, William H.	1
Cai, Li	1
Chengcheng Li	1
Culpepper, Steven Andrew	1
D'Agostino, Jerome	1
Davey, Tim	1
France, Stephen L.	1
Frey, Andreas	1
Gerick, Julia	1
Gierl, Mark J.	1
Goldhammer, Frank	1
Güler Yavuz Temel	1
Hartig, Johannes	1
Ihme, Jan Marten	1
Kahraman, Nilufer	1
Kane, Michael T.	1
Karpinski, Aryn	1
Klieme, Eckhard	1
Kroopnick, Marc Howard	1
Lüdtke, Oliver	1
Meijer, Rob R.	1
Mroch, Andrew A.	1
Nold, Gunter	1
More ▼