ERIC - Search Results

Showing 1 to 15 of 32 results

A Multiprocess Item Response Model for Not-Reached Items Due to Time Limits and Quitting

Peer reviewed

Ulitzsch, Esther; von Davier, Matthias; Pohl, Steffi – Educational and Psychological Measurement, 2020

So far, modeling approaches for not-reached items have considered one single underlying process. However, missing values at the end of a test can occur for a variety of reasons. On the one hand, examinees may not reach the end of a test due to time limits and lack of working speed. On the other hand, examinees may not attempt all items and quit…

Descriptors: Item Response Theory, Test Items, Response Style (Tests), Computer Assisted Testing

Diagnosing Diagnostic Models: From Von Neumann's Elephant to Model Equivalencies and Network Psychometrics

Peer reviewed

von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2018

This article critically reviews how diagnostic models have been conceptualized and how they compare to other approaches used in educational measurement. In particular, certain assumptions that have been taken for granted and used as defining characteristics of diagnostic models are reviewed and it is questioned whether these assumptions are the…

Descriptors: Criticism, Psychometrics, Diagnostic Tests, Educational Assessment

Developments in Psychometric Population Models for Technology-Based Large-Scale Assessments: An Overview of Challenges and Opportunities

Peer reviewed

von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019

International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…

Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

High-Performance Psychometrics: The Parallel-E Parallel-M Algorithm for Generalized Latent Variable Models. Research Report. ETS RR-16-34

Peer reviewed

von Davier, Matthias – ETS Research Report Series, 2016

This report presents results on a parallel implementation of the expectation-maximization (EM) algorithm for multidimensional latent variable models. The developments presented here are based on code that parallelizes both the E step and the M step of the parallel-E parallel-M algorithm. Examples presented in this report include item response…

Descriptors: Psychometrics, Mathematics, Models, Statistical Analysis

The Log-Linear Cognitive Diagnostic Model (LCDM) as a Special Case of The General Diagnostic Model (GDM). Research Report. ETS RR-14-40

Peer reviewed

von Davier, Matthias – ETS Research Report Series, 2014

Diagnostic models combine multiple binary latent variables in an attempt to produce a latent structure that provides more information about test takers' performance than do unidimensional latent variable models. Recent developments in diagnostic modeling emphasize the possibility that multiple skills may interact in a conjunctive way within the…

Descriptors: Models, Equations (Mathematics), Measurement Techniques, Item Response Theory

A Third-Order Item Response Theory Model for Modeling the Effects of Domains and Subdomains in Large-Scale Educational Assessment Surveys

Peer reviewed

Rijmen, Frank; Jeon, Minjeong; von Davier, Matthias; Rabe-Hesketh, Sophia – Journal of Educational and Behavioral Statistics, 2014

Second-order item response theory models have been used for assessments consisting of several domains, such as content areas. We extend the second-order model to a third-order model for assessments that include subdomains nested in domains. Using a graphical model framework, it is shown how the model does not suffer from the curse of…

Descriptors: Item Response Theory, Models, Educational Assessment, Computation

Statistical Models and Inference for the True Equating Transformation in the Context of Local Equating

Peer reviewed

González, B. Jorge; von Davier, Matthias – Journal of Educational Measurement, 2013

Based on Lord's criterion of equity of equating, van der Linden (this issue) revisits the so-called local equating method and offers alternative as well as new thoughts on several topics including the types of transformations, symmetry, reliability, and population invariance appropriate for equating. A remarkable aspect is to define equating…

Descriptors: Equated Scores, Statistical Analysis, Models, Statistical Inference

Local Equating Using the Rasch Model, the OPLM, and the 2PL IRT Model--or--What Is It Anyway if the Model Captures Everything There Is to Know about the Test Takers?

Peer reviewed

von Davier, Matthias; González B., Jorge; von Davier, Alina A. – Journal of Educational Measurement, 2013

Local equating (LE) is based on Lord's criterion of equity. It defines a family of true transformations that aim at the ideal of equitable equating. van der Linden (this issue) offers a detailed discussion of common issues in observed-score equating relative to this local approach. By assuming an underlying item response theory model, one of…

Descriptors: Equated Scores, Transformations (Mathematics), Item Response Theory, Raw Scores

Item Response Theory. Research Report. ETS RR-13-28. ETS R&D Scientific and Policy Contributions Series. ETS SPC-13-05

Peer reviewed

Carlson, James E.; von Davier, Matthias – ETS Research Report Series, 2013

Few would doubt that ETS researchers have contributed more to the general topic of item response theory (IRT) than individuals from any other institution. In this report, we briefly review most of those contributions, dividing them into sections by decades of publication, beginning with early work by Fred Lord and Bert Green in the 1950s and…

Descriptors: Item Response Theory, Educational Research, Measurement Techniques, Psychometrics

Equivalency of the DINA Model and a Constrained General Diagnostic Model. Research Report. ETS RR-11-37

von Davier, Matthias – Educational Testing Service, 2011

This report shows that the deterministic-input noisy-AND (DINA) model is a special case of more general compensatory diagnostic models by means of a reparameterization of the skill space and the design (Q-) matrix of item by skills associations. This reparameterization produces a compensatory model that is equivalent to the (conjunctive) DINA…

Descriptors: Clinical Diagnosis, Classification, Models, Matrices

An Alternative Way to Model Population Ability Distributions in Large-Scale Educational Surveys

Peer reviewed

Wetzel, Eunike; Xu, Xueli; von Davier, Matthias – Educational and Psychological Measurement, 2015

In large-scale educational surveys, a latent regression model is used to compensate for the shortage of cognitive information. Conventionally, the covariates in the latent regression model are principal components extracted from background data. This operational method has several important disadvantages, such as the handling of missing data and…

Descriptors: Surveys, Regression (Statistics), Models, Research Methodology

Variance Estimation for NAEP Data Using a Resampling-Based Approach: An Application of Cognitive Diagnostic Models. Research Report. ETS RR-10-26

Hsieh, Chueh-an; Xu, Xueli; von Davier, Matthias – Educational Testing Service, 2010

This paper presents an application of a jackknifing approach to variance estimation of ability inferences for groups of students, using a multidimensional discrete model for item response data. The data utilized to demonstrate the approach come from the National Assessment of Educational Progress (NAEP). In contrast to the operational approach…

Descriptors: National Competency Tests, Reading Tests, Grade 4, Computation

Measuring Growth in a Longitudinal Large-Scale Assessment with a General Latent Variable Model

Peer reviewed

von Davier, Matthias; Xu, Xueli; Carstensen, Claus H. – Psychometrika, 2011

The aim of the research presented here is the use of extensions of longitudinal item response theory (IRT) models in the analysis and comparison of group-specific growth in large-scale assessments of educational outcomes. A general discrete latent variable model was used to specify and compare two types of multidimensional item-response-theory…

Descriptors: Educational Objectives, Outcomes of Education, Measures (Individuals), Item Response Theory

Is There Need for the 3PL Model? Guess What?

Peer reviewed

von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2009

If questioned about their beliefs, psychometricians in one camp would argue the firm conviction that the Rasch model is mathematically elegant and intuitive as well as plausible for practitioners, pointing out the advantages of a simple model that "counts" every item in the same way. Psychometricians of another camp would argue that the three…

Descriptors: Item Response Theory, Models, Guessing (Tests), Probability
