Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 29 |
Descriptor
Models | 32 |
Item Response Theory | 23 |
Comparative Analysis | 10 |
Statistical Analysis | 10 |
Computation | 9 |
National Competency Tests | 9 |
Classification | 8 |
Test Items | 8 |
Reading Tests | 7 |
Grade 8 | 6 |
Mathematics Tests | 6 |
More ▼ |
Source
ETS Research Report Series | 14 |
Educational Testing Service | 5 |
Measurement:… | 4 |
Educational and Psychological… | 3 |
Journal of Educational and… | 3 |
Journal of Educational… | 2 |
Psychometrika | 1 |
Author
von Davier, Matthias | 32 |
Xu, Xueli | 8 |
Sinharay, Sandip | 4 |
Carstensen, Claus H. | 2 |
Khorramdel, Lale | 2 |
von Davier, Alina A. | 2 |
Carlson, James E. | 1 |
Chen, Haiwen | 1 |
DiBello, Lou | 1 |
González B., Jorge | 1 |
González, B. Jorge | 1 |
More ▼ |
Publication Type
Journal Articles | 27 |
Reports - Research | 25 |
Reports - Evaluative | 3 |
Opinion Papers | 2 |
Reports - Descriptive | 2 |
Information Analyses | 1 |
Education Level
Elementary Education | 7 |
Secondary Education | 7 |
Grade 8 | 6 |
Grade 4 | 5 |
Junior High Schools | 5 |
Middle Schools | 5 |
Intermediate Grades | 3 |
Elementary Secondary Education | 2 |
Grade 12 | 2 |
High Schools | 2 |
Grade 10 | 1 |
More ▼ |
Audience
Location
Bermuda | 1 |
Canada | 1 |
Germany | 1 |
Italy | 1 |
Norway | 1 |
Switzerland | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 8 |
Trends in International… | 3 |
Program for International… | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Ulitzsch, Esther; von Davier, Matthias; Pohl, Steffi – Educational and Psychological Measurement, 2020
So far, modeling approaches for not-reached items have considered one single underlying process. However, missing values at the end of a test can occur for a variety of reasons. On the one hand, examinees may not reach the end of a test due to time limits and lack of working speed. On the other hand, examinees may not attempt all items and quit…
Descriptors: Item Response Theory, Test Items, Response Style (Tests), Computer Assisted Testing
von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2018
This article critically reviews how diagnostic models have been conceptualized and how they compare to other approaches used in educational measurement. In particular, certain assumptions that have been taken for granted and used as defining characteristics of diagnostic models are reviewed and it is questioned whether these assumptions are the…
Descriptors: Criticism, Psychometrics, Diagnostic Tests, Educational Assessment
von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019
International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…
Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education
von Davier, Matthias – ETS Research Report Series, 2016
This report presents results on a parallel implementation of the expectation-maximization (EM) algorithm for multidimensional latent variable models. The developments presented here are based on code that parallelizes both the E step and the M step of the parallel-E parallel-M algorithm. Examples presented in this report include item response…
Descriptors: Psychometrics, Mathematics, Models, Statistical Analysis
von Davier, Matthias – ETS Research Report Series, 2014
Diagnostic models combine multiple binary latent variables in an attempt to produce a latent structure that provides more information about test takers' performance than do unidimensional latent variable models. Recent developments in diagnostic modeling emphasize the possibility that multiple skills may interact in a conjunctive way within the…
Descriptors: Models, Equations (Mathematics), Measurement Techniques, Item Response Theory
Rijmen, Frank; Jeon, Minjeong; von Davier, Matthias; Rabe-Hesketh, Sophia – Journal of Educational and Behavioral Statistics, 2014
Second-order item response theory models have been used for assessments consisting of several domains, such as content areas. We extend the second-order model to a third-order model for assessments that include subdomains nested in domains. Using a graphical model framework, it is shown how the model does not suffer from the curse of…
Descriptors: Item Response Theory, Models, Educational Assessment, Computation
González, B. Jorge; von Davier, Matthias – Journal of Educational Measurement, 2013
Based on Lord's criterion of equity of equating, van der Linden (this issue) revisits the so-called local equating method and offers alternative as well as new thoughts on several topics including the types of transformations, symmetry, reliability, and population invariance appropriate for equating. A remarkable aspect is to define equating…
Descriptors: Equated Scores, Statistical Analysis, Models, Statistical Inference
von Davier, Matthias; González B., Jorge; von Davier, Alina A. – Journal of Educational Measurement, 2013
Local equating (LE) is based on Lord's criterion of equity. It defines a family of true transformations that aim at the ideal of equitable equating. van der Linden (this issue) offers a detailed discussion of common issues in observed-score equating relative to this local approach. By assuming an underlying item response theory model, one of…
Descriptors: Equated Scores, Transformations (Mathematics), Item Response Theory, Raw Scores
Carlson, James E.; von Davier, Matthias – ETS Research Report Series, 2013
Few would doubt that ETS researchers have contributed more to the general topic of item response theory (IRT) than individuals from any other institution. In this report, we briefly review most of those contributions, dividing them into sections by decades of publication, beginning with early work by Fred Lord and Bert Green in the 1950s and…
Descriptors: Item Response Theory, Educational Research, Measurement Techniques, Psychometrics
von Davier, Matthias – Educational Testing Service, 2011
This report shows that the deterministic-input noisy-AND (DINA) model is a special case of more general compensatory diagnostic models by means of a reparameterization of the skill space and the design (Q-) matrix of item by skills associations. This reparameterization produces a compensatory model that is equivalent to the (conjunctive) DINA…
Descriptors: Clinical Diagnosis, Classification, Models, Matrices
Wetzel, Eunike; Xu, Xueli; von Davier, Matthias – Educational and Psychological Measurement, 2015
In large-scale educational surveys, a latent regression model is used to compensate for the shortage of cognitive information. Conventionally, the covariates in the latent regression model are principal components extracted from background data. This operational method has several important disadvantages, such as the handling of missing data and…
Descriptors: Surveys, Regression (Statistics), Models, Research Methodology
Hsieh, Chueh-an; Xu, Xueli; von Davier, Matthias – Educational Testing Service, 2010
This paper presents an application of a jackknifing approach to variance estimation of ability inferences for groups of students, using a multidimensional discrete model for item response data. The data utilized to demonstrate the approach come from the National Assessment of Educational Progress (NAEP). In contrast to the operational approach…
Descriptors: National Competency Tests, Reading Tests, Grade 4, Computation
von Davier, Matthias; Xu, Xueli; Carstensen, Claus H. – Psychometrika, 2011
The aim of the research presented here is the use of extensions of longitudinal item response theory (IRT) models in the analysis and comparison of group-specific growth in large-scale assessments of educational outcomes. A general discrete latent variable model was used to specify and compare two types of multidimensional item-response-theory…
Descriptors: Educational Objectives, Outcomes of Education, Measures (Individuals), Item Response Theory
von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2009
If questioned about their beliefs, psychometricians in one camp would argue the firm conviction that the Rasch model is mathematically elegant and intuitive as well as plausible for practitioners, pointing out the advantages of a simple model that "counts" every item in the same way. Psychometricians of another camp would argue that the three…
Descriptors: Item Response Theory, Models, Guessing (Tests), Probability