ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	14

Descriptor

Educational Testing	17
Models	17
Psychometrics	17
Educational Assessment	10
Item Response Theory	9
Measurement	9
Student Evaluation	9
Measurement Techniques	8
Evaluation Methods	6
Test Construction	6
Criterion Referenced Tests	5
Diagnostic Tests	5
Adaptive Testing	4
Classification	4
Computer Assisted Testing	4
Evaluation Problems	4
Evidence	4
State of the Art Reviews	4
Test Items	4
Comparative Analysis	3
Probability	3
Validity	3
Bayesian Statistics	2
Educational Research	2
Educational Technology	2
More ▼

Source

Measurement:…	5
Journal of Educational…	3
ETS Research Report Series	1
Educational Measurement:…	1
IAP - Information Age…	1
Journal of Applied Testing…	1
Multivariate Behavioral…	1
Online Submission	1
ProQuest LLC	1
Psychometrika	1
Studies in Educational…	1
More ▼

Publication Type

Journal Articles	14
Opinion Papers	6
Reports - Research	5
Reports - Descriptive	2
Reports - Evaluative	2
Speeches/Meeting Papers	2
Books	1
Dissertations/Theses -…	1
Information Analyses	1

Education Level

Elementary Secondary Education	2
Higher Education	2
Postsecondary Education	2

Audience

Researchers	1
Students	1
Teachers	1

Location

Germany

Laws, Policies, & Programs

Assessments and Surveys

Early Childhood Longitudinal…

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

Peer reviewed

Direct link

Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Multivariate Behavioral Research, 2010

Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…

Descriptors: Educational Testing, Scores, Reports, Psychometrics

What IRT Can and Cannot Do

Peer reviewed

Direct link

Glas, Cees A. W. – Measurement: Interdisciplinary Research and Perspectives, 2009

This author states that, while the article by Gunter Maris and Timo Bechger ("On Interpreting the Model Parameters for the Three Parameter Logistic Model," this issue) is highly interesting, the interest is not so much in the practical implications, but rather in the issue of the meaning and role of statistical models in psychometrics and…

Descriptors: Item Response Theory, Measurement, Psychometrics, Models

Random or Fixed Testlet Effects: A Comparison of Two Multilevel Testlet Models

Direct link

Chen, Tzu-An – ProQuest LLC, 2010

This simulation study compared the performance of two multilevel measurement testlet (MMMT) models: Beretvas and Walker's (2008) two-level MMMT model and Jiao, Wang, and Kamata's (2005) three-level model. Several conditions were manipulated (including testlet length, sample size, and the pattern of the testlet effects) to assess the impact on the…

Descriptors: Simulation, Item Response Theory, Comparative Analysis, Models

Using and Developing Measurement Instruments in Science Education: A Rasch Modeling Approach. Science & Engineering Education Sources

Direct link

Liu, Xiufeng – IAP - Information Age Publishing, Inc., 2010

This book meets a demand in the science education community for a comprehensive and introductory measurement book in science education. It describes measurement instruments reported in refereed science education research journals, and introduces the Rasch modeling approach to developing measurement instruments in common science assessment domains,…

Descriptors: Graduate Students, Textbooks, Research Methodology, Science Tests

Exploring the Relationship between Static and Dynamic Vertical Scaling from Cross-Sectional and Longitudinal Design Perspectives

Download full text

Wang, Shudong; Jiao, Hong; Jiang, Yanming – Online Submission, 2009

The concept of dynamic vertical scaling (DVS) from longitudinal point of view has been proposed as comparing to traditional vertical scaling or static vertical scaling (SVS) from cross-sectional perspective. The effects of differences between DVS and SVS on large-scale student achievements have been investigated. The potential application of DVS…

Descriptors: Scaling, Longitudinal Studies, Academic Achievement, Models

Graded Response Model Based on the Logistic Positive Exponent Family of Models for Dichotomous Responses

Peer reviewed

Direct link

Samejima, Fumiko – Psychometrika, 2008

Samejima ("Psychometrika "65:319--335, 2000) proposed the logistic positive exponent family of models (LPEF) for dichotomous responses in the unidimensional latent space. The objective of the present paper is to propose and discuss a graded response model that is expanded from the LPEF, in the context of item response theory (IRT). This…

Descriptors: Psychological Testing, Item Response Theory, Psychometrics, Educational Testing

Modeling Change in Large-Scale Longitudinal Studies of Educational Growth: Four Decades of Contributions to the Assessment of Educational Growth. Research Report. ETS RR-12-04. ETS R&D Scientific and Policy Contributions Series. ETS SPC-12-01

Peer reviewed
PDF on ERIC

Download full text

Rock, Donald A. – ETS Research Report Series, 2012

This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…

Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development

Model-Free CUSUM Methods for Person Fit

Peer reviewed

Direct link

Armstrong, Ronald D.; Shi, Min – Journal of Educational Measurement, 2009

This article demonstrates the use of a new class of model-free cumulative sum (CUSUM) statistics to detect person fit given the responses to a linear test. The fundamental statistic being accumulated is the likelihood ratio of two probabilities. The detection performance of this CUSUM scheme is compared to other model-free person-fit statistics…

Descriptors: Probability, Simulation, Models, Psychometrics

Automatic Item Generation of Probability Word Problems

Peer reviewed

Direct link

Holling, Heinz; Bertling, Jonas P.; Zeuch, Nina – Studies in Educational Evaluation, 2009

Mathematical word problems represent a common item format for assessing student competencies. Automatic item generation (AIG) is an effective way of constructing many items with predictable difficulties, based on a set of predefined task parameters. The current study presents a framework for the automatic generation of probability word problems…

Descriptors: Word Problems (Mathematics), Probability, Automation, College Students

Diagnostic Models as Partially Ordered Sets

Peer reviewed

Direct link

Tatsuoka, Curtis – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the author addresses what is referred to as the deterministic input, noisy "and" gate (DINA) model. The author mentions concerns with how this model has been formulated and presented. In particular, the author points out that there is a lack of recognition of the confounding of profiles that generally arises and then discusses…

Descriptors: Test Items, Classification, Psychometrics, Item Response Theory

Equivalent Diagnostic Classification Models

Peer reviewed

Direct link

Maris, Gunter; Bechger, Timo – Measurement: Interdisciplinary Research and Perspectives, 2009

Rupp and Templin (2008) do a good job at describing the ever expanding landscape of Diagnostic Classification Models (DCM). In many ways, their review article clearly points to some of the questions that need to be answered before DCMs can become part of the psychometric practitioners toolkit. Apart from the issues mentioned in this article that…

Descriptors: Factor Analysis, Classification, Psychometrics, Item Response Theory

How Much Can We Reliably Know about What Examinees Know?

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.

Descriptors: Scoring, Reliability, Validity, Classification

Diagnostic Classification Models and Multidimensional Adaptive Testing: A Commentary on Rupp and Templin

Peer reviewed

Direct link

Frey, Andreas; Carstensen, Claus H. – Measurement: Interdisciplinary Research and Perspectives, 2009

On a general level, the objective of diagnostic classifications models (DCMs) lies in a classification of individuals regarding multiple latent skills. In this article, the authors show that this objective can be achieved by multidimensional adaptive testing (MAT) as well. The authors discuss whether or not the restricted applicability of DCMs can…

Descriptors: Adaptive Testing, Test Items, Classification, Psychometrics

Modeling Diagnostic Assessments with Bayesian Networks

Peer reviewed

Direct link

Almond, Russell G.; DiBello, Louis V.; Moulder, Brad; Zapata-Rivera, Juan-Diego – Journal of Educational Measurement, 2007

This paper defines Bayesian network models and examines their applications to IRT-based cognitive diagnostic modeling. These models are especially suited to building inference engines designed to be synchronous with the finer grained student models that arise in skills diagnostic assessment. Aspects of the theory and use of Bayesian network models…

Descriptors: Inferences, Models, Item Response Theory, Cognitive Measurement

Evaluating Comparability in Computerized Adaptive Testing: Issues, Criteria and an Example.

Peer reviewed

Wang, Tianyou; Kolen, Michael J. – Journal of Educational Measurement, 2001

Reviews research literature on comparability issues in computerized adaptive testing (CAT) and synthesizes issues specific to comparability and test security. Develops a framework for evaluating comparability that contains three categories of criteria: (1) validity; (2) psychometric property/reliability; and (3) statistical assumption/test…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Criteria

Previous Page | Next Page »

Pages: 1 | 2

Haberman, Shelby J.	2
Sinharay, Sandip	2
Almond, Russell G.	1
Armstrong, Ronald D.	1
Bechger, Timo	1
Bertling, Jonas P.	1
Carstensen, Claus H.	1
Chen, Tzu-An	1
DiBello, Louis V.	1
Frey, Andreas	1
Glas, Cees A. W.	1
Holling, Heinz	1
Jiang, Yanming	1
Jiao, Hong	1
Kolen, Michael J.	1
Liu, Xiufeng	1
Luecht, Richard M.	1
Maris, Gunter	1
Mehrens, William A.	1
Moulder, Brad	1
Puhan, Gautam	1
Rock, Donald A.	1
Samejima, Fumiko	1
Shi, Min	1
Tatsuoka, Curtis	1
More ▼