ERIC - Search Results

Showing 1 to 15 of 22 results

Methods for Imputing Scores When All Responses Are Missing for One or More Polytomous Items: Accuracy and Impact on Psychometric Property. Research Report. ETS RR-23-07

Peer reviewed

Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2023

Though a substantial amount of research exists on imputing missing scores in educational assessments, there is little research on cases where responses or scores to an item are missing for all test takers. In this paper, we tackled the problem of imputing missing scores for tests for which the responses to an item are missing for all test takers.…

Descriptors: Scores, Test Items, Accuracy, Psychometrics

Psychology's Atomic Bomb

Peer reviewed

Borsboom, Denny; Wijsen, Lisa D. – Assessment in Education: Principles, Policy & Practice, 2017

The central role of educational testing practices in contemporary societies can hardly be overstated. It is furthermore evident that psychometric models regulate, justify, and legitimize the processes through which educational testing practices are used. In this commentary, the authors offer some observations that may be relevant for the analyses…

Descriptors: Educational Assessment, Learning, Psychometrics, Power Structure

A Multicomponent Latent Trait Model for Diagnosis

Peer reviewed

Embretson, Susan E.; Yang, Xiangdong – Psychometrika, 2013

This paper presents a noncompensatory latent trait model, the multicomponent latent trait model for diagnosis (MLTM-D), for cognitive diagnosis. In MLTM-D, a hierarchical relationship between components and attributes is specified to be applicable to permit diagnosis at two levels. MLTM-D is a generalization of the multicomponent latent trait…

Descriptors: Mathematics Achievement, Achievement Tests, Item Response Theory, Measurement

A Comparison of Equating/Linking Using the Stocking-Lord Method and Concurrent Calibration with Mixed-Format Tests in the Non-Equivalent Groups Common-Item Design under IRT

Tian, Feng – ProQuest LLC, 2011

There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…

Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis

Random or Fixed Testlet Effects: A Comparison of Two Multilevel Testlet Models

Chen, Tzu-An – ProQuest LLC, 2010

This simulation study compared the performance of two multilevel measurement testlet (MMMT) models: Beretvas and Walker's (2008) two-level MMMT model and Jiao, Wang, and Kamata's (2005) three-level model. Several conditions were manipulated (including testlet length, sample size, and the pattern of the testlet effects) to assess the impact on the…

Descriptors: Simulation, Item Response Theory, Comparative Analysis, Models

The Impact of Equating Method and Format Representation of Common Items on the Adequacy of Mixed-Format Test Equating Using Nonequivalent Groups

Hagge, Sarah Lynn – ProQuest LLC, 2010

Mixed-format tests containing both multiple-choice and constructed-response items are widely used on educational tests. Such tests combine the broad content coverage and efficient scoring of multiple-choice items with the assessment of higher-order thinking skills thought to be provided by constructed-response items. However, the combination of…

Descriptors: Test Format, True Scores, Equated Scores, Psychometrics

A Psychometric Evaluation of a State Testing Program: Accommodated versus Non-Accommodated Students

Roxbury, Tiese L. – ProQuest LLC, 2010

Federal legislation such as "No Child Left Behind" mandated that students with disabilities be included in accountability standards, creating an important responsibility to fairly assess all students, even those with disabilities. Consequently, a sense of urgency was placed on the entire educational system to ensure that these students…

Descriptors: Test Items, Testing Programs, Federal Legislation, Educational Testing

Automatic Item Generation of Probability Word Problems

Peer reviewed

Holling, Heinz; Bertling, Jonas P.; Zeuch, Nina – Studies in Educational Evaluation, 2009

Mathematical word problems represent a common item format for assessing student competencies. Automatic item generation (AIG) is an effective way of constructing many items with predictable difficulties, based on a set of predefined task parameters. The current study presents a framework for the automatic generation of probability word problems…

Descriptors: Word Problems (Mathematics), Probability, Automation, College Students

Psychometric Aspects of Pupil Monitoring Systems

Peer reviewed

Glas, Cees A. W.; Geerlings, Hanneke – Studies in Educational Evaluation, 2009

Pupil monitoring systems support the teacher in tailoring teaching to the individual level of a student and in comparing the progress and results of teaching with national standards. The systems are based on the availability of an item bank calibrated using item response theory. The assessment of the students' progress and results can be further…

Descriptors: Item Banks, Adaptive Testing, National Standards, Psychometrics

Multidimensional Adaptive Testing in Educational and Psychological Measurement: Current State and Future Challenges

Peer reviewed

Frey, Andreas; Seitz, Nicki-Nils – Studies in Educational Evaluation, 2009

The paper gives an overview of multidimensional adaptive testing (MAT) and evaluates its applicability in educational and psychological testing. The approach of Segall (1996) is described as a general framework for MAT. The main advantage of MAT is its capability to increase measurement efficiency. In simulation studies conceptualizing situations…

Descriptors: Psychological Testing, Adaptive Testing, Simulation, Evaluation Methods

Consistent Estimation of Rasch Item Parameters and Their Standard Errors under Complex Sample Designs

Peer reviewed

Cohen, Jon; Chan, Tsze; Jiang, Tao; Seburn, Mary – Applied Psychological Measurement, 2008

U.S. state educational testing programs administer tests to track student progress and hold schools accountable for educational outcomes. Methods from item response theory, especially Rasch models, are usually used to equate different forms of a test. The most popular method for estimating Rasch models yields inconsistent estimates and relies on…

Descriptors: Testing Programs, Educational Testing, Item Response Theory, Computation

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

Diagnostic Models as Partially Ordered Sets

Peer reviewed

Tatsuoka, Curtis – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the author addresses what is referred to as the deterministic input, noisy "and" gate (DINA) model. The author mentions concerns with how this model has been formulated and presented. In particular, the author points out that there is a lack of recognition of the confounding of profiles that generally arises and then discusses…

Descriptors: Test Items, Classification, Psychometrics, Item Response Theory

Diagnostic Classification Models and Multidimensional Adaptive Testing: A Commentary on Rupp and Templin

Peer reviewed

Frey, Andreas; Carstensen, Claus H. – Measurement: Interdisciplinary Research and Perspectives, 2009

On a general level, the objective of diagnostic classification models (DCMs) lies in a classification of individuals regarding multiple latent skills. In this article, the authors show that this objective can be achieved by multidimensional adaptive testing (MAT) as well. The authors discuss whether or not the restricted applicability of DCMs can…

Descriptors: Adaptive Testing, Test Items, Classification, Psychometrics

Introduction to the Rasch Measurement. [videotape].

Wright, Benjamin D. – 1998

In three lectures, Benjamin D. Wright of the University of Chicago introduces the Rasch model and its basic concepts. The first lecture, March 30, 1994, discusses the model created by Georg Rasch, a Danish mathematician, which Dr. Wright initially saw as merely a way to make raw scores into measures. Eventually, the model developed into a…

Descriptors: Educational Testing, Estimation (Mathematics), Item Response Theory, Mathematical Models
