ERIC - Search Results

Showing 1 to 15 of 31 results

Bayesian Diagnostic Classification Models for a Partially Known Q-Matrix

Peer reviewed

Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025

This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…

Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods

Using Cumulative Sum Control Chart to Detect Aberrant Responses in Educational Assessments

Peer reviewed

Wan, Siyu; Keller, Lisa A. – Practical Assessment, Research & Evaluation, 2023

Statistical process control (SPC) charts have been widely used in the field of educational measurement. The cumulative sum (CUSUM) is an established SPC method to detect aberrant responses for educational assessments. There are many studies that investigated the performance of CUSUM in different test settings. This paper describes the CUSUM…

Descriptors: Visual Aids, Educational Assessment, Evaluation Methods, Item Response Theory

Detection of Invalid Test Scores: The Usefulness of Simple Nonparametric Statistics

Peer reviewed

Tendeiro, Jorge N.; Meijer, Rob R. – Journal of Educational Measurement, 2014

In recent guidelines for fair educational testing it is advised to check the validity of individual test scores through the use of person-fit statistics. For practitioners it is unclear on the basis of the existing literature which statistic to use. An overview of relatively simple existing nonparametric approaches to identify atypical response…

Descriptors: Educational Assessment, Test Validity, Scores, Statistical Analysis

Teaching Statistics in Language Testing Courses

Peer reviewed

Brown, James Dean – Language Assessment Quarterly, 2013

The purpose of this article is to examine the literature on teaching statistics for useful ideas that teachers of language testing courses can draw on and incorporate into their teaching toolkits as they see fit. To those ends, the article addresses eight questions: What is known generally about teaching statistics? Why are students so anxious…

Descriptors: Statistics, Teaching Methods, Mathematics Anxiety, Coping

Maintenance of Vertical Scales under Conditions of Item Parameter Drift and Rasch Model-Data Misfit

O'Neil, Timothy P. – ProQuest LLC, 2010

With scant research to draw upon with respect to the maintenance of vertical scales over time, decisions around the creation and performance of vertical scales over time necessarily suffer due to the lack of information. Undetected item parameter drift (IPD) presents one of the greatest threats to scale maintenance within an item response theory…

Descriptors: Scaling, Measures (Individuals), Item Response Theory, Educational Assessment

Improving Explanatory Inferences from Assessments

Diakow, Ronli Phyllis – ProQuest LLC, 2013

This dissertation comprises three papers that propose, discuss, and illustrate models to make improved inferences about research questions regarding student achievement in education. Addressing the types of questions common in educational research today requires three different "extensions" to traditional educational assessment: (1)…

Descriptors: Inferences, Educational Assessment, Academic Achievement, Educational Research

Incorporating Learning into the Cognitive Assessment Framework

Studer, Cassandra; Junker, Brian; Chan, Helen – Society for Research on Educational Effectiveness, 2012

The authors aimed to incorporate learning into the cognitive assessment framework that exists for static assessment data. In order to accomplish this, they derive a common likelihood function for dynamic models and introduce Parameter Driven Process for Change + Cognitive Diagnosis Model (PDPC + CDM), a dynamic model which tracks learning…

Descriptors: Foreign Countries, Data Analysis, Cognitive Measurement, Measurement Techniques

Rater Training to Support High-Stakes Simulation-Based Assessments

Peer reviewed

Feldman, Moshe; Lazzara, Elizabeth H.; Vanderbilt, Allison A.; DiazGranados, Deborah – Journal of Continuing Education in the Health Professions, 2012

Competency-based assessment and an emphasis on obtaining higher-level outcomes that reflect physicians' ability to demonstrate their skills have created a need for more advanced assessment practices. Simulation-based assessments provide medical education planners with tools to better evaluate the 6 Accreditation Council for Graduate Medical…

Descriptors: Performance Based Assessment, Physicians, Accuracy, High Stakes Tests

A Comparison of IRT Linking Procedures

Peer reviewed

Lee, Won-Chan; Ban, Jae-Chun – Applied Measurement in Education, 2010

Various applications of item response theory often require linking to achieve a common scale for item parameter estimates obtained from different groups. This article used a simulation to examine the relative performance of four different item response theory (IRT) linking procedures in a random groups equating design: concurrent calibration with…

Descriptors: Item Response Theory, Simulation, Comparative Analysis, Measurement Techniques

Assumptions of Value-Added Models for Estimating School Effects

Peer reviewed

Reardon, Sean F.; Raudenbush, Stephen W. – Education Finance and Policy, 2009

The ability of school (or teacher) value-added models to provide unbiased estimates of school (or teacher) effects rests on a set of assumptions. In this article, we identify six assumptions that are required so that the estimands of such models are well defined and the models are able to recover the desired parameters from observable data. These…

Descriptors: School Effectiveness, Inferences, Educational Assessment, Measurement Techniques

Model-Free CUSUM Methods for Person Fit

Peer reviewed

Armstrong, Ronald D.; Shi, Min – Journal of Educational Measurement, 2009

This article demonstrates the use of a new class of model-free cumulative sum (CUSUM) statistics to detect person fit given the responses to a linear test. The fundamental statistic being accumulated is the likelihood ratio of two probabilities. The detection performance of this CUSUM scheme is compared to other model-free person-fit statistics…

Descriptors: Probability, Simulation, Models, Psychometrics

Differential Item Functioning Analysis Using Rasch Item Information Functions

Peer reviewed

Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009

Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…

Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment

Impact of Missing Data on the Detection of Differential Item Functioning: The Case of Mantel-Haenszel and Logistic Regression Analysis

Peer reviewed

Robitzsch, Alexander; Rupp, Andre A. – Educational and Psychological Measurement, 2009

This article describes the results of a simulation study to investigate the impact of missing data on the detection of differential item functioning (DIF). Specifically, it investigates how four methods for dealing with missing data (listwise deletion, zero imputation, two-way imputation, response function imputation) interact with two methods of…

Descriptors: Test Bias, Simulation, Interaction, Effect Size

Multidimensional Adaptive Testing in Educational and Psychological Measurement: Current State and Future Challenges

Peer reviewed

Frey, Andreas; Seitz, Nicki-Nils – Studies in Educational Evaluation, 2009

The paper gives an overview of multidimensional adaptive testing (MAT) and evaluates its applicability in educational and psychological testing. The approach of Segall (1996) is described as a general framework for MAT. The main advantage of MAT is its capability to increase measurement efficiency. In simulation studies conceptualizing situations…

Descriptors: Psychological Testing, Adaptive Testing, Simulation, Evaluation Methods

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology
