ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	17

Descriptor

Evaluation Methods	23
Goodness of Fit	23
Test Items	23
Item Response Theory	10
Models	10
Simulation	7
Item Analysis	6
Psychometrics	6
Factor Analysis	5
Test Construction	5
Educational Assessment	4
Evaluation Research	3
Factor Structure	3
Foreign Countries	3
Measures (Individuals)	3
Research Methodology	3
Responses	3
Scores	3
Statistical Analysis	3
Adolescents	2
Anxiety	2
Bayesian Statistics	2
Cognitive Ability	2
Comparative Analysis	2
Computer Assisted Testing	2
More ▼

Source

Applied Psychological…	2
Educational and Psychological…	2
Journal of Educational and…	2
Applied Measurement in…	1
Assessment	1
Journal of Applied Measurement	1
Journal of Educational…	1
Journal of Emotional and…	1
Journal of Psychoeducational…	1
Journal of Research in…	1
Measurement:…	1
Multivariate Behavioral…	1
Online Submission	1
Practical Assessment,…	1
Research Quarterly for…	1
Science Education	1
More ▼

Publication Type

Journal Articles	18
Reports - Research	13
Reports - Evaluative	6
Reports - Descriptive	4
Speeches/Meeting Papers	3
Numerical/Quantitative Data	1
Opinion Papers	1

Education Level

High Schools	2
Middle Schools	2
Secondary Education	2
Higher Education	1

Audience

Researchers

Location

California	1
China	1
Pennsylvania	1
United Kingdom	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

California Achievement Tests	1
Medical College Admission Test	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 23 results Save | Export

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Direct link

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Using Cumulative Sum Control Chart to Detect Aberrant Responses in Educational Assessments

Peer reviewed
PDF on ERIC

Download full text

Wan, Siyu; Keller, Lisa A. – Practical Assessment, Research & Evaluation, 2023

Statistical process control (SPC) charts have been widely used in the field of educational measurement. The cumulative sum (CUSUM) is an established SPC method to detect aberrant responses for educational assessments. There are many studies that investigated the performance of CUSUM in different test settings. This paper describes the CUSUM…

Descriptors: Visual Aids, Educational Assessment, Evaluation Methods, Item Response Theory

A Bayesian General Model to Account for Individual Differences in Operation-Specific Learning within a Test

Peer reviewed

Direct link

Lozano, José H.; Revuelta, Javier – Educational and Psychological Measurement, 2023

The present paper introduces a general multidimensional model to measure individual differences in learning within a single administration of a test. Learning is assumed to result from practicing the operations involved in solving the items. The model accounts for the possibility that the ability to learn may manifest differently for correct and…

Descriptors: Bayesian Statistics, Learning Processes, Test Items, Item Analysis

Development and Validation of the Learning Progression-Based Assessment of Modern Genetics in a High School Context

Peer reviewed

Direct link

Todd, Amber; Romine, William L.; Cook Whitt, Katahdin – Science Education, 2017

We describe the development, validation, and use of the "Learning Progression-Based Assessment of Modern Genetics" (LPA-MG) in a high school biology context. Items were constructed based on a current learning progression framework for genetics (Shea & Duncan, 2013; Todd & Kenyon, 2015). The 34-item instrument, which was tied to…

Descriptors: Genetics, Science Instruction, High School Students, Evaluation Methods

An Algorithm for Testing Unidimensionality and Clustering Items in Rasch Measurement

Peer reviewed

Direct link

Debelak, Rudolf; Arendasy, Martin – Educational and Psychological Measurement, 2012

A new approach to identify item clusters fitting the Rasch model is described and evaluated using simulated and real data. The proposed method is based on hierarchical cluster analysis and constructs clusters of items that show a good fit to the Rasch model. It thus gives an estimate of the number of independent scales satisfying the postulates of…

Descriptors: Test Items, Factor Analysis, Evaluation Methods, Simulation

Why Should We Assess the Goodness-of-Fit of IRT Models?

Peer reviewed

Direct link

Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013

In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…

Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory

Development and Validation of the Compliant and Principled Sportspersonship Scale

Peer reviewed

Direct link

Perry, John L.; Clough, Peter J.; Crust, Lee; Nabb, Sam L.; Nicholls, Adam R. – Research Quarterly for Exercise and Sport, 2015

Purpose: A new measure of sportspersonship, which differentiates between compliance and principled approaches, was developed and initially validated in 3 studies. Method: Study 1 developed items, assessed content validity, and proposed a model. Study 2 tested the factorial validity of the model on an independent sample. Study 3 further tested the…

Descriptors: Program Development, Program Validation, Physical Education, Compliance (Legal)

Using Surveillance of Mental Health to Increase Understanding of Youth Involvement in High-Risk Behaviors: A Value-Added Analysis

Peer reviewed

Direct link

Dowdy, Erin; Furlong, Michael J.; Sharkey, Jill D. – Journal of Emotional and Behavioral Disorders, 2013

This study examined the potential utility of adding items that assessed youths' emotional and behavioral disorders to a commonly used surveillance survey. The goal was to evaluate whether the added items could enhance understanding of youths' involvement in high-risk behaviors. A sample of 3,331 adolescents in Grades 8, 10, and 12 from four…

Descriptors: Behavior Disorders, Adolescents, Addictive Behavior, Surveys

The Emergence of a Learning Progression in Middle School Chemistry

Peer reviewed

Direct link

Johnson, Philip; Tymms, Peter – Journal of Research in Science Teaching, 2011

Previously, a small scale, interview-based, 3-year longitudinal study (ages 11-14) in one school had suggested a learning progression related to the concept of a substance. This article presents the results of a large-scale, cross-sectional study which used Rasch modeling to test the hypothesis of the learning progression. Data were collected from…

Descriptors: Computer Assisted Testing, Chemistry, Measures (Individuals), Foreign Countries

Factor Analytic Modeling of within Person Variation in Score Profiles

Peer reviewed

Direct link

Davison, Mark L.; Kim, Se-Kang; Close, Catherine – Multivariate Behavioral Research, 2009

A profile is a vector of scores for one examinee. The mean score in the vector can be interpreted as a measure of overall profile height, the variance can be interpreted as a measure of within person variation, and the ipsatized vector of score deviations about the mean can be said to describe the pattern in the score profile. A within person…

Descriptors: Vocational Interests, Interest Inventories, Profiles, Scores

Modified Likelihood-Based Item Fit Statistics for the Generalized Graded Unfolding Model

Peer reviewed

Direct link

Roberts, James S. – Applied Psychological Measurement, 2008

Orlando and Thissen (2000) developed an item fit statistic for binary item response theory (IRT) models known as S-X[superscript 2]. This article generalizes their statistic to polytomous unfolding models. Four alternative formulations of S-X[superscript 2] are developed for the generalized graded unfolding model (GGUM). The GGUM is a…

Descriptors: Item Response Theory, Goodness of Fit, Test Items, Models

Impact of Missing Data on Person-Model Fit and Person Trait Estimation

Peer reviewed

Direct link

Zhang, Bo; Walker, Cindy M. – Applied Psychological Measurement, 2008

The purpose of this research was to examine the effects of missing data on person-model fit and person trait estimation in tests with dichotomous items. Under the missing-completely-at-random framework, four missing data treatment techniques were investigated including pairwise deletion, coding missing responses as incorrect, hotdeck imputation,…

Descriptors: Item Response Theory, Computation, Goodness of Fit, Test Items

Factorial Structure of the Anxiety Control Questionnaire in Chinese Adolescents

Peer reviewed

Direct link

Shujuan, Wang; Meihua, Qian; Jianxin, Zhang – Journal of Psychoeducational Assessment, 2009

This article examines the psychometric structure of the Anxiety Control Questionnaire (ACQ) in Chinese adolescents. With the data collected from 212 senior high school students (94 females, 110 males, 8 unknown), seven models are tested using confirmatory factor analyses in the framework of the multitrait-multimethod strategy. Results indicate…

Descriptors: Multitrait Multimethod Techniques, Factor Structure, Adolescents, Measures (Individuals)

Investigation of a Nonparametric Procedure for Assessing Goodness-of-Fit in Item Response Theory

Peer reviewed

Direct link

Wells, Craig S.; Bolt, Daniel M. – Applied Measurement in Education, 2008

Tests of model misfit are often performed to validate the use of a particular model in item response theory. Douglas and Cohen (2001) introduced a general nonparametric approach for detecting misfit under the two-parameter logistic model. However, the statistical properties of their approach, and empirical comparisons to other methods, have not…

Descriptors: Test Length, Test Items, Monte Carlo Methods, Nonparametric Statistics

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

Previous Page | Next Page »

Pages: 1 | 2

Smith, Richard M.	2
Arendasy, Martin	1
Bolt, Daniel M.	1
Close, Catherine	1
Clough, Peter J.	1
Cook Whitt, Katahdin	1
Crust, Lee	1
Cui, Ying	1
Davison, Mark L.	1
Debelak, Rudolf	1
Dowdy, Erin	1
Furlong, Michael J.	1
Hambleton, Ronald K.	1
Heimberg, Richard G.	1
Holaway, Robert M.	1
James O. Ramsay	1
Jianxin, Zhang	1
Joakim Wallmark	1
Johnson, Philip	1
Juan Li	1
Keller, Lisa A.	1
Kim, Se-Kang	1
Leighton, Jacqueline P.	1
Lozano, José H.	1
More ▼