NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 8 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Stone, Elizabeth; Wylie, E. Caroline – ETS Research Report Series, 2019
We describe the summative assessment component within a K-12 assessment program and our development of a validity argument to support its claims with respect to intended uses and interpretations. First, we describe the "Winsight"® assessment program theory of action, a logic model elucidating mechanisms for how use of the assessment…
Descriptors: Summative Evaluation, Educational Assessment, Test Validity, Test Use
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Fu, Jianbin – ETS Research Report Series, 2016
The multidimensional item response theory (MIRT) models with covariates proposed by Haberman and implemented in the "mirt" program provide a flexible way to analyze data based on item response theory. In this report, we discuss applications of the MIRT models with covariates to longitudinal test data to measure skill differences at the…
Descriptors: Item Response Theory, Longitudinal Studies, Test Bias, Goodness of Fit
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Arieli-Attali, Meirav; Cayton-Hodges, Gabrielle – ETS Research Report Series, 2014
Prior work on the "CBAL"™ mathematics competency model resulted in an initial competency model for middle school grades with several learning progressions (LPs) that elaborate central ideas in the competency model and provide a basis for connecting summative and formative assessment. In the current project, we created a competency model…
Descriptors: Mathematics Tests, Elementary School Students, Elementary School Mathematics, Numbers
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Deane, Paul; Graf, Edith Aurora; Higgins, Derrick; Futagi, Yoko; Lawless, René – ETS Research Report Series, 2006
This study focuses on the relationship between item modeling and evidence-centered design (ECD); it considers how an appropriately generalized item modeling software tool can support systematic identification and exploitation of task-model variables, and then examines the feasibility of this goal, using linear-equation items as a test case. The…
Descriptors: Test Items, Models, Computer Software, Equations (Mathematics)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yan, Duanli; Almond, Russell; Mislevy, Robert – ETS Research Report Series, 2004
Diagnostic score reports linking assessment outcomes to instructional interventions are one of the most requested features of assessment products. There is a body of interesting work done in the last 20 years including Tatsuoka's rule space method (Tatsuoka, 1983), Haertal and Wiley's binary skills model (Haertal, 1984; Haertal & Wiley, 1993),…
Descriptors: Comparative Analysis, Models, Bayesian Statistics, Statistical Inference
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Sinharay, Sandip – ETS Research Report Series, 2004
Assessing fit of psychometric models has always been an issue of enormous interest, but there exists no unanimously agreed upon item fit diagnostic for the models. Bayesian networks, frequently used in educational assessments (see, for example, Mislevy, Almond, Yan, & Steinberg, 2001) primarily for learning about students' knowledge and…
Descriptors: Bayesian Statistics, Networks, Models, Goodness of Fit
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Graf, Edith Aurora; Peterson, Stephen; Steffen, Manfred; Lawless, René – ETS Research Report Series, 2005
We describe the item modeling development and evaluation process as applied to a quantitative assessment with high-stakes outcomes. In addition to expediting the item-creation process, a model-based approach may reduce pretesting costs, if the difficulty and discrimination of model-generated items may be predicted to a predefined level of…
Descriptors: Psychometrics, Accuracy, Item Analysis, High Stakes Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2006
More than a dozen statistical models have been developed for the purpose of cognitive diagnosis. These models are supposed to extract a much finer level of information from item responses than traditional unidimensional item response models. In this paper, a general diagnostic model (GDM) was used to analyze a set of simulated sparse data and real…
Descriptors: Statistical Analysis, National Competency Tests, Diagnostic Tests, Item Response Theory