Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 8 |
Descriptor
Data Analysis | 10 |
Scores | 10 |
Item Response Theory | 7 |
Scaling | 7 |
Evaluation Methods | 4 |
Test Items | 4 |
Correlation | 3 |
High Schools | 3 |
Item Analysis | 3 |
Mathematical Concepts | 3 |
Mathematics Education | 3 |
More ▼ |
Source
ETS Research Report Series | 2 |
New Meridian Corporation | 2 |
Educational Assessment | 1 |
International Journal of… | 1 |
Journal of Educational… | 1 |
Journal of Educational and… | 1 |
Author
Blais, Jean-Guy | 1 |
D'Agostino, Jerome | 1 |
Feuerstahler, Leah | 1 |
Fu, Jianbin | 1 |
Karpinski, Aryn | 1 |
Lee, Minji K. | 1 |
Lockwood, J. R. | 1 |
Mariano, Louis T. | 1 |
Mavronikolas, Elia | 1 |
McCaffrey, Daniel F. | 1 |
Melican, Gerald J. | 1 |
More ▼ |
Publication Type
Journal Articles | 6 |
Reports - Research | 6 |
Reports - Descriptive | 3 |
Numerical/Quantitative Data | 2 |
Speeches/Meeting Papers | 2 |
Reports - Evaluative | 1 |
Education Level
Elementary Education | 3 |
High Schools | 3 |
Secondary Education | 3 |
Early Childhood Education | 2 |
Grade 3 | 2 |
Grade 4 | 2 |
Grade 5 | 2 |
Grade 6 | 2 |
Grade 7 | 2 |
Grade 9 | 2 |
Intermediate Grades | 2 |
More ▼ |
Audience
Location
Arizona | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019
Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…
Descriptors: Item Response Theory, Models, Scores, Comparative Analysis
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Fu, Jianbin; Zapata, Diego; Mavronikolas, Elia – ETS Research Report Series, 2014
Simulation or game-based assessments produce outcome data and process data. In this article, some statistical models that can potentially be used to analyze data from simulation or game-based assessments are introduced. Specifically, cognitive diagnostic models that can be used to estimate latent skills from outcome data so as to scale these…
Descriptors: Simulation, Evaluation Methods, Games, Data Collection
New Meridian Corporation, 2020
The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics summative assessments in grades 3 through 8 and high school. The ELA/L assessments focus on reading and comprehending a range of sufficiently complex texts independently and…
Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation
New Meridian Corporation, 2020
The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics assessments in grades 3 through 8 and high school. New Meridian, in coordination with multiple states and vendors, developed an alternate form of the summative assessment to…
Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation
von Davier, Alina A. – ETS Research Report Series, 2012
Maintaining comparability of test scores is a major challenge faced by testing programs that have almost continuous administrations. Among the potential problems are scale drift and rapid accumulation of errors. Many standard quality control techniques for testing programs, which can effectively detect and address scale drift for small numbers of…
Descriptors: Quality Control, Data Analysis, Trend Analysis, Scaling
Mariano, Louis T.; McCaffrey, Daniel F.; Lockwood, J. R. – Journal of Educational and Behavioral Statistics, 2010
There is an increasing interest in using longitudinal measures of student achievement to estimate individual teacher effects. Current multivariate models assume each teacher has a single effect on student outcomes that persists undiminished to all future test administrations (complete persistence [CP]) or can diminish with time but remains…
Descriptors: Persistence, Academic Achievement, Data Analysis, Teacher Influence
D'Agostino, Jerome; Karpinski, Aryn; Welsh, Megan – International Journal of Testing, 2011
After a test is developed, most content validation analyses shift from ascertaining domain definition to studying domain representation and relevance because the domain is assumed to be set once a test exists. We present an approach that allows for the examination of alternative domain structures based on extant test items. In our example based on…
Descriptors: Expertise, Test Items, Mathematics Tests, Factor Analysis
Micceri, Theodore; And Others – 1987
Several issues relating to agreement estimates for different types of data from performance evaluations are considered. New indices of agreement are presented for ordinal level items and for summative scores produced by nominal or ordinal level items. Two sets of empirical data illustrate the performance of the two formulas derived to estimate…
Descriptors: Correlation, Data Analysis, Educational Research, Estimation (Mathematics)
Blais, Jean-Guy – 1993
Tools used in scaling proficiency scores from the Second International Assessment of Educational Progress (IAEP) are described. The second IAEP study, conducted in 1991, was an international comparative study of the mathematics and science skills of samples of 9- and 13-year-old students from 20 countries. This paper focuses on part of the second…
Descriptors: Academic Achievement, Adolescents, Cross Cultural Studies, Data Analysis