ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	8

Descriptor

Data Analysis	10
Scores	10
Item Response Theory	7
Scaling	7
Evaluation Methods	4
Test Items	4
Correlation	3
High Schools	3
Item Analysis	3
Mathematical Concepts	3
Mathematics Education	3
Mathematics Skills	3
Models	3
Multidimensional Scaling	3
Performance	3
Psychometrics	3
Quality Control	3
Test Construction	3
Test Reliability	3
Test Validity	3
Academic Achievement	2
Achievement Gains	2
Automation	2
Bayesian Statistics	2
Comparative Analysis	2
More ▼

Source

ETS Research Report Series	2
New Meridian Corporation	2
Educational Assessment	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1

Author

Blais, Jean-Guy	1
D'Agostino, Jerome	1
Feuerstahler, Leah	1
Fu, Jianbin	1
Karpinski, Aryn	1
Lee, Minji K.	1
Lockwood, J. R.	1
Mariano, Louis T.	1
Mavronikolas, Elia	1
McCaffrey, Daniel F.	1
Melican, Gerald J.	1
Micceri, Theodore	1
Sweeney, Kevin	1
Welsh, Megan	1
Wilson, Mark	1
Zapata, Diego	1
von Davier, Alina A.	1
More ▼

Publication Type

Journal Articles	6
Reports - Research	6
Reports - Descriptive	3
Numerical/Quantitative Data	2
Speeches/Meeting Papers	2
Reports - Evaluative	1

Education Level

Elementary Education	3
High Schools	3
Secondary Education	3
Early Childhood Education	2
Grade 3	2
Grade 4	2
Grade 5	2
Grade 6	2
Grade 7	2
Grade 9	2
Intermediate Grades	2
Junior High Schools	2
Middle Schools	2
Primary Education	2
More ▼

Audience

Location

Arizona

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Scale Alignment in Between-Item Multidimensional Rasch Models

Peer reviewed

Direct link

Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019

Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…

Descriptors: Item Response Theory, Models, Scores, Comparative Analysis

Test Assembly Implications for Providing Reliable and Valid Subscores

Peer reviewed

Direct link

Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017

This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…

Descriptors: Scores, Test Construction, Test Reliability, Test Validity

Statistical Methods for Assessments in Simulations and Serious Games. Research Report. ETS RR-14-12

Peer reviewed
PDF on ERIC

Download full text

Fu, Jianbin; Zapata, Diego; Mavronikolas, Elia – ETS Research Report Series, 2014

Simulation or game-based assessments produce outcome data and process data. In this article, some statistical models that can potentially be used to analyze data from simulation or game-based assessments are introduced. Specifically, cognitive diagnostic models that can be used to estimate latent skills from outcome data so as to scale these…

Descriptors: Simulation, Evaluation Methods, Games, Data Collection

New Meridian Technical Report 2018-2019

Download full text

New Meridian Corporation, 2020

The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics summative assessments in grades 3 through 8 and high school. The ELA/L assessments focus on reading and comprehending a range of sufficiently complex texts independently and…

Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation

New Meridian Technical Report 2018-2019: Alternate Blueprint

Download full text

New Meridian Corporation, 2020

The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics assessments in grades 3 through 8 and high school. New Meridian, in coordination with multiple states and vendors, developed an alternate form of the summative assessment to…

Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation

The Use of Quality Control and Data Mining Techniques for Monitoring Scaled Scores: An Overview. Research Report. ETS RR-12-20

Peer reviewed
PDF on ERIC

Download full text

von Davier, Alina A. – ETS Research Report Series, 2012

Maintaining comparability of test scores is a major challenge faced by testing programs that have almost continuous administrations. Among the potential problems are scale drift and rapid accumulation of errors. Many standard quality control techniques for testing programs, which can effectively detect and address scale drift for small numbers of…

Descriptors: Quality Control, Data Analysis, Trend Analysis, Scaling

A Model for Teacher Effects from Longitudinal Data without Assuming Vertical Scaling

Peer reviewed

Direct link

Mariano, Louis T.; McCaffrey, Daniel F.; Lockwood, J. R. – Journal of Educational and Behavioral Statistics, 2010

There is an increasing interest in using longitudinal measures of student achievement to estimate individual teacher effects. Current multivariate models assume each teacher has a single effect on student outcomes that persists undiminished to all future test administrations (complete persistence [CP]) or can diminish with time but remains…

Descriptors: Persistence, Academic Achievement, Data Analysis, Teacher Influence

A Method to Examine Content Domain Structures

Peer reviewed

Direct link

D'Agostino, Jerome; Karpinski, Aryn; Welsh, Megan – International Journal of Testing, 2011

After a test is developed, most content validation analyses shift from ascertaining domain definition to studying domain representation and relevance because the domain is assumed to be set once a test exists. We present an approach that allows for the examination of alternative domain structures based on extant test items. In our example based on…

Descriptors: Expertise, Test Items, Mathematics Tests, Factor Analysis

Interrater Agreement: Same Data, Different Definitions, Different Outcomes.

Download full text

Micceri, Theodore; And Others – 1987

Several issues relating to agreement estimates for different types of data from performance evaluations are considered. New indices of agreement are presented for ordinal level items and for summative scores produced by nominal or ordinal level items. Two sets of empirical data illustrate the performance of the two formulas derived to estimate…

Descriptors: Correlation, Data Analysis, Educational Research, Estimation (Mathematics)

Item Response Theory Scaling with Heterogeneous Populations.

Download full text

Blais, Jean-Guy – 1993

Tools used in scaling proficiency scores from the Second International Assessment of Educational Progress (IAEP) are described. The second IAEP study, conducted in 1991, was an international comparative study of the mathematics and science skills of samples of 9- and 13-year-old students from 20 countries. This paper focuses on part of the second…

Descriptors: Academic Achievement, Adolescents, Cross Cultural Studies, Data Analysis