ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	8

Descriptor

Error of Measurement	11
Scaling	11
Item Response Theory	8
Achievement Tests	4
Data Collection	4
Equated Scores	4
Mathematics Tests	4
Scoring	4
Test Construction	4
Test Reliability	4
Testing Programs	4
Academic Achievement	3
Common Core State Standards	3
English	3
Grade 3	3
Grade 4	3
Grade 5	3
Grade 6	3
Grade 7	3
Grade 8	3
Language Arts	3
Language Tests	3
Test Bias	3
Test Validity	3
Testing	3
More ▼

Source

New York State Education…	3
Journal of Educational…	2
Educational Measurement:…	1
International Journal of…	1
Journal of Educational and…	1
Measurement and Evaluation in…	1
Psychometrika	1

Author

Kolen, Michael J.	3
Bentler, Peter M.	1
Bontempo, Daniel	1
Brennan, Robert L.	1
Cook, Linda L.	1
Hanson, Bradley A.	1
Lee, Won-Chan	1
Little, Todd D.	1
Martin, Michael O.	1
Mullis, Ina V. S.	1
Petersen, Nancy S.	1
Rioux, Charlie	1
Tong, Ye	1
Tracy, Allison	1
Willse, John T.	1
Yuan, Ke-Hai	1
Zeng, Lingjia	1
More ▼

Publication Type

Reports - Descriptive	11
Journal Articles	7
Numerical/Quantitative Data	3
Speeches/Meeting Papers	2

Education Level

Early Childhood Education	3
Elementary Education	3
Grade 3	3
Grade 4	3
Grade 5	3
Grade 6	3
Grade 7	3
Grade 8	3
Intermediate Grades	3
Junior High Schools	3
Middle Schools	3
Primary Education	3
Secondary Education	3
Elementary Secondary Education	2
More ▼

Audience

Researchers

Location

New York

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

National Assessment of…	2
Iowa Tests of Basic Skills	1
Iowa Tests of Educational…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

On the Merits of Longitudinal Multiple Group Modelling: An Alternative to Multilevel Modelling for Intervention Evaluations

Peer reviewed

Direct link

Little, Todd D.; Bontempo, Daniel; Rioux, Charlie; Tracy, Allison – International Journal of Research & Method in Education, 2022

Multilevel modelling (MLM) is the most frequently used approach for evaluating interventions with clustered data. MLM, however, has some limitations that are associated with numerous obstacles to model estimation and valid inferences. Longitudinal multiple-group (LMG) modelling is a longstanding approach for testing intervention effects using…

Descriptors: Longitudinal Studies, Hierarchical Linear Modeling, Alternative Assessment, Intervention

Polytomous Rasch Models in Counseling Assessment

Peer reviewed

Direct link

Willse, John T. – Measurement and Evaluation in Counseling and Development, 2017

This article provides a brief introduction to the Rasch model. Motivation for using Rasch analyses is provided. Important Rasch model concepts and key aspects of result interpretation are introduced, with major points reinforced using a simulation demonstration. Concrete guidelines are provided regarding sample size and the evaluation of items.

Descriptors: Item Response Theory, Test Results, Test Interpretation, Simulation

TIMSS 2015: Illustrating Advancements in Large-Scale International Assessments

Peer reviewed

Direct link

Martin, Michael O.; Mullis, Ina V. S. – Journal of Educational and Behavioral Statistics, 2019

International large-scale assessments of student achievement such as International Association for the Evaluation of Educational Achievement's Trends in International Mathematics and Science Study (TIMSS) and Progress in International Reading Literacy Study and Organization for Economic Cooperation and Development's Program for International…

Descriptors: Achievement Tests, International Assessment, Mathematics Tests, Science Achievement

Positive Definiteness via Off-Diagonal Scaling of a Symmetric Indefinite Matrix

Peer reviewed

Direct link

Bentler, Peter M.; Yuan, Ke-Hai – Psychometrika, 2011

Indefinite symmetric matrices that are estimates of positive-definite population matrices occur in a variety of contexts such as correlation matrices computed from pairwise present missing data and multinormal based methods for discretized variables. This note describes a methodology for scaling selected off-diagonal rows and columns of such a…

Descriptors: Scaling, Factor Analysis, Correlation, Predictor Variables

New York State Testing Program 2016: English Language Arts and Mathematics Grades 3-8. Technical Report

Download full text

New York State Education Department, 2016

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2016 Operational Tests. This report includes information about test content and test development, item (i.e.,…

Descriptors: Testing Programs, English, Language Arts, Mathematics Tests

New York State Testing Program 2015: English Language Arts and Mathematics Grades 3-8. Technical Report

Download full text

New York State Education Department, 2015

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2015 Operational Tests. This report includes information about test content and test development, item (i.e.,…

Descriptors: Testing Programs, English, Language Arts, Mathematics Tests

New York State Testing Program 2014: English Language Arts and Mathematics Grades 3-8. Technical Report

Download full text

New York State Education Department, 2014

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2014 Operational Tests. This report includes information about test content and test development, item (i.e.,…

Descriptors: Testing Programs, English, Language Arts, Mathematics Tests

Scaling: An Items Module

Peer reviewed

Direct link

Tong, Ye; Kolen, Michael J. – Educational Measurement: Issues and Practice, 2010

"Scaling" is the process of constructing a score scale that associates numbers or other ordered indicators with the performance of examinees. Scaling typically is conducted to aid users in interpreting test results. This module describes different types of raw scores and scale scores, illustrates how to incorporate various sources of…

Descriptors: Test Results, Scaling, Measures (Individuals), Raw Scores

Estimators of Conditional Scale-Score Standard Errors of Measurement: A Simulation Study.

Peer reviewed

Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational Measurement, 2000

Describes four procedures previously developed for estimating conditional standard errors of measurement for scale scores and compares them in a simulation study. All four procedures appear viable. Recommends that test users select a procedure based on various factors such as the type of scale score of concern, test characteristics, assumptions…

Descriptors: Error of Measurement, Estimation (Mathematics), Item Response Theory, Scaling

Conditional Standard Errors of Measurement for Scale Scores Using IRT.

Peer reviewed

Kolen, Michael J.; Zeng, Lingjia; Hanson, Bradley A. – Journal of Educational Measurement, 1996

Presents an Item Response Theory (IRT) method for estimating standard errors of measurement of scale scores for the situation in which scale scores are nonlinear transformations of number-correct scores. Also describes procedures for estimating the average conditional standard error of measurement for scale scores and the reliability of scale…

Descriptors: Error of Measurement, Estimation (Mathematics), Item Response Theory, Reliability

Download full text

Cook, Linda L.; Petersen, Nancy S. – 1986

This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…

Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods