ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	8

Descriptor

Scaling	9
Grade 2	8
Elementary School Students	5
Item Response Theory	5
Grade 1	4
Test Items	4
Difficulty Level	3
Grade 3	3
Kindergarten	3
Children	2
Correlation	2
Experimental Psychology	2
Grade 4	2
Item Banks	2
Mathematics Tests	2
Measurement Techniques	2
Measures (Individuals)	2
Psychometrics	2
Reliability	2
Statistical Analysis	2
Test Bias	2
Test Construction	2
Test Reliability	2
Verbal Ability	2
Ability Identification	1
More ▼

Source

Applied Measurement in…	1
Applied Psychological…	1
Assessment for Effective…	1
Behavioral Research and…	1
Educational and Psychological…	1
Journal of Experimental Child…	1
Journal of Experimental…	1
Measurement in Physical…	1
SAGE Open	1

Publication Type

Journal Articles	8
Reports - Research	7
Reports - Evaluative	2
Numerical/Quantitative Data	1

Education Level

Grade 2	9
Elementary Education	8
Early Childhood Education	6
Primary Education	6
Grade 1	4
Grade 3	4
Grade 4	3
Grade 5	2
Higher Education	2
Kindergarten	2
Postsecondary Education	2
Elementary Secondary Education	1
Grade 6	1
Intermediate Grades	1
More ▼

Audience

Location

Florida	1
Tennessee	1

Laws, Policies, & Programs

Assessments and Surveys

Stanford Achievement Tests

What Works Clearinghouse Rating

Showing all 9 results Save | Export

The Effect of the Ratio of Common Items and the Separation of Grade Distributions on the Precision of Vertical Scaling

Peer reviewed

Direct link

Guangming Li; Zhengyan Liang – SAGE Open, 2024

In order to investigate the influence of separation of grade distributions and ratio of common items on the precision of vertical scaling, this simulation study chooses common item design and first grade as base grade. There are four grades with 1,000 students each to take part in a test which has 100 items. Monte Carlo simulation method is used…

Descriptors: Elementary School Students, Grade 1, Grade 2, Grade 3

Stability of Teacher Value-Added Rankings across Measurement Model and Scaling Conditions

Peer reviewed

Direct link

Hawley, Leslie R.; Bovaird, James A.; Wu, ChaoRong – Applied Measurement in Education, 2017

Value-added assessment methods have been criticized by researchers and policy makers for a number of reasons. One issue includes the sensitivity of model results across different outcome measures. This study examined the utility of incorporating multivariate latent variable approaches within a traditional value-added framework. We evaluated the…

Descriptors: Value Added Models, Reliability, Multivariate Analysis, Scaling

The Role of Reading Time Complexity and Reading Speed in Text Comprehension

Peer reviewed

Direct link

Wallot, Sebastian; O'Brien, Beth A.; Haussmann, Anna; Kloos, Heidi; Lyby, Marlene S. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2014

Reading speed is commonly used as an index of reading fluency. However, reading speed is not a consistent predictor of text comprehension, when speed and comprehension are measured on the same text within the same reader. This might be due to the somewhat ambiguous nature of reading speed, which is sometimes regarded as a feature of the reading…

Descriptors: Experimental Psychology, Reading Rate, Reading Comprehension, Reading Processes

Child Proportional Scaling: Is 1/3 = 2/6 = 3/9 = 4/12?

Peer reviewed

Direct link

Boyer, Ty W.; Levine, Susan C. – Journal of Experimental Child Psychology, 2012

The current experiments examined the role of scale factor in children's proportional reasoning. Experiment 1 used a choice task and Experiment 2 used a production task to examine the abilities of kindergartners through fourth-graders to match equivalent, visually depicted proportional relations. The findings of both experiments show that accuracy…

Descriptors: Scaling, Measures (Individuals), Mathematical Concepts, Task Analysis

Coefficient Alpha and Reliability of Scale Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Applied Psychological Measurement, 2013

The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (a; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…

Descriptors: Raw Scores, Scaling, Reliability, Computation

The Development and Scaling of the easyCBM CCSS Elementary Mathematics Measures: Grade 2. Technical Report #1316

Download full text

Irvin, P. Shawn; Saven, Jessica L.; Alonzo, Julie; Park, Bitnara Jasmine; Anderson, Daniel; Tindal, Gerald – Behavioral Research and Teaching, 2012

The results of formative assessments are regularly used to inform important instructional decisions (e.g., targeted intervention) within a response to intervention (RTI) system of teaching and learning. The validity of such instructional decision-making depends, in part, on the alignment between formative measures and the academic content…

Descriptors: Elementary School Mathematics, Curriculum Based Assessment, Mathematics Tests, Academic Standards

Psychometric Analysis of the Diagnostic Evaluation of Language Variation Assessment

Peer reviewed

Direct link

Petscher, Yaacov; Connor, Carol McDonald; Al Otaiba, Stephanie – Assessment for Effective Intervention, 2012

This study investigated the psychometrics of the "Diagnostic Evaluation of Language Variation-Screening Test" (DELV-S) test using confirmatory factor analysis, item response theory, and differential item functioning (DIF). Responses from 1,764 students in kindergarten through second grade were used in the study, with results indicating…

Descriptors: Diagnostic Tests, Screening Tests, Language Variation, Psychometrics

Peer reviewed

Direct link

Fox, Connie; Zhu, Weimo; Park, Youngsik; Fisette, Jennifer L.; Graber, Kim C.; Dyson, Ben; Avery, Marybell; Franck, Marian; Placek, Judith H.; Rink, Judy; Raynes, De – Measurement in Physical Education and Exercise Science, 2011

In addition to validity and reliability evidence, other psychometric qualities of the PE Metrics assessments needed to be examined. This article describes how those critical psychometric issues were addressed during the PE Metrics assessment bank construction. Specifically, issues included (a) number of items or assessments needed, (b) training…

Descriptors: Measures (Individuals), Psychometrics, Interrater Reliability, Training

A Comparison of Winsteps and Bilog-MG for Vertical Scaling with the Rasch Model

Peer reviewed

Direct link

Pomplun, Mark; Omar, Hafidz; Custer, Michael – Educational and Psychological Measurement, 2004

The present study compares vertical scaling results for the Rasch model from BILOG-MG and WINSTEPS. The item and ability parameters for the real and simulated mathematics tests were scaled across five grades, second to sixth. The simulated data were based on real data for a series of mathematics tests for Grades 2 to 6. The results from WINSTEPS…

Descriptors: Elementary Education, Scaling, Mathematics Tests, Item Response Theory

Al Otaiba, Stephanie	1
Almehrizi, Rashid S.	1
Alonzo, Julie	1
Anderson, Daniel	1
Avery, Marybell	1
Bovaird, James A.	1
Boyer, Ty W.	1
Connor, Carol McDonald	1
Custer, Michael	1
Dyson, Ben	1
Fisette, Jennifer L.	1
Fox, Connie	1
Franck, Marian	1
Graber, Kim C.	1
Guangming Li	1
Haussmann, Anna	1
Hawley, Leslie R.	1
Irvin, P. Shawn	1
Kloos, Heidi	1
Levine, Susan C.	1
Lyby, Marlene S.	1
O'Brien, Beth A.	1
Omar, Hafidz	1
Park, Bitnara Jasmine	1
Park, Youngsik	1
More ▼