Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 8 |
Descriptor
Scaling | 9 |
Grade 2 | 8 |
Elementary School Students | 5 |
Item Response Theory | 5 |
Grade 1 | 4 |
Test Items | 4 |
Difficulty Level | 3 |
Grade 3 | 3 |
Kindergarten | 3 |
Children | 2 |
Correlation | 2 |
Author
Al Otaiba, Stephanie | 1 |
Almehrizi, Rashid S. | 1 |
Alonzo, Julie | 1 |
Anderson, Daniel | 1 |
Avery, Marybell | 1 |
Bovaird, James A. | 1 |
Boyer, Ty W. | 1 |
Connor, Carol McDonald | 1 |
Custer, Michael | 1 |
Dyson, Ben | 1 |
Fisette, Jennifer L. | 1 |
Publication Type
Journal Articles | 8 |
Reports - Research | 7 |
Reports - Evaluative | 2 |
Numerical/Quantitative Data | 1 |
Education Level
Grade 2 | 9 |
Elementary Education | 8 |
Early Childhood Education | 6 |
Primary Education | 6 |
Grade 1 | 4 |
Grade 3 | 4 |
Grade 4 | 3 |
Grade 5 | 2 |
Higher Education | 2 |
Kindergarten | 2 |
Postsecondary Education | 2 |
Assessments and Surveys
Stanford Achievement Tests | 1 |
Guangming Li; Zhengyan Liang – SAGE Open, 2024
To investigate how the separation of grade distributions and the ratio of common items affect the precision of vertical scaling, this simulation study uses a common-item design with first grade as the base grade. Four grades of 1,000 students each take a 100-item test. The Monte Carlo simulation method is used…
Descriptors: Elementary School Students, Grade 1, Grade 2, Grade 3
Hawley, Leslie R.; Bovaird, James A.; Wu, ChaoRong – Applied Measurement in Education, 2017
Value-added assessment methods have been criticized by researchers and policy makers for a number of reasons. One issue includes the sensitivity of model results across different outcome measures. This study examined the utility of incorporating multivariate latent variable approaches within a traditional value-added framework. We evaluated the…
Descriptors: Value Added Models, Reliability, Multivariate Analysis, Scaling
Wallot, Sebastian; O'Brien, Beth A.; Haussmann, Anna; Kloos, Heidi; Lyby, Marlene S. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2014
Reading speed is commonly used as an index of reading fluency. However, reading speed is not a consistent predictor of text comprehension, when speed and comprehension are measured on the same text within the same reader. This might be due to the somewhat ambiguous nature of reading speed, which is sometimes regarded as a feature of the reading…
Descriptors: Experimental Psychology, Reading Rate, Reading Comprehension, Reading Processes
Boyer, Ty W.; Levine, Susan C. – Journal of Experimental Child Psychology, 2012
The current experiments examined the role of scale factor in children's proportional reasoning. Experiment 1 used a choice task and Experiment 2 used a production task to examine the abilities of kindergartners through fourth-graders to match equivalent, visually depicted proportional relations. The findings of both experiments show that accuracy…
Descriptors: Scaling, Measures (Individuals), Mathematical Concepts, Task Analysis
Almehrizi, Rashid S. – Applied Psychological Measurement, 2013
The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (α; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…
Descriptors: Raw Scores, Scaling, Reliability, Computation
Irvin, P. Shawn; Saven, Jessica L.; Alonzo, Julie; Park, Bitnara Jasmine; Anderson, Daniel; Tindal, Gerald – Behavioral Research and Teaching, 2012
The results of formative assessments are regularly used to inform important instructional decisions (e.g., targeted intervention) within a response to intervention (RTI) system of teaching and learning. The validity of such instructional decision-making depends, in part, on the alignment between formative measures and the academic content…
Descriptors: Elementary School Mathematics, Curriculum Based Assessment, Mathematics Tests, Academic Standards
Petscher, Yaacov; Connor, Carol McDonald; Al Otaiba, Stephanie – Assessment for Effective Intervention, 2012
This study investigated the psychometrics of the "Diagnostic Evaluation of Language Variation-Screening Test" (DELV-S) using confirmatory factor analysis, item response theory, and differential item functioning (DIF). Responses from 1,764 students in kindergarten through second grade were used in the study, with results indicating…
Descriptors: Diagnostic Tests, Screening Tests, Language Variation, Psychometrics
Fox, Connie; Zhu, Weimo; Park, Youngsik; Fisette, Jennifer L.; Graber, Kim C.; Dyson, Ben; Avery, Marybell; Franck, Marian; Placek, Judith H.; Rink, Judy; Raynes, De – Measurement in Physical Education and Exercise Science, 2011
In addition to validity and reliability evidence, other psychometric qualities of the PE Metrics assessments needed to be examined. This article describes how those critical psychometric issues were addressed during the PE Metrics assessment bank construction. Specifically, issues included (a) number of items or assessments needed, (b) training…
Descriptors: Measures (Individuals), Psychometrics, Interrater Reliability, Training
Pomplun, Mark; Omar, Hafidz; Custer, Michael – Educational and Psychological Measurement, 2004
The present study compares vertical scaling results for the Rasch model from BILOG-MG and WINSTEPS. The item and ability parameters for the real and simulated mathematics tests were scaled across five grades, second to sixth. The simulated data were based on real data for a series of mathematics tests for Grades 2 to 6. The results from WINSTEPS…
Descriptors: Elementary Education, Scaling, Mathematics Tests, Item Response Theory