Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 15 |
Since 2006 (last 20 years) | 23 |
Descriptor
Error of Measurement | 23 |
Grade 3 | 20 |
Elementary School Students | 14 |
Test Items | 9 |
Grade 2 | 8 |
Grade 4 | 8 |
Grade 5 | 6 |
Item Response Theory | 6 |
Longitudinal Studies | 6 |
Mathematics Tests | 6 |
Grade 1 | 5 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 23 |
Journal Articles | 19 |
Education Level
Grade 3 | 23 |
Elementary Education | 19 |
Primary Education | 15 |
Early Childhood Education | 14 |
Grade 4 | 10 |
Grade 2 | 9 |
Grade 5 | 8 |
Intermediate Grades | 7 |
Grade 1 | 6 |
Grade 6 | 4 |
Middle Schools | 4 |
More ▼ |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Early Childhood Longitudinal… | 4 |
Measures of Academic Progress | 1 |
Self Description Questionnaire | 1 |
What Works Clearinghouse Rating
Turner, Kyle T.; Engelhard, George, Jr. – Measurement: Interdisciplinary Research and Perspectives, 2023
The purpose of this study is to illustrate the use of functional data analysis (FDA) as a general methodology for analyzing person response functions (PRFs). Applications of FDA to psychometrics have included the estimation of item response functions and latent distributions, as well as differential item functioning. Although FDA has been…
Descriptors: Data Analysis, Item Response Theory, Psychometrics, Statistical Distributions
Gilbert, Joshua B.; Kim, James S.; Miratrix, Luke W. – Journal of Educational and Behavioral Statistics, 2023
Analyses that reveal how treatment effects vary allow researchers, practitioners, and policymakers to better understand the efficacy of educational interventions. In practice, however, standard statistical methods for addressing heterogeneous treatment effects (HTE) fail to address the HTE that may exist "within" outcome measures. In…
Descriptors: Test Items, Item Response Theory, Computer Assisted Testing, Program Effectiveness
Denis Dumas; Selcuk Acar; Kelly Berthiaume; Peter Organisciak; David Eby; Katalin Grajzel; Theadora Vlaamster; Michele Newman; Melanie Carrera – Grantee Submission, 2023
Open-ended verbal creativity assessments are commonly administered in psychological research and in educational practice to elementary-aged children. Children's responses are then typically rated by teams of judges who are trained to identify original ideas, hopefully with a degree of inter-rater agreement. Even in cases where the judges are…
Descriptors: Elementary School Students, Grade 3, Grade 4, Grade 5
Joshua B. Gilbert; James S. Kim; Luke W. Miratrix – Annenberg Institute for School Reform at Brown University, 2024
Longitudinal models of individual growth typically emphasize between-person predictors of change but ignore how growth may vary "within" persons because each person contributes only one point at each time to the model. In contrast, modeling growth with multi-item assessments allows evaluation of how relative item performance may shift…
Descriptors: Vocabulary Development, Item Response Theory, Test Items, Student Development
Joshua B. Gilbert; James S. Kim; Luke W. Miratrix – Applied Measurement in Education, 2024
Longitudinal models typically emphasize between-person predictors of change but ignore how growth varies "within" persons because each person contributes only one data point at each time. In contrast, modeling growth with multi-item assessments allows evaluation of how relative item performance may shift over time. While traditionally…
Descriptors: Vocabulary Development, Item Response Theory, Test Items, Student Development
Precision of Single-Skill Math CBM Time-Series Data: The Effect of Probe Stratification and Set Size
Solomon, Benjamin G.; Payne, Lexy L.; Campana, Kayla V.; Marr, Erin A.; Battista, Carmela; Silva, Alex; Dawes, Jillian M. – Journal of Psychoeducational Assessment, 2020
Comparatively little research exists on single-skill math (SSM) curriculum-based measurements (CBMs) for the purpose of monitoring growth, as may be done in practice or when monitoring intervention effectiveness within group or single-case research. Therefore, we examined a common variant of SSM-CBM: 1 digit × 1 digit multiplication. Reflecting…
Descriptors: Curriculum Based Assessment, Mathematics Tests, Mathematics Skills, Multiplication
Gorgun, Guher; Bulut, Okan – Educational and Psychological Measurement, 2021
In low-stakes assessments, some students may not reach the end of the test and leave some items unanswered due to various reasons (e.g., lack of test-taking motivation, poor time management, and test speededness). Not-reached items are often treated as incorrect or not-administered in the scoring process. However, when the proportion of…
Descriptors: Scoring, Test Items, Response Style (Tests), Mathematics Tests
Curran, F. Chris; Kitchin, James – AERA Open, 2019
Recent evidence points to the early elementary grades as a pivotal point for the development of science learning trajectories and achievement gaps. Using data from the Early Childhood Longitudinal Study, this study estimates the degree to which time spent on science and the breadth of science topics/skills covered predict science achievement in…
Descriptors: Science Instruction, Elementary School Science, Science Achievement, Children
Dicke, Theresa; Marsh, Herbert W.; Parker, Philip D.; Pekrun, Reinhard; Guo, Jiesi; Televantou, Ioulia – Journal of Educational Psychology, 2018
School-average achievement is often reported to have positive effects on individual achievement (peer spillover effect). However, it is well established that school-average achievement has negative effects on academic self-concept (big-fish-little-pond effect [BFLPE]) and that academic self-concept and achievement are positively correlated and…
Descriptors: Academic Achievement, Self Concept, Peer Influence, Children
Kim, Young-Suk Grace; Schatschneider, Christopher; Wanzek, Jeanne; Gatlin, Brandy; Al Otaiba, Stephanie – Reading and Writing: An Interdisciplinary Journal, 2017
We examined how raters and tasks influence measurement error in writing evaluation and how many raters and tasks are needed to reach a desirable level of 0.90 and 0.80 reliabilities for children in Grades 3 and 4. A total of 211 children (102 boys) were administered three tasks in narrative and expository genres, respectively, and their written…
Descriptors: Writing Evaluation, Elementary School Students, Grade 3, Grade 4
Kim, Young-Suk Grace; Schatschneider, Christopher; Wanzek, Jeanne; Gatlin, Brandy; Al Otaiba, Stephanie – Grantee Submission, 2017
We examined how raters and tasks influence measurement error in writing evaluation and how many raters and tasks are needed to reach a desirable level of 0.90 and 0.80 reliabilities for children in Grades 3 and 4. A total of 211 children (102 boys) were administered three tasks in narrative and expository genres, respectively, and their written…
Descriptors: Writing Evaluation, Elementary School Students, Grade 3, Grade 4
Li, Feifei – ETS Research Report Series, 2017
An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…
Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement
Li, Sylvia; Meyer, Patrick – NWEA, 2019
This simulation study examines the measurement precision, item exposure rates, and the depth of the MAP® Growth™ item pools under various grade-level restrictions. Unlike most summative assessments, MAP Growth allows examinees to see items from any grade level, regardless of the examinee's actual grade level. It does not limit the test to items…
Descriptors: Achievement Tests, Item Banks, Test Items, Instructional Program Divisions
Schoen, Robert C.; Yang, Xiaotong; Liu, Sicong; Paek, Insu – Grantee Submission, 2017
The Early Fractions Test v2.2 is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test v2.2 is to serve as a measure of student outcomes in a randomized trial designed to estimate the effect of an educational…
Descriptors: Psychometrics, Mathematics Tests, Mathematics Achievement, Fractions
Pereira, Nielsen; Bakhiet, Salaheldin Farah; Gentry, Marcia; Balhmar, Tahani Abdulrahman; Hakami, Sultan Mohammed – Journal of Advanced Academics, 2017
This study examined the psychometric properties and measurement invariance of the Arabic version of "My Class Activities" (MCA), an instrument designed to measure students' perceptions of interest, challenge, choice, and enjoyment in classrooms. Scores of 3,516 Sudanese students in Grades 2 to 8 were used. Confirmatory factor analysis…
Descriptors: Student Attitudes, Factor Analysis, Comparative Analysis, Gifted
Previous Page | Next Page »
Pages: 1 | 2