Jake C. Steggerda; Sandra Yu Rueger; Ana J. Bridges – Children & Schools, 2024
Authors evaluated the Student Behavior Checklist-Brief (SBC-B) to test whether teacher-reports of student learning approach (i.e., learned helplessness [LH] and mastery orientation [MO]) were invariant across academic subjects. The current sample includes ethnically diverse seventh and eighth grade students (N = 145; 53 percent male) and six teams…
Descriptors: Psychometrics, Student Behavior, Check Lists, Scores
Televantou, Ioulia; Marsh, Herbert W.; Xu, Kate M.; Guo, Jiesi; Dicke, Theresa – Educational Psychology Review, 2023
The present study uses doubly latent models to estimate the effect of average mathematics achievement at the class level on students' subsequent mathematics achievement (the "Peer Spillover Effect") and mathematics self-concept (the "Big-Fish-Little-Pond-Effect"; BFLPE), controlling for individual differences in prior…
Descriptors: Error of Measurement, Mathematics Achievement, Self Concept, Individual Differences
Rizki Zakwandi; Edi Istiyono; Wipsar Sunu Brams Dwandaru – Education and Information Technologies, 2024
Computational Thinking (CT) skill was a part of the global framework of reference on Digital Literacy for Indicator 4.4.2, widely developed in mathematics and science learning. This study aimed to promote an assessment tool using a two-tier Computerized Adaptive Test (CAT). The study used the Design and Development Research (DDR) method with four…
Descriptors: Computer Assisted Testing, Adaptive Testing, Student Evaluation, Computation
Sanford R. Student; Derek C. Briggs; Laurie Davis – Educational Measurement: Issues and Practice, 2025
Vertical scales are frequently developed using common item nonequivalent group linking. In this design, one can use upper-grade, lower-grade, or mixed-grade common items to estimate the linking constants that underlie the absolute measurement of growth. Using the Rasch model and a dataset from Curriculum Associates' i-Ready Diagnostic in math in…
Descriptors: Elementary School Mathematics, Elementary School Students, Middle School Mathematics, Middle School Students
Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023
In this study, Kernel test equating methods were compared under NEAT and NEC designs. In NEAT design, Kernel post-stratification and chain equating methods taking into account optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and Kernel test equating methods were…
Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis
Kannan, Priya; Zapata-Rivera, Diego; Bryant, Andrew D. – Practical Assessment, Research & Evaluation, 2021
Individual-student score reports sometimes include information about precision of scores (i.e., measurement error). In this study, we specifically investigated if parents understand this information when presented. We conducted an online experimental study where 196 parents of middle school children, from various parts of the country, were…
Descriptors: Comprehension, Parents, Error of Measurement, Test Interpretation
Mahmut Sami Yigiter – Journal of Theoretical Educational Science, 2024
One of the main objectives of international large-scale assessments is to make comparisons between different countries, education policies, education systems, or subgroups. One of the main criteria for making comparisons between different groups is to ensure measurement invariance. The purpose of this study was to test the measurement invariance…
Descriptors: Mathematics, Mathematics Skills, Grade 4, Grade 8
Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021
Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…
Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory
Kathleen Lynne Lane; Wendy Peia Oakes; Mark Matthew Buckman; Nathan Allen Lane; Katie Scarlett Lane; Kandace Fleming; Rebecca E. Swinburne Romine; Rebecca L. Sherod; Chi-Ning Chang; Jamie Jones; Emily Dawn Cantwell; Meredith Crittenden – Remedial and Special Education, 2024
Given the need for a swift, systematic way to identify students with internalizing and externalizing behavior patterns to connect these students with appropriate supports, we present new findings of the Student Risk Screening Scale--Internalizing and Externalizing (SRSS-IE). In this article, we examined (a) factor structure of the SRSS-IE and (b)…
Descriptors: Screening Tests, At Risk Students, Psychometrics, Factor Structure
Chen, Chia-Wen; Andersson, Björn; Zhu, Jinxin – Journal of Educational Measurement, 2023
The certainty of response index (CRI) measures respondents' confidence level when answering an item. In conjunction with the answers to the items, previous studies have used descriptive statistics and arbitrary thresholds to identify student knowledge profiles with the CRIs. Whereas this approach overlooked the measurement error of the observed…
Descriptors: Item Response Theory, Factor Analysis, Psychometrics, Test Items
Visser, Linda; Cartschau, Friederike; von Goldammer, Ariane; Brandenburg, Janin; Timmerman, Marieke; Hasselhorn, Marcus; Mähler, Claudia – Applied Measurement in Education, 2023
The growing number of children in primary schools in Germany who have German as their second language (L2) has raised questions about the fairness of performance assessment. Fair tests are a prerequisite for distinguishing between L2 learning delay and a specific learning disability. We evaluated five commonly used reading and spelling tests for…
Descriptors: Foreign Countries, Error of Measurement, Second Language Learning, German
Wang, Ze – Large-scale Assessments in Education, 2022
In educational and psychological research, it is common to use latent factors to represent constructs and then to examine covariate effects on these latent factors. Using empirical data, this study applied three approaches to covariate effects on latent factors: the multiple-indicator multiple-cause (MIMIC) approach, multiple group confirmatory…
Descriptors: Comparative Analysis, Evaluation Methods, Grade 8, Mathematics Achievement
Qian, Jiahe – ETS Research Report Series, 2020
The finite population correction (FPC) factor is often used to adjust variance estimators for survey data sampled from a finite population without replacement. As a replicated resampling approach, the jackknife approach is usually implemented without the FPC factor incorporated in its variance estimates. A paradigm is proposed to compare the…
Descriptors: Computation, Sampling, Data, Statistical Analysis
Kara, Hakan; Cetin, Sevda – International Journal of Assessment Tools in Education, 2020
This study examined and compared the efficiency of various random sampling methods for reducing the number of items rated by judges in an Angoff standard-setting study. First, the full-length test was formed by combining Placement Test 2012 and 2013 mathematics subsets. Then, simple random sampling…
Descriptors: Cutting Scores, Standard Setting (Scoring), Sampling, Error of Measurement
Yi, Soohyun; Pereira, Nielsen; Ahn, Inok; Lee, Soonmook – Journal of Psychoeducational Assessment, 2022
For decades, achievement goal theory has been extensively used, but empirical research still requires a clearer understanding of the underlying factors conceptualized and measured during secondary school periods. In light of the increasing use of longitudinal studies in motivation research, this study aims to investigate the longitudinal…
Descriptors: Factor Structure, Secondary School Students, Longitudinal Studies, Goal Orientation