Publication Date
In 2025 | 0 |
Since 2024 | 7 |
Since 2021 (last 5 years) | 14 |
Since 2016 (last 10 years) | 27 |
Since 2006 (last 20 years) | 121 |
Author
Loeb, Susanna | 4 |
Zwick, Rebecca | 3 |
Chen, Peijie | 2 |
Haag, Nicole | 2 |
Isenberg, Eric | 2 |
McCaffrey, Daniel F. | 2 |
Raudenbush, Stephen W. | 2 |
Sachse, Karoline A. | 2 |
Traynor, Anne | 2 |
Wang, Chao | 2 |
Zapata-Rivera, Diego | 2 |
Education Level
Elementary Secondary Education | 126 |
Elementary Education | 15 |
Higher Education | 12 |
Secondary Education | 12 |
Middle Schools | 7 |
Grade 4 | 6 |
Junior High Schools | 6 |
Postsecondary Education | 6 |
Grade 3 | 5 |
Grade 5 | 5 |
Grade 8 | 5 |
Audience
Policymakers | 3 |
Researchers | 2 |
Community | 1 |
Parents | 1 |
Practitioners | 1 |
Location
California | 8 |
United States | 8 |
Florida | 5 |
North Carolina | 5 |
Texas | 5 |
United Kingdom (England) | 5 |
Australia | 4 |
New York | 4 |
Illinois | 3 |
Israel | 3 |
New Jersey | 3 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 6 |
Race to the Top | 2 |
Elementary and Secondary… | 1 |
Every Student Succeeds Act… | 1 |
Regional Educational Laboratory Mid-Atlantic, 2024
These are the appendixes for the report, "Stabilizing School Performance Indicators in New Jersey to Reduce the Effect of Random Error." This study applied a stabilization model called Bayesian hierarchical modeling to group-level data (with groups assigned according to demographic designations) within schools in New Jersey with the aim…
Descriptors: Institutional Evaluation, Elementary Secondary Education, Bayesian Statistics, Test Reliability
Emily A. Brown – ProQuest LLC, 2024
Previous research has been limited regarding the measurement of computational thinking, particularly as a learning progression in K-12. This study proposes to apply a multidimensional item response theory (IRT) model to a newly developed measure of computational thinking utilizing both selected response and open-ended polytomous items to establish…
Descriptors: Models, Computation, Thinking Skills, Item Response Theory
Robert Meyer; Sara Hu; Michael Christian – Society for Research on Educational Effectiveness, 2023
Background: This paper develops a new method to estimate quasi-experimental evaluation models when it is necessary to control for measurement error in predictors and individual assignment to the treatment group is based on these same fallible variables. A major methodological finding of the study is that standard methods of estimating models that…
Descriptors: Error of Measurement, Measurement Techniques, Elementary Secondary Education, Report Cards
Jose Antonio Mola Avila – ProQuest LLC, 2023
Accountability in education was implemented to improve poor learning outcomes by documenting and monitoring learning achievement results. In this process, external standardized achievement tests have played a central role, being the mechanism most frequently used to measure learning outcomes. However, several decades after its initial…
Descriptors: Foreign Countries, Standardized Tests, Achievement Tests, Accountability
Mahmut Sami Yigiter – Journal of Theoretical Educational Science, 2024
One of the main objectives of international large-scale assessments is to make comparisons between different countries, education policies, education systems, or subgroups. One of the main criteria for making comparisons between different groups is to ensure measurement invariance. The purpose of this study was to test the measurement invariance…
Descriptors: Mathematics, Mathematics Skills, Grade 4, Grade 8
Nikola Ebenbeck; Morten Bastian; Andreas Mühling; Markus Gebhardt – Journal of Computer Assisted Learning, 2024
Background: Computerised adaptive tests (CATs) are tests that provide personalised, efficient and accurate measurement while reducing testing time, depending on the desired level of precision. Schools have different types of assessments that can benefit from a significant reduction in testing time to varying degrees, depending on the area of…
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Public Schools, Special Schools
Traynor, Anne; Li, Tingxuan; Zhou, Shuqi – Applied Measurement in Education, 2020
During the development of large-scale school achievement tests, panels of independent subject-matter experts use systematic judgmental methods to rate the correspondence between a given test's items and performance objective statements. The individual experts' ratings may then be used to compute summary indices to quantify the match between a…
Descriptors: Alignment (Education), Achievement Tests, Curriculum, Error of Measurement
Grinshtain, Yael; Zibenberg, Alexander; Addi-Raccah, Audrey – International Journal of Research in Education and Science, 2023
The present study aimed to demonstrate how the mixed methods approach was used to develop and validate a quantitative instrument for measuring forms of capital (a "Capital Scale") among K-12 teachers, using the two-phase approach of an exploratory sequential model. The study includes: (1) a qualitative phase based on 16 semi-structured…
Descriptors: Social Capital, Cultural Capital, Arabs, Jews
Morgan Rosendahl; Brian Gill; Jennifer E. Starling – Regional Educational Laboratory Mid-Atlantic, 2024
The Every Student Succeeds Act of 2015 requires states to use a variety of indicators, including standardized tests and attendance records, to designate schools for support and improvement based on schoolwide performance and the performance of groups of students within schools. Schoolwide and group-level performance indicators are also…
Descriptors: Institutional Evaluation, Elementary Secondary Education, Bayesian Statistics, Test Reliability
Kritika Thapa – ProQuest LLC, 2023
Measurement invariance is crucial for making valid comparisons across different groups (Kline, 2016; Vandenberg, 2002). To address the challenges associated with invariance testing such as large sample size requirements, the complexity of the model, etc., applied researchers have incorporated parcels. Parcels have been shown to alleviate skewness,…
Descriptors: Elementary Secondary Education, Achievement Tests, Foreign Countries, International Assessment
Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…
Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics
Sachse, Karoline A.; Haag, Nicole – Applied Measurement in Education, 2017
Standard errors computed according to the operational practices of international large-scale assessment studies such as the Programme for International Student Assessment (PISA) or the Trends in International Mathematics and Science Study (TIMSS) may be biased when cross-national differential item functioning (DIF) and item parameter drift are…
Descriptors: Error of Measurement, Test Bias, International Assessment, Computation
Yanan Feng – ProQuest LLC, 2021
This dissertation aims to investigate the effect size measures of differential item functioning (DIF) detection in the context of cognitive diagnostic models (CDMs). A variety of DIF detection techniques have been developed in the context of CDMs. However, most of the DIF detection procedures focus on the null hypothesis significance test. Few…
Descriptors: Effect Size, Item Response Theory, Cognitive Measurement, Models
Shergill, Gagan; Camozzi, Hailey; O'Malley, Meagan D.; Ortiz, Arlene – Journal of Psychoeducational Assessment, 2023
The Comprehensive Test of Phonological Processing, 2nd Edition (CTOPP-2; Wagner et al., 2013) is commonly used in K-12 public schools to assess basic cognitive processing skills foundational for reading achievement. Psychometric support for its use with dual language learners (DLLs), a group representing over 10% of the school-aged population in…
Descriptors: Phonology, Language Processing, Bilingualism, English (Second Language)
Grund, Simon; Lüdtke, Oliver; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021
Large-scale assessments (LSAs) use Mislevy's "plausible value" (PV) approach to relate student proficiency to noncognitive variables administered in a background questionnaire. This method requires background variables to be completely observed, a requirement that is seldom fulfilled. In this article, we evaluate and compare the…
Descriptors: Data Analysis, Error of Measurement, Research Problems, Statistical Inference