Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 12 |
Descriptor
Elementary Secondary Education | 36 |
Mathematics Achievement | 36 |
Test Validity | 36 |
Test Reliability | 18 |
Mathematics Tests | 17 |
Achievement Tests | 16 |
Reading Achievement | 14 |
Foreign Countries | 9 |
Student Evaluation | 9 |
International Assessment | 8 |
Academic Achievement | 7 |
More ▼ |
Source
Author
Sugrue, Brenda | 2 |
Abedi, Jamal | 1 |
Adkins, Deborah | 1 |
Bachor, Dan G. | 1 |
Baker, Eva L. | 1 |
Bening, Mary Ellen | 1 |
Bliss, Leonard B. | 1 |
Burstein, Leigh | 1 |
Buzick, Heather | 1 |
Carnoy, Martin | 1 |
Casey, Beth M. | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 10 |
Elementary Education | 4 |
Middle Schools | 4 |
Secondary Education | 4 |
Grade 8 | 3 |
Junior High Schools | 3 |
Grade 4 | 2 |
Intermediate Grades | 2 |
Grade 1 | 1 |
Audience
Researchers | 8 |
Policymakers | 1 |
Practitioners | 1 |
Location
United States | 4 |
Massachusetts | 2 |
Arizona | 1 |
California | 1 |
Canada | 1 |
China (Shanghai) | 1 |
Colorado | 1 |
Delaware | 1 |
Idaho | 1 |
Illinois | 1 |
Indiana | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
Toker, Turker; Green, Kathy – International Journal of Assessment Tools in Education, 2021
This study provides a comparison of the results of latent class analysis (LCA) and mixture Rasch model (MRM) analysis using data from the Trends in International Mathematics and Science Study -- 2011 (TIMSS-2011) with a focus on the 8th-grade mathematics section. The research study focuses on the comparison of LCA and MRM to determine if results…
Descriptors: Multivariate Analysis, Structural Equation Models, Item Response Theory, Achievement Tests
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education
Oon, Pey Tee; Subramaniam, R. – International Journal of Science Education, 2018
We report here on a comparative study of middle school students' attitudes towards science involving three countries: England, Singapore and the U.S.A. Complete attitudinal data sets from TIMSS (Trends in International Mathematics and Science Study) 2011 were used, thus giving a very large sample size (N = 20,246), compared to other studies in the…
Descriptors: Foreign Countries, Comparative Education, Middle School Students, Student Attitudes
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Buzick, Heather; Stone, Elizabeth – Educational Measurement: Issues and Practice, 2014
Read aloud is a testing accommodation that has been studied by many researchers, and its use on K-12 assessments continues to be debated because of its potential to change the measured construct or unfairly increase test scores. This study is a summary of quantitative research on the read aloud accommodation. Previous studies contributed…
Descriptors: Meta Analysis, Reading Aloud to Others, Educational Research, Statistical Analysis
Casey, Beth M.; Lombardi, Caitlin McPherran; Pollock, Amanda; Fineman, Bonnie; Pezaris, Elizabeth – Journal of Cognition and Development, 2017
This study investigated longitudinal pathways leading from early spatial skills in first-grade girls to their fifth-grade analytical math reasoning abilities (N = 138). First-grade assessments included spatial skills, verbal skills, addition/subtraction skills, and frequency of choice of a decomposition or retrieval strategy on the…
Descriptors: Females, Arithmetic, Mathematics Instruction, Predictor Variables
Carnoy, Martin – National Education Policy Center, 2015
Stanford education professor Martin Carnoy examines four main critiques of how international test results are used in policymaking. Of particular interest are critiques of the policy analyses published by the Program for International Student Assessment (PISA). Using average PISA scores as a comparative measure of student achievement is misleading…
Descriptors: Criticism, Reputation, Test Validity, Error of Measurement

Slate, John R.; Saarnio, David A. – B.C. Journal of Special Education, 1996
Reading and math achievement subtest scores on several standard achievement tests were compared for 233 students with mental retardation. Correlations were generally moderate among subtests purporting to measure similar constructs. Significant mean differences were present for five of seven reading test comparisons and for six of eight math…
Descriptors: Achievement Tests, Correlation, Elementary Secondary Education, Mathematics Achievement

Burstein, Leigh; Koretz, Daniel; Linn, Robert; Sugrue, Brenda; Novak, John; Baker, Eva L.; Harris, Elizabeth Lewis – Educational Assessment, 1996
Three studies evaluating the validity of the descriptors and exemplars of the National Assessment of Educational Progress (NAEP) as characterizations of the actual mathematics performance of students at achievement levels are reported. Serious inconsistencies were found between actual performance and descriptors and exemplars. Recommendations for…
Descriptors: Elementary Secondary Education, Mathematics Achievement, Mathematics Tests, National Surveys
Lazarus, Belinda D.; And Others – Diagnostique, 1990
The Peabody Individual Achievement Test-Revised offers a standardized assessment of an individual's level of academic performance in reading, mathematics, spelling, written expression, and encyclopedic knowledge. The test is designed as a screening device for students ages 5-18. This paper describes administration, summation of data,…
Descriptors: Academic Achievement, Elementary Secondary Education, Mathematics Achievement, Reading Achievement
Miller, Lamoine – Diagnostique, 1990
The Kaufman Test of Educational Achievement is a norm-referenced, individually administered measure of the school achievement of students in grades 1-12, focusing on the areas of reading decoding, reading comprehension, mathematics application, mathematics computation, and spelling. The test's administration, summation of data, standardization,…
Descriptors: Achievement Tests, Elementary Secondary Education, Mathematics Achievement, Norm Referenced Tests
Isaacson, Stephen L. – Diagnostique, 1990
The Multilevel Academic Survey Test is intended for students in grades 3-8 and older students who perform inadequately on K-8 reading and math content. It determines which students need special services and determines appropriate instruction according to specific curriculum objectives. This paper describes administration, data summation,…
Descriptors: Diagnostic Teaching, Elementary Secondary Education, Handicap Identification, Mathematics Achievement

Parmar, Rene S.; And Others – Learning Disability Quarterly, 1996
This study used the Assessment Standards of the National Council of Teachers of Mathematics to evaluate the appropriateness and adequacy of selected standardized tests of mathematics achievement as they pertain to students with disabilities. Problems with content validity included inadequate representation of content domains, inappropriate…
Descriptors: Academic Standards, Content Validity, Elementary Secondary Education, Mathematics Achievement