Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022
Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…
Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores
Liu, Ivy; Suesse, Thomas; Harvey, Samuel; Gu, Peter Yongqi; Fernández, Daniel; Randal, John – Educational and Psychological Measurement, 2023
The Mantel-Haenszel estimator is one of the most popular techniques for measuring differential item functioning (DIF). A generalization of this estimator is applied to the context of DIF to compare items by taking the covariance of odds ratio estimators between dependent items into account. Unlike Item Response Theory, the method does not rely…
Descriptors: Test Bias, Computation, Statistical Analysis, Achievement Tests
Erbeli, Florina; He, Kai; Cheek, Connor; Rice, Marianne; Qian, Xiaoning – Scientific Studies of Reading, 2023
Purpose: Researchers have developed a constellation model of decoding-related reading disabilities (RD) to improve the RD risk determination. The model's hallmark is its inclusion of various RD indicators to determine RD risk. Classification methods such as logistic regression (LR) might be one way to determine RD risk within the constellation…
Descriptors: At Risk Students, Reading Difficulties, Classification, Comparative Analysis
Mehrazmay, Roghayeh; Ghonsooly, Behzad; de la Torre, Jimmy – Applied Measurement in Education, 2021
The present study aims to examine gender differential item functioning (DIF) in the reading comprehension section of a high-stakes test using cognitive diagnosis models. Based on the multiple-group generalized deterministic inputs, noisy "and" gate (MG G-DINA) model, the Wald test and likelihood ratio test are used to detect DIF. The flagged…
Descriptors: Test Bias, College Entrance Examinations, Gender Differences, Reading Tests
Wang, Lu; Steedle, Jeffrey – ACT, Inc., 2020
In recent ACT mode comparability studies, students testing on laptop or desktop computers earned slightly higher scores on average than students who tested on paper, especially on the ACT® reading and English tests (Li et al., 2017). Equating procedures adjust for such "mode effects" to make ACT scores comparable regardless of testing…
Descriptors: Test Format, Reading Tests, Language Tests, English
Joshua B. Gilbert – Annenberg Institute for School Reform at Brown University, 2022
This simulation study examines the characteristics of the Explanatory Item Response Model (EIRM) when estimating treatment effects when compared to classical test theory (CTT) sum and mean scores and item response theory (IRT)-based theta scores. Results show that the EIRM and IRT theta scores provide generally equivalent bias and false positive…
Descriptors: Item Response Theory, Models, Test Theory, Computation
Joshua B. Gilbert – Annenberg Institute for School Reform at Brown University, 2024
When analyzing treatment effects on test scores, researchers face many choices and competing guidance for scoring tests and modeling results. This study examines the impact of scoring choices through simulation and an empirical application. Results show that estimates from multiple methods applied to the same data will vary because two-step models…
Descriptors: Scores, Statistical Bias, Statistical Inference, Scoring
Khoshsima, Hooshang; Saed, Amin; Mousaei, Fatemeh – Advances in Language and Literary Studies, 2018
Language proficiency tests have become common instruments to judge people based on their performance. Thus, the scores on language proficiency tests, such as the International English Language Testing System (IELTS) or Test of English as a Foreign Language (TOEFL), play a crucial role in the test-takers' lives. Because of increasing demands on…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Test Wiseness
Li, Feifei – ETS Research Report Series, 2017
An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…
Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement
McCray, Gareth; Brunfaut, Tineke – Language Testing, 2018
This study investigates test-takers' processing while completing banked gap-fill tasks, designed to test reading proficiency, in order to test theoretically based expectations about the variation in cognitive processes of test-takers across levels of performance. Twenty-eight test-takers' eye traces on 24 banked gap-fill items (on six tasks) were…
Descriptors: Language Tests, Test Items, Item Analysis, Eye Movements
Alonzo, Julie; Anderson, Daniel – Behavioral Research and Teaching, 2018
In response to a request for additional analyses, in particular reporting confidence intervals around the results, we re-analyzed the data from prior studies. This supplementary report presents the results of the additional analyses addressing classification accuracy, reliability, and criterion-related validity evidence. For ease of reference, we…
Descriptors: Curriculum Based Assessment, Computation, Statistical Analysis, Classification
Wei, Youhua; Low, Albert – ETS Research Report Series, 2017
In most large-scale programs of tests that aid in making high-stakes decisions, such as the "TOEIC"® family of products and services, it is not unusual for a significant portion of test takers to retake the test multiple times. The study reported here used multilevel growth modeling to explore the score change patterns of nearly 20,000…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores
De Clercq-Quaegebeur, Maryse; Casalis, Séverine; Vilette, Bruno; Lemaitre, Marie-Pierre; Vallée, Louis – Journal of Learning Disabilities, 2018
A high comorbidity between reading and arithmetic disabilities has already been reported. The present study aims at identifying more precisely patterns of arithmetic performance in children with developmental dyslexia, defined with severe and specific criteria. By means of a standardized test of achievement in mathematics ("Calculation and…
Descriptors: Arithmetic, Mathematics Skills, Elementary School Students, Dyslexia
Cavalli, Eddy; Colé, Pascale; Leloup, Gilles; Poracchia-George, Florence; Sprenger-Charolles, Liliane; El Ahmadi, Abdessadek – Journal of Learning Disabilities, 2018
Developmental dyslexia is a lifelong impairment affecting 5% to 10% of the population. In French-speaking countries, although a number of standardized tests for dyslexia in children are available, tools suitable to screen for dyslexia in adults are lacking. In this study, we administered the "Alouette" reading test to a normative sample…
Descriptors: Foreign Countries, Screening Tests, Disability Identification, Dyslexia
Hosp, John L.; Ford, Jeremy W.; Huddle, Sally M.; Hensley, Kiersten K. – Assessment for Effective Intervention, 2018
Replication is a foundation of the development of a knowledge base in an evidence-based field such as education. This study includes two direct replications of Hosp, Hensley, Huddle, and Ford which found evidence of criterion-related validity of curriculum-based measurement (CBM) for reading and mathematics with postsecondary students with…
Descriptors: Replication (Evaluation), Evaluation Research, Curriculum Based Assessment, Developmental Disabilities