Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 8 |
Since 2006 (last 20 years) | 20 |
Descriptor
Difficulty Level | 26 |
Mathematics Tests | 26 |
Reading Tests | 26 |
Test Items | 20 |
Achievement Tests | 10 |
Elementary Secondary Education | 9 |
Academic Achievement | 6 |
Item Response Theory | 6 |
Language Tests | 6 |
Science Tests | 6 |
Standardized Tests | 6 |
Author
He, Wei | 2 |
Meyers, Jason L. | 2 |
Rodriguez, Michael C. | 2 |
Steele, D. Joyce | 2 |
Turhan, Ahmet | 2 |
Barden, Tiffannie M. | 1 |
Beddow, Peter A. | 1 |
Bielinski, John | 1 |
Binici, Salih | 1 |
Bolt, Daniel M. | 1 |
Bridgeman, Brent | 1 |
Publication Type
Reports - Research | 19 |
Journal Articles | 9 |
Speeches/Meeting Papers | 6 |
Numerical/Quantitative Data | 4 |
Reports - Evaluative | 4 |
Dissertations/Theses -… | 2 |
Reports - Descriptive | 1 |
Education Level
Elementary Secondary Education | 11 |
Secondary Education | 6 |
Elementary Education | 5 |
Intermediate Grades | 4 |
Middle Schools | 4 |
Grade 4 | 3 |
Grade 5 | 3 |
Grade 6 | 3 |
Grade 7 | 3 |
Grade 8 | 3 |
High Schools | 3 |
Audience
Policymakers | 1 |
Practitioners | 1 |
Researchers | 1 |
Location
Arizona | 2 |
Alabama | 1 |
Arkansas | 1 |
Colorado | 1 |
District of Columbia | 1 |
Hawaii | 1 |
Idaho | 1 |
Illinois | 1 |
Indiana | 1 |
Maryland | 1 |
Massachusetts | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Elementary and Secondary… | 1 |
Haladyna, Thomas M.; Rodriguez, Michael C. – Educational Assessment, 2021
Full-information item analysis provides item developers and reviewers comprehensive empirical evidence of item quality, including option response frequency, point-biserial index (PBI) for distractors, mean-scores of respondents selecting each option, and option trace lines. The multi-serial index (MSI) is introduced as a more informative…
Descriptors: Test Items, Item Analysis, Reading Tests, Mathematics Tests
He, Wei – NWEA, 2021
New MAP® Growth™ assessments are being developed that administer items more closely matched to the grade level of the student. However, MAP Growth items are calibrated with samples that typically consist of students from a variety of grades, including the target grade to which an item is aligned. While this choice of calibration sample is…
Descriptors: Achievement Tests, Test Items, Instructional Program Divisions, Difficulty Level
He, Wei – NWEA, 2022
To ensure that student academic growth in a subject area is accurately captured, it is imperative that the underlying scale remains stable over time. As item parameter stability constitutes one of the factors that affects scale stability, NWEA® periodically conducts studies to check for the stability of the item parameter estimates for MAP®…
Descriptors: Achievement Tests, Test Items, Test Reliability, Academic Achievement
von Davier, Matthias; Yamamoto, Kentaro; Shin, Hyo Jeong; Chen, Henry; Khorramdel, Lale; Weeks, Jon; Davis, Scott; Kong, Nan; Kandathil, Mat – Assessment in Education: Principles, Policy & Practice, 2019
Based on concerns about the item response theory (IRT) linking approach used in the Programme for International Student Assessment (PISA) until 2012 as well as the desire to include new, more complex, interactive items with the introduction of computer-based assessments, alternative IRT linking methods were implemented in the 2015 PISA round. The…
Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment
Steedle, Jeffrey T.; Morrison, Kristin M. – Educational Assessment, 2019
Assessment items are commonly field tested prior to operational use to observe statistical item properties such as difficulty. Item parameter estimates from field testing may be used to assign scores via pre-equating or computer adaptive designs. This study examined differences between item difficulty estimates based on field test and operational…
Descriptors: Field Tests, Test Items, Statistics, Difficulty Level
Kevelson, Marisol J. C. – ETS Research Report Series, 2019
This study presents estimates of Black-White, Hispanic-White, and income achievement gaps using data from two different types of reading and mathematics assessments: constructed-response assessments that were likely more cognitively demanding and state achievement tests that were likely less cognitively demanding (i.e., composed solely or largely…
Descriptors: Racial Differences, Achievement Gap, White Students, African American Students
Smarter Balanced Assessment Consortium, 2016
The goal of this study was to gather comprehensive evidence about the alignment of the Smarter Balanced summative assessments to the Common Core State Standards (CCSS). Alignment of the Smarter Balanced summative assessments to the CCSS is a critical piece of evidence regarding the validity of inferences students, teachers and policy makers can…
Descriptors: Alignment (Education), Summative Evaluation, Common Core State Standards, Test Content
Stricker, Lawrence J.; Rock, Donald A.; Bridgeman, Brent – ETS Research Report Series, 2015
This study explores stereotype threat on low-stakes tests used in a large-scale assessment, math and reading tests in the Education Longitudinal Study of 2002 (ELS). Issues identified in laboratory research (though not observed in studies of high-stakes tests) were assessed: whether inquiring about their race and gender is related to the…
Descriptors: Stereotypes, Reading Tests, Mathematics Tests, Longitudinal Studies
Steedle, Jeffrey; McBride, Malena; Johnson, Marc; Keng, Leslie – Partnership for Assessment of Readiness for College and Careers, 2016
The first operational administration of the Partnership for Assessment of Readiness for College and Careers (PARCC) took place during the 2014-2015 school year. In addition to the traditional paper-and-pencil format, the assessments were available for administration on a variety of electronic devices, including desktop computers, laptop computers,…
Descriptors: Computer Assisted Testing, Difficulty Level, Test Items, Scores
Doorey, Nancy – Smarter Balanced Assessment Consortium, 2014
Between March and June of 2014, the Smarter Balanced Assessment Consortium conducted a field test of its new online assessment system. Thirteen participating states provided the results of surveys given to students and adults involved in the Field Test. Overall, more than 70% of test coordinators in each of seven states indicated that the Field…
Descriptors: Field Tests, Computer Assisted Testing, Student Surveys, Surveys
Durant, Sarah; Dahlin, Michael – Northwest Evaluation Association, 2011
In 2007, the Northwest Evaluation Association (NWEA) and the Thomas B. Fordham Institute collaborated on "The Proficiency Illusion," a study that illustrated the issues created by having each state set its own standards for what constitutes student proficiency for reading and mathematics tests, while holding all states to the same…
Descriptors: Mathematics Tests, Cutting Scores, Achievement Tests, State Standards
Cawthon, Stephanie – American Annals of the Deaf, 2011
Linguistic complexity of test items is one test format element that has been studied in the context of struggling readers and their participation in paper-and-pencil tests. The present article presents findings from an exploratory study on the potential relationship between linguistic complexity and test performance for deaf readers. A total of 64…
Descriptors: Language Styles, Test Content, Syntax, Linguistics
Kettler, Ryan J.; Rodriguez, Michael C.; Bolt, Daniel M.; Elliott, Stephen N.; Beddow, Peter A.; Kurz, Alexander – Applied Measurement in Education, 2011
Federal policy on alternate assessment based on modified academic achievement standards (AA-MAS) inspired this research. Specifically, an experimental study was conducted to determine whether tests composed of modified items would have the same level of reliability as tests composed of original items, and whether these modified items helped reduce…
Descriptors: Multiple Choice Tests, Test Items, Alternative Assessment, Test Reliability
Yuan, Kun; Le, Vi-Nhuan – RAND Corporation, 2014
In 2010, the William and Flora Hewlett Foundation's Education Program established the Deeper Learning Initiative, which focuses on students' development of deeper learning skills (i.e., the mastery of core academic content, critical-thinking, problem-solving, collaboration, communication, and "learn-how-to-learn" skills). Two test…
Descriptors: Test Items, Cognitive Processes, Difficulty Level, Skill Development
Powers, Sonya; Turhan, Ahmet; Binici, Salih – Pearson, 2012
The population sensitivity of vertical scaling results was evaluated for a state reading assessment spanning grades 3-10 and a state mathematics test spanning grades 3-8. Subpopulations considered included males and females. The 3-parameter logistic model was used to calibrate math and reading items and a common item design was used to construct…
Descriptors: Scaling, Equated Scores, Standardized Tests, Reading Tests