ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	20

Descriptor

Difficulty Level	26
Mathematics Tests	26
Reading Tests	26
Test Items	20
Achievement Tests	10
Elementary Secondary Education	9
Academic Achievement	6
Item Response Theory	6
Language Tests	6
Science Tests	6
Standardized Tests	6
Test Format	6
Comparative Analysis	5
High School Students	5
Test Construction	5
Testing Programs	5
Computer Assisted Testing	4
High Schools	4
Minimum Competency Testing	4
Test Reliability	4
White Students	4
Comparative Testing	3
Computation	3
Grade 9	3
Instructional Program…	3

Source

Applied Measurement in…	2
ETS Research Report Series	2
Educational Assessment	2
NWEA	2
Pearson	2
ProQuest LLC	2
Smarter Balanced Assessment…	2
American Annals of the Deaf	1
Assessment in Education:…	1
Journal of Applied Testing…	1
Northwest Evaluation…	1
Partnership for Assessment of…	1
RAND Corporation	1

Publication Type

Reports - Research	19
Journal Articles	9
Speeches/Meeting Papers	6
Numerical/Quantitative Data	4
Reports - Evaluative	4
Dissertations/Theses -…	2
Reports - Descriptive	1

Education Level

Elementary Secondary Education	11
Secondary Education	6
Elementary Education	5
Intermediate Grades	4
Middle Schools	4
Grade 4	3
Grade 5	3
Grade 6	3
Grade 7	3
Grade 8	3
High Schools	3
Junior High Schools	3
Grade 3	2
Grade 9	2
Early Childhood Education	1
Primary Education	1

Audience

Policymakers	1
Practitioners	1
Researchers	1

Location

Arizona	2
Alabama	1
Arkansas	1
Colorado	1
District of Columbia	1
Hawaii	1
Idaho	1
Illinois	1
Indiana	1
Maryland	1
Massachusetts	1
Mississippi	1
New Jersey	1
New Mexico	1
Ohio	1
Rhode Island	1
Texas	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	3
Elementary and Secondary…	1

Assessments and Surveys

Alabama High School…	2
Measures of Academic Progress	2
National Assessment of…	2
Program for International…	2
Preliminary Scholastic…	1
Progress in International…	1
SAT (College Admission Test)	1
Stanford Achievement Tests	1
Trends in International…	1

Showing 1 to 15 of 26 results

Using Full-Information Item Analysis to Improve Item Quality

Peer reviewed

Haladyna, Thomas M.; Rodriguez, Michael C. – Educational Assessment, 2021

Full-information item analysis provides item developers and reviewers comprehensive empirical evidence of item quality, including option response frequency, point-biserial index (PBI) for distractors, mean-scores of respondents selecting each option, and option trace lines. The multi-serial index (MSI) is introduced as a more informative…

Descriptors: Test Items, Item Analysis, Reading Tests, Mathematics Tests

An Investigation of Item Parameter Invariance Using Focused Calibration Samples for MAP Growth

He, Wei – NWEA, 2021

New MAP® Growth™ assessments are being developed that administer items more closely matched to the grade level of the student. However, MAP Growth items are calibrated with samples that typically consist of students from a variety of grades, including the target grade to which an item is aligned. While this choice of calibration sample is…

Descriptors: Achievement Tests, Test Items, Instructional Program Divisions, Difficulty Level

MAP Growth Item Parameter Drift Study

He, Wei – NWEA, 2022

To ensure that student academic growth in a subject area is accurately captured, it is imperative that the underlying scale remains stable over time. As item parameter stability constitutes one of the factors that affects scale stability, NWEA® periodically conducts studies to check for the stability of the item parameter estimates for MAP®…

Descriptors: Achievement Tests, Test Items, Test Reliability, Academic Achievement

Evaluating Item Response Theory Linking and Model Fit for Data from PISA 2000-2012

Peer reviewed

von Davier, Matthias; Yamamoto, Kentaro; Shin, Hyo Jeong; Chen, Henry; Khorramdel, Lale; Weeks, Jon; Davis, Scott; Kong, Nan; Kandathil, Mat – Assessment in Education: Principles, Policy & Practice, 2019

Based on concerns about the item response theory (IRT) linking approach used in the Programme for International Student Assessment (PISA) until 2012 as well as the desire to include new, more complex, interactive items with the introduction of computer-based assessments, alternative IRT linking methods were implemented in the 2015 PISA round. The…

Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment

Embedded Field Test Item Statistics: Can They Be Trusted for Estimating Student Proficiency?

Peer reviewed

Steedle, Jeffrey T.; Morrison, Kristin M. – Educational Assessment, 2019

Assessment items are commonly field tested prior to operational use to observe statistical item properties such as difficulty. Item parameter estimates from field testing may be used to assign scores via pre-equating or computer adaptive designs. This study examined differences between item difficulty estimates based on field test and operational…

Descriptors: Field Tests, Test Items, Statistics, Difficulty Level

The Measure Matters: Examining Achievement Gaps on Cognitively Demanding Reading and Mathematics Assessments. Policy Information Report and ETS Research Report Series No. RR-19-43

Peer reviewed

Kevelson, Marisol J. C. – ETS Research Report Series, 2019

This study presents estimates of Black-White, Hispanic-White, and income achievement gaps using data from two different types of reading and mathematics assessments: constructed-response assessments that were likely more cognitively demanding and state achievement tests that were likely less cognitively demanding (i.e., composed solely or largely…

Descriptors: Racial Differences, Achievement Gap, White Students, African American Students

Smarter Balanced Assessment Consortium: Alignment Study Report. Revised

Smarter Balanced Assessment Consortium, 2016

The goal of this study was to gather comprehensive evidence about the alignment of the Smarter Balanced summative assessments to the Common Core State Standards (CCSS). Alignment of the Smarter Balanced summative assessments to the CCSS is a critical piece of evidence regarding the validity of inferences students, teachers and policy makers can…

Descriptors: Alignment (Education), Summative Evaluation, Common Core State Standards, Test Content

Stereotype Threat, Inquiring about Test Takers' Race and Gender, and Performance on Low-Stakes Tests in a Large-Scale Assessment. Research Report. ETS RR-15-02

Peer reviewed

Stricker, Lawrence J.; Rock, Donald A.; Bridgeman, Brent – ETS Research Report Series, 2015

This study explores stereotype threat on low-stakes tests used in a large-scale assessment, math and reading tests in the Education Longitudinal Study of 2002 (ELS). Issues identified in laboratory research (though not observed in studies of high-stakes tests) were assessed: whether inquiring about their race and gender is related to the…

Descriptors: Stereotypes, Reading Tests, Mathematics Tests, Longitudinal Studies

Spring 2015 Digital Devices Comparability Research Study

Steedle, Jeffrey; McBride, Malena; Johnson, Marc; Keng, Leslie – Partnership for Assessment of Readiness for College and Careers, 2016

The first operational administration of the Partnership for Assessment of Readiness for College and Careers (PARCC) took place during the 2014-2015 school year. In addition to the traditional paper-and-pencil format, the assessments were available for administration on a variety of electronic devices, including desktop computers, laptop computers,…

Descriptors: Computer Assisted Testing, Difficulty Level, Test Items, Scores

Smarter Balanced "Tests of the Test" Successful: Field Test Provides Clear Path Forward

Doorey, Nancy – Smarter Balanced Assessment Consortium, 2014

Between March and June of 2014, the Smarter Balanced Assessment Consortium conducted a field test of its new online assessment system. Thirteen participating states provided the results of surveys given to students and adults involved in the Field Test. Overall, more than 70% of test coordinators in each of seven states indicated that the Field…

Descriptors: Field Tests, Computer Assisted Testing, Student Surveys, Surveys

The State of Proficiency: How Student Proficiency Rates Vary across States, Subjects, and Grades between 2002 and 2010

Durant, Sarah; Dahlin, Michael – Northwest Evaluation Association, 2011

In 2007, the Northwest Evaluation Association (NWEA) and the Thomas B. Fordham Institute collaborated on "The Proficiency Illusion," a study that illustrated the issues created by having each state set its own standards for what constitutes student proficiency for reading and mathematics tests, while holding all states to the same…

Descriptors: Mathematics Tests, Cutting Scores, Achievement Tests, State Standards

Test Item Linguistic Complexity and Assessments for Deaf Students

Peer reviewed

Cawthon, Stephanie – American Annals of the Deaf, 2011

Linguistic complexity of test items is one test format element that has been studied in the context of struggling readers and their participation in paper-and-pencil tests. The present article presents findings from an exploratory study on the potential relationship between linguistic complexity and test performance for deaf readers. A total of 64…

Descriptors: Language Styles, Test Content, Syntax, Linguistics

Modified Multiple-Choice Items for Alternate Assessments: Reliability, Difficulty, and Differential Boost

Peer reviewed

Kettler, Ryan J.; Rodriguez, Michael C.; Bolt, Daniel M.; Elliott, Stephen N.; Beddow, Peter A.; Kurz, Alexander – Applied Measurement in Education, 2011

Federal policy on alternate assessment based on modified academic achievement standards (AA-MAS) inspired this research. Specifically, an experimental study was conducted to determine whether tests composed of modified items would have the same level of reliability as tests composed of original items, and whether these modified items helped reduce…

Descriptors: Multiple Choice Tests, Test Items, Alternative Assessment, Test Reliability

Measuring Deeper Learning through Cognitively Demanding Test Items: Results from the Analysis of Six National and International Exams. Research Report

Yuan, Kun; Le, Vi-Nhuan – RAND Corporation, 2014

In 2010, the William and Flora Hewlett Foundation's Education Program established the Deeper Learning Initiative, which focuses on students' development of deeper learning skills (i.e., the mastery of core academic content, critical-thinking, problem-solving, collaboration, communication, and "learn-how-to-learn" skills). Two test…

Descriptors: Test Items, Cognitive Processes, Difficulty Level, Skill Development

Population Invariance of Vertical Scaling Results

Powers, Sonya; Turhan, Ahmet; Binici, Salih – Pearson, 2012

The population sensitivity of vertical scaling results was evaluated for a state reading assessment spanning grades 3-10 and a state mathematics test spanning grades 3-8. Subpopulations considered included males and females. The 3-parameter logistic model was used to calibrate math and reading items and a common item design was used to construct…

Descriptors: Scaling, Equated Scores, Standardized Tests, Reading Tests

Author

He, Wei	2
Meyers, Jason L.	2
Rodriguez, Michael C.	2
Steele, D. Joyce	2
Turhan, Ahmet	2
Barden, Tiffannie M.	1
Beddow, Peter A.	1
Bielinski, John	1
Binici, Salih	1
Bolt, Daniel M.	1
Bridgeman, Brent	1
Cawthon, Stephanie	1
Chen, Henry	1
Dahlin, Michael	1
Davis, Scott	1
Doorey, Nancy	1
Durant, Sarah	1
Elliott, Stephen N.	1
Freidebach, Jim	1
Freidebach, Melodie	1
Goodman, Joshua	1
Haladyna, Thomas M.	1
Hu, P. Gillian	1
Johnson, Marc	1