Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 6 |
Descriptor
| Advanced Placement Programs | 7 |
| Test Construction | 7 |
| Test Items | 7 |
| Test Validity | 5 |
| Models | 3 |
| Multiple Choice Tests | 3 |
| Psychometrics | 3 |
| Scores | 3 |
| Test Reliability | 3 |
| Difficulty Level | 2 |
| Educational Assessment | 2 |
| More ▼ | |
Author
| Hendrickson, Amy | 4 |
| Huff, Kristen | 3 |
| Ewing, Maureen | 2 |
| Kaliski, Pamela | 2 |
| Patterson, Brian | 2 |
| France, Megan | 1 |
| Luecht, Richard | 1 |
| Melican, Gerald | 1 |
| Melican, Gerald J. | 1 |
| Reshetar, Rosemary | 1 |
| Thurber, Allison | 1 |
| More ▼ | |
Publication Type
| Reports - Evaluative | 3 |
| Journal Articles | 2 |
| Non-Print Media | 2 |
| Reference Materials - General | 2 |
| Reports - Research | 2 |
| Speeches/Meeting Papers | 2 |
| Collected Works - General | 1 |
| Guides - Non-Classroom | 1 |
Education Level
| High Schools | 3 |
| Secondary Education | 3 |
| Higher Education | 1 |
| Postsecondary Education | 1 |
Audience
| Practitioners | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
| Advanced Placement… | 3 |
| National Assessment of… | 1 |
| SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Hendrickson, Amy; Ewing, Maureen; Kaliski, Pamela; Huff, Kristen – Journal of Applied Testing Technology, 2013
Evidence-centered design (ECD) is an orientation towards assessment development. It differs from conventional practice in several ways and consists of multiple activities. Each of these activities results in a set of useful documentation: domain analysis, domain modeling, construction of the assessment framework, and assessment…
Descriptors: Evidence, Test Construction, Educational Assessment, Learning Theories
Kaliski, Pamela; France, Megan; Huff, Kristen; Thurber, Allison – College Board, 2011
Developing a cognitive model of task performance is an important and often overlooked phase in assessment design; failing to establish such a model can threaten the validity of the inferences made from the scores produced by an assessment (e.g., Leighton, 2004). Conducting think aloud interviews (TAIs), where students think aloud while completing…
Descriptors: World History, Advanced Placement Programs, Achievement Tests, Protocol Analysis
Hendrickson, Amy; Huff, Kristen; Luecht, Richard – Applied Measurement in Education, 2010
Evidence-centered assessment design (ECD) explicates a transparent evidentiary argument to warrant the inferences we make from student test performance. This article describes how the vehicles for gathering student evidence--task models and test specifications--are developed. Task models, which are the basis for item development, flow directly…
Descriptors: Evidence, Test Construction, Measurement, Classification
Reshetar, Rosemary; Melican, Gerald J. – College Board, 2010
This paper discusses issues related to the design and psychometric work for mixed-format tests --tests containing both multiple-choice (MC) and constructed-response (CR) items. The issues of validity, fairness, reliability and score consistency can be addressed but for mixed-format tests there are many decisions to be made and no examination or…
Descriptors: Psychometrics, Test Construction, Multiple Choice Tests, Test Items
Hendrickson, Amy; Patterson, Brian; Ewing, Maureen – College Board, 2010
The psychometric considerations and challenges associated with including constructed response items on tests are discussed along with how these issues affect the form assembly specifications for mixed-format exams. Reliability and validity, security and fairness, pretesting, content and skills coverage, test length and timing, weights, statistical…
Descriptors: Multiple Choice Tests, Test Format, Test Construction, Test Validity
Hendrickson, Amy; Patterson, Brian; Melican, Gerald – College Board, 2008
Presented at the Annual National Council on Measurement in Education (NCME) in New York in March 2008. This presentation explores how different item weighting can affect the effective weights, validity coefficents and test reliability of composite scores among test takers.
Descriptors: Multiple Choice Tests, Test Format, Test Validity, Test Reliability
Educational Testing Service, Princeton, NJ. Policy Information Center. – 1993
Performance assessment, constructed-response, and authentic assessment are topics of current interest in educational testing and reform. This workbook presents the following articles on educational assessment to highlight work being done in this area: (1) "Aquarium Problem and Teacher Guidelines: New Standards Project" (University of…
Descriptors: Advanced Placement Programs, Cognitive Tests, Constructed Response, Educational Assessment

Peer reviewed
Direct link
