Hendrickson, Amy; Ewing, Maureen; Kaliski, Pamela; Huff, Kristen – Journal of Applied Testing Technology, 2013
Evidence-centered design (ECD) is an orientation towards assessment development. It differs from conventional practice in several ways and consists of multiple activities. Each of these activities results in a set of useful documentation: domain analysis, domain modeling, construction of the assessment framework, and assessment…
Descriptors: Evidence, Test Construction, Educational Assessment, Learning Theories
Kaliski, Pamela; France, Megan; Huff, Kristen; Thurber, Allison – College Board, 2011
Developing a cognitive model of task performance is an important and often overlooked phase in assessment design; failing to establish such a model can threaten the validity of the inferences made from the scores produced by an assessment (e.g., Leighton, 2004). Conducting think-aloud interviews (TAIs), where students think aloud while completing…
Descriptors: World History, Advanced Placement Programs, Achievement Tests, Protocol Analysis
Huff, Kristen; Steinberg, Linda; Matts, Tom – College Board, 2009
Presented at the Annual Meeting of the National Council on Measurement in Education (NCME) in San Diego, CA, in April 2009. This presentation provides an overview of ECD and describes both the benefits of implementing ECD in the Advanced Placement Program and the challenges faced in doing so.
Descriptors: Measurement, Evidence Based Practice, Test Construction, Advanced Placement Programs
Luecht, Richard M.; Sireci, Stephen G. – College Board, 2011
Over the past four decades, there has been incremental growth in computer-based testing (CBT) as a viable alternative to paper-and-pencil testing. However, the transition to CBT is neither easy nor inexpensive. As Drasgow, Luecht, and Bennett (2006) noted, many design engineering, test development, operations/logistics, and psychometric changes…
Descriptors: College Entrance Examinations, Computer Assisted Testing, Educational Technology, Evaluation Methods
Ewing, Maureen; Packman, Sheryl; Hamen, Cynthia; Clark, Allison – College Board, 2009
Presented at the Annual Meeting of the National Council on Measurement in Education (NCME) in San Diego, CA, in April 2009. This presentation describes the methodology that was used with subject-matter experts (SMEs) to articulate the content and skills important in the domain, and then the iterative processes that were used to articulate the claims…
Descriptors: Evidence Based Practice, Advanced Placement Programs, Achievement Tests, Test Construction
Reshetar, Rosemary; Melican, Gerald J. – College Board, 2010
This paper discusses issues related to the design and psychometric work for mixed-format tests, that is, tests containing both multiple-choice (MC) and constructed-response (CR) items. The issues of validity, fairness, reliability, and score consistency can be addressed, but for mixed-format tests there are many decisions to be made and no examination or…
Descriptors: Psychometrics, Test Construction, Multiple Choice Tests, Test Items
Hagge, Sarah Lynn – ProQuest LLC, 2010
Mixed-format tests containing both multiple-choice and constructed-response items are widely used in educational testing. Such tests combine the broad content coverage and efficient scoring of multiple-choice items with the assessment of higher-order thinking skills thought to be provided by constructed-response items. However, the combination of…
Descriptors: Test Format, True Scores, Equated Scores, Psychometrics
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
Stemler, Steven E.; Sternberg, Robert J.; Grigorenko, Elena L.; Jarvin, Linda; Sharpes, Kirsten – Contemporary Educational Psychology, 2009
A new test of Advanced Placement Physics, explicitly designed to balance both content and cognitive-processing skills, was developed using Sternberg's theory of successful intelligence. The test was administered to 281 AP Physics students from 10 schools during the 2006-2007 school year. Six empirically distinguishable profiles of strengths and…
Descriptors: Science Tests, Intelligence, Advanced Placement, Ethnic Groups
Livingston, Samuel A. – 1988
When test-takers are offered a choice of essay questions, some questions may be harder than others. If the test includes a common portion taken by all test-takers, an adjustment to the scores is possible. Previously proposed adjustment procedures disregard the test-makers' efforts to create questions of equal difficulty; these procedures tend to…
Descriptors: Advanced Placement, Correlation, Difficulty Level, Essays
Rudas, Tamas; Zwick, Rebecca – 1995
A method is proposed to assess the importance of differential item functioning (DIF) by estimating the largest possible fraction of the population in which DIF does not occur, or equivalently, the smallest possible portion of the population in which DIF may occur. The approach is based on latent class (C. C. Clogg, 1981) or mixture concepts, and…
Descriptors: Estimation (Mathematics), Goodness of Fit, Item Bias, Maximum Likelihood Statistics
Wainer, Howard; And Others – 1993
The relationship between the multiple-choice and free-response sections of the Computer Science and Chemistry tests of the College Board's Advanced Placement program was studied. Confirmatory factor analysis showed that the free-response sections measure the same underlying proficiency as the multiple-choice sections for the most part. However,…
Descriptors: Advanced Placement, Chemistry, Computer Science, High School Students

Thissen, David; And Others – Journal of Educational Measurement, 1994
Restricted factor analysis shows that the multiple-choice and free-response sections of the Computer Science and Chemistry Advanced Placement examinations (College Board) measure the same proficiencies for the most part. There is a small degree of multidimensionality because of local dependence among free-response items. (SLD)
Descriptors: Advanced Placement, Chemistry, Computer Science, Factor Analysis
Braun, Henry I.; And Others – 1989
The use of constructed response items in large scale standardized testing has been hampered by the costs and difficulties associated with obtaining reliable scores. The advent of expert systems may signal the eventual removal of this impediment. This study investigated the accuracy with which expert systems could score a new, non-multiple choice…
Descriptors: Computer Science, Constructed Response, Expert Systems, High School Seniors
Livingston, Samuel A. – 1992
This study investigated the extent to which log-linear smoothing could improve the accuracy of common-item equating by the chained equipercentile method in small samples of examinees. Examinee response data from a 100-item test (the Advanced Placement Examination in United States History) were used to create two overlapping forms of 58 items each,…
Descriptors: Advanced Placement Programs, College Entrance Examinations, Equated Scores, High School Students