Cole, Brian S.; Lima-Walton, Elia; Brunnert, Kim; Vesey, Winona Burt; Raha, Kaushik – Journal of Applied Testing Technology, 2020
Automatic item generation can rapidly generate large volumes of exam items, but this creates challenges for assembly of exams which aim to include syntactically diverse items. First, we demonstrate a diminishing marginal syntactic return for automatic item generation using a saturation detection approach. This analysis can help users of automatic…
Descriptors: Artificial Intelligence, Automation, Test Construction, Test Items
Kosh, Audra E. – Journal of Applied Testing Technology, 2021
In recent years, Automatic Item Generation (AIG) has increasingly shifted from theoretical research to operational implementation, a shift raising some unforeseen practical challenges. Specifically, generating high-quality answer choices presents several challenges such as ensuring that answer choices blend in nicely together for all possible item…
Descriptors: Test Items, Multiple Choice Tests, Decision Making, Test Construction
Furter, Robert T. – Journal of Applied Testing Technology, 2019
Standard setting is the process of identifying the point(s) on a scale that serve to differentiate between individuals of distinct proficiency levels. While standard setting is ultimately a policy decision, most of the process is carried out by subject matter experts who are tasked with reconciling item-level or examinee-level information (e.g.…
Descriptors: Standard Setting, Cutting Scores, Decision Making, Test Construction
Betts, Joe; Muntean, William; Kim, Doyoung; Jorion, Natalie; Dickison, Philip – Journal of Applied Testing Technology, 2019
Clinical judgment has become an increasingly important aspect of modern health service professions. To ensure public safety, licensure exams must go beyond assessing only knowledge and skills when evaluating entry-level professionals to evaluating clinical judgment. This importance necessitates licensure and certification examinations in these…
Descriptors: Decision Making, Licensing Examinations (Professions), Certification, Nursing Education
Dickison, Philip; Luo, Xiao; Kim, Doyoung; Woo, Ada; Muntean, William; Bergstrom, Betty – Journal of Applied Testing Technology, 2016
Designing a theory-based assessment with sound psychometric qualities to measure a higher-order cognitive construct is a highly desired yet challenging task for many practitioners. This paper proposes a framework for designing a theory-based assessment to measure a higher-order cognitive construct. This framework results in a modularized yet…
Descriptors: Thinking Skills, Cognitive Tests, Test Construction, Nursing
Gierl, Mark J.; Lai, Hollis; Hogan, James B.; Matovinovic, Donna – Journal of Applied Testing Technology, 2015
The demand for test items far outstrips the current supply. This increased demand can be attributed, in part, to the transition to computerized testing, but it is also linked to dramatic changes in how 21st century educational assessments are designed and administered. One way to address this growing demand is with automatic item generation.…
Descriptors: Common Core State Standards, Test Items, Alignment (Education), Test Construction
Luecht, Richard M. – Journal of Applied Testing Technology, 2013
Assessment engineering is a new way to design and implement scalable, sustainable and ideally lower-cost solutions to the complexities of designing and developing tests. It represents a merger of sorts between cognitive task modeling and engineering design principles--a merger that requires some new thinking about the nature of score scales, item…
Descriptors: Engineering, Test Construction, Test Items, Models
Hendrickson, Amy; Ewing, Maureen; Kaliski, Pamela; Huff, Kristen – Journal of Applied Testing Technology, 2013
Evidence-centered design (ECD) is an orientation towards assessment development. It differs from conventional practice in several ways and consists of multiple activities. Each of these activities results in a set of useful documentation: domain analysis, domain modeling, construction of the assessment framework, and assessment…
Descriptors: Evidence, Test Construction, Educational Assessment, Learning Theories
Burke, Matthew; Devore, Richard; Stopek, Josh – Journal of Applied Testing Technology, 2013
This paper describes efforts to bring principled assessment design to a large-scale, high-stakes licensure examination by employing the frameworks of Assessment Engineering (AE), the Revised Bloom's Taxonomy (RBT), and Cognitive Task Analysis (CTA). The Uniform CPA Examination is practice-oriented and focuses on the skills of accounting. In…
Descriptors: Licensing Examinations (Professions), Accounting, Engineering, Test Construction
Smith, Russell W.; Davis-Becker, Susan L.; O'Leary, Lisa S. – Journal of Applied Testing Technology, 2014
This article describes a hybrid standard setting method that combines characteristics of the Angoff (1971) and Bookmark (Mitzel, Lewis, Patz & Green, 2001) methods. The proposed approach utilizes strengths of each method while addressing weaknesses. An ordered item booklet, with items sorted based on item difficulty, is used in combination…
Descriptors: Standard Setting, Difficulty Level, Test Items, Rating Scales
Wandall, Jakob – Journal of Applied Testing Technology, 2011
Testing and test results can be used in different ways. They can be used for regulation and control, but they can also be a pedagogic tool for assessment of student proficiency in order to target teaching, improve learning and facilitate local pedagogical leadership. To serve these purposes the test has to be used for low-stakes purposes, and to…
Descriptors: Test Results, Standardized Tests, Information Technology, Foreign Countries
Jacobsen, Jared; Ackermann, Richard; Eguez, Jane; Ganguli, Debalina; Rickard, Patricia; Taylor, Linda – Journal of Applied Testing Technology, 2011
A computer adaptive test (CAT) is a delivery methodology that serves the larger goals of the assessment system in which it is embedded. A thorough analysis of the assessment system for which a CAT is being designed is critical to ensure that the delivery platform is appropriate and addresses all relevant complexities. As such, a CAT engine must be…
Descriptors: Delivery Systems, Testing Programs, Computer Assisted Testing, Foreign Countries
Makransky, Guido; Glas, Cees A. W. – Journal of Applied Testing Technology, 2010
An accurately calibrated item bank is essential for a valid computerized adaptive test. However, in some settings, such as occupational testing, there is limited access to test takers for calibration. As a result of the limited access to possible test takers, collecting data to accurately calibrate an item bank in an occupational setting is…
Descriptors: Foreign Countries, Simulation, Adaptive Testing, Computer Assisted Testing
Kingsbury, G. Gage; Wise, Steven L. – Journal of Applied Testing Technology, 2011
Development of adaptive tests used in K-12 settings requires the creation of stable measurement scales to measure the growth of individual students from one grade to the next, and to measure change in groups from one year to the next. Accountability systems like No Child Left Behind require stable measurement scales so that accountability has…
Descriptors: Elementary Secondary Education, Adaptive Testing, Academic Achievement, Measures (Individuals)
Parshall, Cynthia G.; Harmes, J. Christine – Journal of Applied Testing Technology, 2009
Many exam programs have begun to include innovative item types in their operational assessments. While innovative item types appear to have great promise for expanding measurement, there can also be genuine challenges to their successful implementation. In this paper we present a set of four activities that can be beneficially incorporated into…
Descriptors: Test Items, Test Construction, Measurement, Educational Assessment