ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	15

Descriptor

Test Construction	15
Test Items	15
Computer Assisted Testing	7
Models	5
Difficulty Level	4
Psychometrics	4
Test Validity	4
Adaptive Testing	3
Decision Making	3
Foreign Countries	3
Item Response Theory	3
Licensing Examinations…	3
Mathematics Tests	3
Alignment (Education)	2
Content Validity	2
Educational Assessment	2
Elementary Secondary Education	2
Engineering	2
Evaluation Methods	2
Evidence	2
Guidelines	2
Item Analysis	2
Item Banks	2
Learning Theories	2
Measurement Objectives	2
More ▼

Source

Journal of Applied Testing…

Publication Type

Journal Articles	15
Reports - Descriptive	5
Reports - Evaluative	5
Reports - Research	5
Tests/Questionnaires	1

Education Level

Adult Education	2
Elementary Secondary Education	2
Early Childhood Education	1
Elementary Education	1
Kindergarten	1
Postsecondary Education	1
Primary Education	1

Audience

Location

Denmark	2
Singapore	1

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Taming the Firehose: Unsupervised Machine Learning for Syntactic Partitioning of Large Volumes of Automatically Generated Items to Assist Automated Test Assembly

Peer reviewed

Direct link

Cole, Brian S.; Lima-Walton, Elia; Brunnert, Kim; Vesey, Winona Burt; Raha, Kaushik – Journal of Applied Testing Technology, 2020

Automatic item generation can rapidly generate large volumes of exam items, but this creates challenges for assembly of exams which aim to include syntactically diverse items. First, we demonstrate a diminishing marginal syntactic return for automatic item generation using a saturation detection approach. This analysis can help users of automatic…

Descriptors: Artificial Intelligence, Automation, Test Construction, Test Items

Distractor Suites: A Method for Developing Answer Choices in Automatically Generated Multiple-Choice Items

Peer reviewed

Direct link

Kosh, Audra E. – Journal of Applied Testing Technology, 2021

In recent years, Automatic Item Generation (AIG) has increasingly shifted from theoretical research to operational implementation, a shift raising some unforeseen practical challenges. Specifically, generating high-quality answer choices presents several challenges such as ensuring that answer choices blend in nicely together for all possible item…

Descriptors: Test Items, Multiple Choice Tests, Decision Making, Test Construction

Bridging the Standard Setting Gap via Assessment Engineering

Peer reviewed

Direct link

Furter, Robert T. – Journal of Applied Testing Technology, 2019

Standard setting is the process of identifying the point(s) on a scale that serve to differentiate between individuals of distinct proficiency levels. While standard setting is ultimately a policy decision, most of the process is carried out by subject matter experts who are tasked with reconciling item-level or examinee-level information (e.g.…

Descriptors: Standard Setting, Cutting Scores, Decision Making, Test Construction

Building a Method for Writing Clinical Judgment Items for Entry-Level Nursing Exams

Peer reviewed

Direct link

Betts, Joe; Muntean, William; Kim, Doyoung; Jorion, Natalie; Dickison, Philip – Journal of Applied Testing Technology, 2019

Clinical judgment has become an increasingly important aspect of modern health service professionals. To ensure public safety, licensure exams must go beyond assessing only knowledge and skills when evaluating entry-level professions to evaluating clinical judgment. This importance necessitates licensure and certification examinations in these…

Descriptors: Decision Making, Licensing Examinations (Professions), Certification, Nursing Education

Assessing Higher-Order Cognitive Constructs by Using an Information-Processing Framework

Peer reviewed

Direct link

Dickison, Philip; Luo, Xiao; Kim, Doyoung; Woo, Ada; Muntean, William; Bergstrom, Betty – Journal of Applied Testing Technology, 2016

Designing a theory-based assessment with sound psychometric qualities to measure a higher-order cognitive construct is a highly desired yet challenging task for many practitioners. This paper proposes a framework for designing a theory-based assessment to measure a higher-order cognitive construct. This framework results in a modularized yet…

Descriptors: Thinking Skills, Cognitive Tests, Test Construction, Nursing

A Method for Generating Educational Test Items That Are Aligned to the Common Core State Standards

Peer reviewed

Direct link

Gierl, Mark J.; Lai, Hollis; Hogan, James B.; Matovinovic, Donna – Journal of Applied Testing Technology, 2015

The demand for test items far outstrips the current supply. This increased demand can be attributed, in part, to the transition to computerized testing, but, it is also linked to dramatic changes in how 21st century educational assessments are designed and administered. One way to address this growing demand is with automatic item generation.…

Descriptors: Common Core State Standards, Test Items, Alignment (Education), Test Construction

Assessment Engineering Task Model Maps, Task Models and Templates as a New Way to Develop and Implement Test Specifications

Peer reviewed

Direct link

Luecht, Richard M. – Journal of Applied Testing Technology, 2013

Assessment engineering is a new way to design and implement scalable, sustainable and ideally lower-cost solutions to the complexities of designing and developing tests. It represents a merger of sorts between cognitive task modeling and engineering design principles--a merger that requires some new thinking about the nature of score scales, item…

Descriptors: Engineering, Test Construction, Test Items, Models

Evidence-Centered Design: Recommendations for Implementation and Practice

Peer reviewed

Direct link

Hendrickson, Amy; Ewing, Maureen; Kaliski, Pamela; Huff, Kristen – Journal of Applied Testing Technology, 2013

Evidence-centered design (ECD) is an orientation towards assessment development. It differs from conventional practice in several ways and consists of multiple activities. Each of these activities results in a set of useful documentation: domain analysis, domain modeling, construction of the assessment framework, and assessment…

Descriptors: Evidence, Test Construction, Educational Assessment, Learning Theories

Implementing Assessment Engineering in the Uniform Certified Public Accountant (CPA) Examination

Peer reviewed

Direct link

Burke, Matthew; Devore, Richard; Stopek, Josh – Journal of Applied Testing Technology, 2013

This paper describes efforts to bring principled assessment design to a large-scale, high-stakes licensure examination by employing the frameworks of Assessment Engineering (AE), the Revised Bloom's Taxonomy (RBT), and Cognitive Task Analysis (CTA). The Uniform CPA Examination is practice-oriented and focuses on the skills of accounting. In…

Descriptors: Licensing Examinations (Professions), Accounting, Engineering, Test Construction

Combining the Best of Two Standard Setting Methods: The Ordered Item Booklet Angoff

Peer reviewed

Direct link

Smith, Russell W.; Davis-Becker, Susan L.; O'Leary, Lisa S. – Journal of Applied Testing Technology, 2014

This article describes a hybrid standard setting method that combines characteristics of the Angoff (1971) and Bookmark (Mitzel, Lewis, Patz & Green, 2001) methods. The proposed approach utilizes strengths of each method while addressing weaknesses. An ordered item booklet, with items sorted based on item difficulty, is used in combination…

Descriptors: Standard Setting, Difficulty Level, Test Items, Rating Scales

National Tests in Denmark--CAT as a Pedagogic Tool

Peer reviewed

Direct link

Wandall, Jakob – Journal of Applied Testing Technology, 2011

Testing and test results can be used in different ways. They can be used for regulation and control, but they can also be a pedagogic tool for assessment of student proficiency in order to target teaching, improve learning and facilitate local pedagogical leadership. To serve these purposes the test has to be used for low stakes purposes, and to…

Descriptors: Test Results, Standardized Tests, Information Technology, Foreign Countries

Design of a Computer-Adaptive Test to Measure English Literacy and Numeracy in the Singapore Workforce: Considerations, Benefits, and Implications

Peer reviewed

Direct link

Jacobsen, Jared; Ackermann, Richard; Eguez, Jane; Ganguli, Debalina; Rickard, Patricia; Taylor, Linda – Journal of Applied Testing Technology, 2011

A computer adaptive test (CAT) is a delivery methodology that serves the larger goals of the assessment system in which it is embedded. A thorough analysis of the assessment system for which a CAT is being designed is critical to ensure that the delivery platform is appropriate and addresses all relevant complexities. As such, a CAT engine must be…

Descriptors: Delivery Systems, Testing Programs, Computer Assisted Testing, Foreign Countries

An Automatic Online Calibration Design in Adaptive Testing

Peer reviewed

Direct link

Makransky, Guido; Glas, Cees A. W. – Journal of Applied Testing Technology, 2010

An accurately calibrated item bank is essential for a valid computerized adaptive test. However, in some settings, such as occupational testing, there is limited access to test takers for calibration. As a result of the limited access to possible test takers, collecting data to accurately calibrate an item bank in an occupational setting is…

Descriptors: Foreign Countries, Simulation, Adaptive Testing, Computer Assisted Testing

Creating a K-12 Adaptive Test: Examining the Stability of Item Parameter Estimates and Measurement Scales

Peer reviewed

Direct link

Kingsbury, G. Gage; Wise, Steven L. – Journal of Applied Testing Technology, 2011

Development of adaptive tests used in K-12 settings requires the creation of stable measurement scales to measure the growth of individual students from one grade to the next, and to measure change in groups from one year to the next. Accountability systems like No Child Left Behind require stable measurement scales so that accountability has…

Descriptors: Elementary Secondary Education, Adaptive Testing, Academic Achievement, Measures (Individuals)

Improving the Quality of Innovative Item Types: Four Tasks for Design and Development

Peer reviewed

Direct link

Parshall, Cynthia G.; Harmes, J. Christine – Journal of Applied Testing Technology, 2009

Many exam programs have begun to include innovative item types in their operational assessments. While innovative item types appear to have great promise for expanding measurement, there can also be genuine challenges to their successful implementation. In this paper we present a set of four activities that can be beneficially incorporated into…

Descriptors: Test Items, Test Construction, Measurement, Educational Assessment

Dickison, Philip	2
Kim, Doyoung	2
Muntean, William	2
Ackermann, Richard	1
Bergstrom, Betty	1
Betts, Joe	1
Brunnert, Kim	1
Burke, Matthew	1
Cole, Brian S.	1
Davis-Becker, Susan L.	1
Devore, Richard	1
Eguez, Jane	1
Ewing, Maureen	1
Furter, Robert T.	1
Ganguli, Debalina	1
Gierl, Mark J.	1
Glas, Cees A. W.	1
Harmes, J. Christine	1
Hendrickson, Amy	1
Hogan, James B.	1
Huff, Kristen	1
Jacobsen, Jared	1
Jorion, Natalie	1
Kaliski, Pamela	1
Kingsbury, G. Gage	1
More ▼