Publication Date
In 2025: 1
Since 2024: 1
Since 2021 (last 5 years): 3
Since 2016 (last 10 years): 6
Since 2006 (last 20 years): 17
Descriptor
Test Items: 11
Computer Assisted Testing: 8
Psychometrics: 8
Test Construction: 8
Testing: 8
Automation: 5
Scores: 5
Adaptive Testing: 4
Comparative Analysis: 4
Educational Testing: 4
Equated Scores: 4
Source
Journal of Educational Measurement: 24
Author
van der Linden, Wim J.: 4
Kim, Sooyeon: 2
Puhan, Gautam: 2
Sinharay, Sandip: 2
Almond, Russell G.: 1
Baldwin, Peter: 1
Buckendahl, Chad W.: 1
Choi, Seung W.: 1
Clauser, Brian E.: 1
Cole, Nancy S.: 1
Davis, Susan L.: 1
Publication Type
Journal Articles: 24
Reports - Descriptive: 24
Speeches/Meeting Papers: 3
Opinion Papers: 1
Education Level
High Schools: 1
Harold Doran; Testsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025
Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms
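The snippet above does not show the generalized objective function itself. For orientation only, here is a minimal Python sketch of the conventional maximum-information selection rule under a 2PL model, the kind of criterion such generalized objective functions typically extend; the item parameters and masking scheme are invented for the example and are not taken from the article.

import numpy as np

def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a**2 * p * (1.0 - p)

def select_next_item(theta_hat, a, b, administered):
    """Pick the unadministered item with maximum information at theta_hat."""
    info = item_information(theta_hat, a, b)
    info[list(administered)] = -np.inf   # mask items already given
    return int(np.argmax(info))

# Toy bank: discrimination a and difficulty b for five items.
a = np.array([1.2, 0.8, 1.5, 1.0, 0.9])
b = np.array([-1.0, 0.0, 0.5, 1.0, -0.5])
print(select_next_item(0.3, a, b, administered={2}))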
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
van der Linden, Wim J.; Choi, Seung W. – Journal of Educational Measurement, 2020
One of the methods of controlling test security in adaptive testing is imposing random item-ineligibility constraints on the selection of the items with probabilities automatically updated to maintain a predetermined upper bound on the exposure rates. Three major improvements of the method are presented. First, a few modifications to improve the…
Descriptors: Adaptive Testing, Item Response Theory, Feedback (Response), Item Analysis
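The three improvements themselves are truncated in the snippet. As a rough illustration of the general idea of probabilistic item-ineligibility control, the hedged Python sketch below shrinks an item's eligibility probability whenever its observed exposure rate exceeds the bound r_max and relaxes it otherwise; this update rule is a simplification for illustration, not the one derived in the article.

import random

def update_eligibility(p_elig, exposure, r_max):
    """Illustrative update: lower an item's eligibility probability when its
    observed exposure rate exceeds r_max, relax it otherwise."""
    new_p = {}
    for item, p in p_elig.items():
        r = exposure.get(item, 0.0)
        if r > 0:
            new_p[item] = min(1.0, p * r_max / r)
        else:
            new_p[item] = min(1.0, p * 1.05)  # slowly restore unused items
    return new_p

def draw_eligible_items(p_elig, rng=random):
    """Bernoulli experiment per item: the examinee's eligible subset."""
    return {i for i, p in p_elig.items() if rng.random() < p}

p = {i: 1.0 for i in range(5)}
p = update_eligibility(p, exposure={0: 0.35, 1: 0.10}, r_max=0.25)
print(p)
print(draw_eligible_items(p))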
Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022
As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…
Descriptors: Scores, Scoring, Comparative Analysis, Testing
Li, Jie; van der Linden, Wim J. – Journal of Educational Measurement, 2018
The final step of the typical process of developing educational and psychological tests is to lay out the selected test items in a formatted test form. This step involves grouping and ordering the items to meet a variety of formatting constraints. As this activity tends to be time-intensive, the use of mixed-integer programming (MIP) has been…
Descriptors: Programming, Automation, Test Items, Test Format
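As an illustration of casting a formatting decision as mixed-integer programming, the sketch below assigns items to pages of fixed capacity while minimizing the number of pages used, modeled with the PuLP library. The data, page capacity, and constraint set are invented for the example and are far simpler than the formatting constraints discussed in the article.

from pulp import LpProblem, LpVariable, LpMinimize, lpSum, LpBinary, PULP_CBC_CMD

# Hypothetical data: space (in lines) each item needs, and a page capacity.
lines = {1: 12, 2: 7, 3: 20, 4: 9, 5: 15}
pages = [1, 2, 3]
capacity = 40

prob = LpProblem("item_formatting", LpMinimize)
x = {(i, p): LpVariable(f"x_{i}_{p}", cat=LpBinary) for i in lines for p in pages}
used = {p: LpVariable(f"used_{p}", cat=LpBinary) for p in pages}

# Each item appears on exactly one page.
for i in lines:
    prob += lpSum(x[i, p] for p in pages) == 1

# Respect page capacity; a page can hold items only if it is marked as used.
for p in pages:
    prob += lpSum(lines[i] * x[i, p] for i in lines) <= capacity * used[p]

# Objective: use as few pages as possible.
prob += lpSum(used[p] for p in pages)

prob.solve(PULP_CBC_CMD(msg=False))
print({i: next(p for p in pages if x[i, p].value() > 0.5) for i in lines})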
Heritage, Margaret; Kingston, Neal M. – Journal of Educational Measurement, 2019
Classroom assessment and large-scale assessment have, for the most part, existed in mutual isolation. Some experts have felt this is for the best and others have been concerned that the schism limits the potential contribution of both forms of assessment. Margaret Heritage has long been a champion of best practices in classroom assessment. Neal…
Descriptors: Measurement, Psychometrics, Context Effect, Classroom Environment
Guo, Hongwen; Puhan, Gautam – Journal of Educational Measurement, 2014
In this article, we introduce a section preequating (SPE) method (linear and nonlinear) under the randomly equivalent groups design. In this equating design, sections of Test X (a future new form) and another existing Test Y (an old form already on scale) are administered. The sections of Test X are equated to Test Y, after adjusting for the…
Descriptors: Equated Scores, Correlation, Simulation, Testing
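The SPE adjustment itself is not given in the snippet; for reference, the standard linear equating function under a randomly equivalent groups design, which the linear variant builds on, is

\[
  \mathrm{lin}_Y(x) \;=\; \mu_Y + \frac{\sigma_Y}{\sigma_X}\,(x - \mu_X),
\]

where \(\mu_X, \sigma_X\) and \(\mu_Y, \sigma_Y\) are the mean and standard deviation of scores on Form X and Form Y in the randomly equivalent groups.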
Sinharay, Sandip; Haberman, Shelby J. – Journal of Educational Measurement, 2011
Recently, there has been an increasing level of interest in subscores for their potential diagnostic value. Haberman (2008b) suggested reporting an augmented subscore that is a linear combination of a subscore and the total score. Sinharay and Haberman (2008) and Sinharay (2010) showed that augmented subscores often lead to more accurate…
Descriptors: Diagnostic Tests, Psychometrics, Testing, Equated Scores
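In schematic form, the augmented subscore referred to above is a weighted combination of the observed subscore \(s\) and the observed total score \(x\), with weights chosen to best predict the true subscore; the estimator in Haberman (2008b) is more detailed than this sketch:

\[
  s_{\text{aug}} \;=\; \bar{s} + a\,(s - \bar{s}) + b\,(x - \bar{x}).
\]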
Kane, Michael – Journal of Educational Measurement, 2011
Errors don't exist in our data, but they serve a vital function. Reality is complicated, but our models need to be simple in order to be manageable. We assume that attributes are invariant over some conditions of observation, and once we do that we need some way of accounting for the variability in observed scores over these conditions of…
Descriptors: Error of Measurement, Scores, Test Interpretation, Testing
van der Linden, Wim J.; Diao, Qi – Journal of Educational Measurement, 2011
In automated test assembly (ATA), the methodology of mixed-integer programming is used to select test items from an item bank to meet the specifications for a desired test form and optimize its measurement accuracy. The same methodology can be used to automate the formatting of the set of selected items into the actual test form. Three different…
Descriptors: Test Items, Test Format, Test Construction, Item Banks
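The article's three formatting formulations are not reproduced in the snippet. As background, a common maximin mixed-integer programming model for the item-selection side of ATA (generic textbook form, not the article's models) is

\[
  \max\; y \quad \text{subject to} \quad
  \sum_{i} I_i(\theta_k)\,x_i \;\ge\; y \;\; \text{for each ability point } \theta_k,
  \qquad \sum_{i} x_i = n, \qquad x_i \in \{0,1\},
\]

where \(x_i = 1\) if item i is selected and \(I_i(\theta_k)\) is its Fisher information at \(\theta_k\), with further linear constraints encoding the content specifications.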
van der Linden, Wim J. – Journal of Educational Measurement, 2010
Although response times on test items are recorded on a natural scale, the scale for some of the parameters in the lognormal response-time model (van der Linden, 2006) is not fixed. As a result, when the model is used to periodically calibrate new items in a testing program, the parameters are not automatically mapped onto a common scale. Several…
Descriptors: Test Items, Testing Programs, Measures (Individuals), Item Response Theory
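For reference, the lognormal response-time model cited above specifies the density of response time \(t_{ij}\) of person j on item i as

\[
  f(t_{ij};\,\tau_j,\alpha_i,\beta_i)
    = \frac{\alpha_i}{t_{ij}\sqrt{2\pi}}
      \exp\!\left\{-\tfrac{1}{2}\bigl[\alpha_i\bigl(\ln t_{ij} - (\beta_i - \tau_j)\bigr)\bigr]^2\right\},
\]

so that \(\ln t_{ij}\) is normal with mean \(\beta_i - \tau_j\) (item time intensity minus person speed) and precision \(\alpha_i\). That mean is unchanged when the same constant is added to every \(\beta_i\) and \(\tau_j\), which is the scale indeterminacy the article addresses.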
Livingston, Samuel A.; Kim, Sooyeon – Journal of Educational Measurement, 2009
This article suggests a method for estimating a test-score equating relationship from small samples of test takers. The method does not require the estimated equating transformation to be linear. Instead, it constrains the estimated equating curve to pass through two pre-specified end points and a middle point determined from the data. In a…
Descriptors: Measurement, Measurement Techniques, Psychometrics, Sample Size
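Purely as an illustration of constraining a curve through two fixed end points and a data-determined middle point, the Python sketch below interpolates a quadratic through three such points; the curve family used in the article may differ, and the numbers are hypothetical.

import numpy as np

def three_point_curve(x_low, y_low, x_mid, y_mid, x_high, y_high):
    """Illustrative only: fit the unique quadratic through the two fixed end
    points and the data-determined middle point."""
    coeffs = np.polyfit([x_low, x_mid, x_high], [y_low, y_mid, y_high], deg=2)
    return np.poly1d(coeffs)

# Hypothetical example: 0 maps to 0, the top score 50 maps to 50,
# and the observed means give a middle point (28, 30).
equate = three_point_curve(0, 0, 28, 30, 50, 50)
print(equate(np.arange(0, 51, 10)))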
Sinharay, Sandip; Holland, Paul W. – Journal of Educational Measurement, 2007
It is a widely held belief that anchor tests should be miniature versions (i.e., "minitests"), with respect to content and statistical characteristics, of the tests being equated. This article examines the foundations for this belief regarding statistical characteristics. It examines the requirement of statistical representativeness of…
Descriptors: Test Items, Comparative Testing
Finkelman, Matthew; Kim, Wonsuk; Roussos, Louis A. – Journal of Educational Measurement, 2009
Much recent psychometric literature has focused on cognitive diagnosis models (CDMs), a promising class of instruments used to measure the strengths and weaknesses of examinees. This article introduces a genetic algorithm to perform automated test assembly alongside CDMs. The algorithm is flexible in that it can be applied whether the goal is to…
Descriptors: Identification, Genetics, Test Construction, Mathematics
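The sketch below is a bare-bones genetic algorithm that selects a fixed-length form from a toy Q-matrix, meant only to show the selection/crossover/mutation loop; the fitness function, operators, and CDM-based objectives in the article differ and are not reproduced here.

import random

def fitness(mask, q_matrix, n_items):
    """Hypothetical fitness: reward coverage of each cognitive attribute in the
    Q-matrix (credit capped at three items per attribute), penalize deviation
    from the target test length."""
    selected = [q for q, m in zip(q_matrix, mask) if m]
    coverage = sum(min(sum(row[k] for row in selected), 3)
                   for k in range(len(q_matrix[0])))
    return coverage - 5 * abs(sum(mask) - n_items)

def genetic_assembly(q_matrix, n_items, pop_size=50, generations=200, rng=random):
    n = len(q_matrix)
    pop = [[rng.random() < n_items / n for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda m: fitness(m, q_matrix, n_items), reverse=True)
        survivors = pop[: pop_size // 2]                  # truncation selection
        children = []
        while len(children) < pop_size - len(survivors):
            a, b = rng.sample(survivors, 2)
            cut = rng.randrange(1, n)                     # one-point crossover
            child = a[:cut] + b[cut:]
            i = rng.randrange(n)                          # single-bit mutation
            child[i] = not child[i]
            children.append(child)
        pop = survivors + children
    return max(pop, key=lambda m: fitness(m, q_matrix, n_items))

# Toy Q-matrix: 8 items by 3 attributes; assemble a 4-item form.
q = [[1,0,0],[0,1,0],[0,0,1],[1,1,0],[0,1,1],[1,0,1],[1,1,1],[0,0,1]]
best = genetic_assembly(q, n_items=4)
print([i for i, m in enumerate(best) if m])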
Davis, Susan L.; Buckendahl, Chad W.; Plake, Barbara S. – Journal of Educational Measurement, 2008
As an alternative to adaptation, tests may also be developed simultaneously in multiple languages. Although the items on such tests could vary substantially, scores from these tests may be used to make the same types of decisions about different groups of examinees. The ability to make such decisions is contingent upon setting performance…
Descriptors: Test Results, Testing Programs, Multilingualism, Standard Setting