ERIC - Search Results

Showing 1 to 15 of 69 results

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

A Statistical Test for the Detection of Item Compromise Combining Responses and Response Times

Peer reviewed

van der Linden, Wim J.; Belov, Dmitry I. – Journal of Educational Measurement, 2023

A test of item compromise is presented which combines the test takers' responses and response times (RTs) into a statistic defined as the number of correct responses on the item for test takers with RTs flagged as suspicious. The test has null and alternative distributions belonging to the well-known family of compound binomial distributions, is…

Descriptors: Item Response Theory, Reaction Time, Test Items, Item Analysis

Two Statistical Tests for the Detection of Item Compromise

Peer reviewed

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

Two independent statistical tests of item compromise are presented, one based on the test takers' responses and the other on their response times (RTs) on the same items. The tests can be used to monitor an item in real time during online continuous testing but are also applicable as part of post hoc forensic analysis. The two test statistics are…

Descriptors: Test Items, Item Analysis, Item Response Theory, Computer Assisted Testing

Estimating Linking Functions for Response Model Parameters

Peer reviewed

Barrett, Michelle D.; van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2019

Parameter linking in item response theory is generally necessary to adjust for differences between the true values for the same item and ability parameters due to the use of different identifiability restrictions in different calibrations. The research reported in this article explores a precision-weighted (PW) approach to the problem of…

Descriptors: Item Response Theory, Computation, Error of Measurement, Test Items

A Comparison of Constraint Programming and Mixed-Integer Programming for Automated Test-Form Generation

Peer reviewed

Li, Jie; van der Linden, Wim J. – Journal of Educational Measurement, 2018

The final step of the typical process of developing educational and psychological tests is to place the selected test items in a formatted form. The step involves the grouping and ordering of the items to meet a variety of formatting constraints. As this activity tends to be time-intensive, the use of mixed-integer programming (MIP) has been…

Descriptors: Programming, Automation, Test Items, Test Format

Speededness and Adaptive Testing

Peer reviewed

van der Linden, Wim J.; Xiong, Xinhui – Journal of Educational and Behavioral Statistics, 2013

Two simple constraints on the item parameters in a response-time model are proposed to control the speededness of an adaptive test. As the constraints are additive, they can easily be included in the constraint set for a shadow-test approach (STA) to adaptive testing. Alternatively, a simple heuristic is presented to control speededness in plain…

Descriptors: Adaptive Testing, Heuristics, Test Length, Reaction Time

Optimal Test Design with Rule-Based Item Generation

Peer reviewed

Geerlings, Hanneke; van der Linden, Wim J.; Glas, Cees A. W. – Applied Psychological Measurement, 2013

Optimal test-design methods are applied to rule-based item generation. Three different cases of automated test design are presented: (a) test assembly from a pool of pregenerated, calibrated items; (b) test generation on the fly from a pool of calibrated item families; and (c) test generation on the fly directly from calibrated features defining…

Descriptors: Test Construction, Test Items, Item Banks, Automation

Modeling Answer Changes on Test Items

Peer reviewed

van der Linden, Wim J.; Jeon, Minjeong – Journal of Educational and Behavioral Statistics, 2012

The probability of test takers changing answers upon review of their initial choices is modeled. The primary purpose of the model is to check erasures on answer sheets recorded by an optical scanner for numbers and patterns that may be indicative of irregular behavior, such as teachers or school administrators changing answer sheets after their…

Descriptors: Probability, Models, Test Items, Educational Testing

A Paradox in the Study of the Benefits of Test-Item Review

Peer reviewed

van der Linden, Wim J.; Jeon, Minjeong; Ferrara, Steve – Journal of Educational Measurement, 2011

According to a popular belief, test takers should trust their initial instinct and retain their initial responses when they have the opportunity to review test items. More than 80 years of empirical research on item review, however, has contradicted this belief and shown minor but consistently positive score gains for test takers who changed…

Descriptors: Test Items, Item Response Theory, Test Wiseness, Beliefs

Local Observed-Score Equating with Anchor-Test Designs

Peer reviewed

van der Linden, Wim J.; Wiberg, Marie – Applied Psychological Measurement, 2010

For traditional methods of observed-score equating with anchor-test designs, such as chain and poststratification equating, it is difficult to satisfy the criteria of equity and population invariance. Their equatings are therefore likely to be biased. The bias in these methods was evaluated against a simple local equating method in which the…

Descriptors: Methods, Equated Scores, Test Items, Bias

Statistical Tests of Conditional Independence between Responses and/or Response Times on Test Items

Peer reviewed

van der Linden, Wim J.; Glas, Cees A. W. – Psychometrika, 2010

Three plausible assumptions of conditional independence in a hierarchical model for responses and response times on test items are identified. For each of the assumptions, a Lagrange multiplier test of the null hypothesis of conditional independence against a parametric alternative is derived. The tests have closed-form statistics that are easy to…

Descriptors: Test Items, Computation, Responses, Reaction Time

Automated Test-Form Generation

Peer reviewed

van der Linden, Wim J.; Diao, Qi – Journal of Educational Measurement, 2011

In automated test assembly (ATA), the methodology of mixed-integer programming is used to select test items from an item bank to meet the specifications for a desired test form and optimize its measurement accuracy. The same methodology can be used to automate the formatting of the set of selected items into the actual test form. Three different…

Descriptors: Test Items, Test Format, Test Construction, Item Banks

Linking Response-Time Parameters onto a Common Scale

Peer reviewed

van der Linden, Wim J. – Journal of Educational Measurement, 2010

Although response times on test items are recorded on a natural scale, the scale for some of the parameters in the lognormal response-time model (van der Linden, 2006) is not fixed. As a result, when the model is used to periodically calibrate new items in a testing program, the parameters are not automatically mapped onto a common scale. Several…

Descriptors: Test Items, Testing Programs, Measures (Individuals), Item Response Theory

Conceptual Issues in Response-Time Modeling

Peer reviewed

van der Linden, Wim J. – Journal of Educational Measurement, 2009

Two different traditions of response-time (RT) modeling are reviewed: the tradition of distinct models for RTs and responses, and the tradition of model integration in which RTs are incorporated in response models or the other way around. Several conceptual issues underlying both traditions are made explicit and analyzed for their consequences. We…

Descriptors: Test Items, Models, Reaction Time, Measurement

Implementing Sympson-Hetter Item-Exposure Control in a Shadow-Test Approach to Constrained Adaptive Testing

Peer reviewed

Veldkamp, Bernard P.; van der Linden, Wim J. – International Journal of Testing, 2008

In most operational computerized adaptive testing (CAT) programs, the Sympson-Hetter (SH) method is used to control the exposure of the items. Several modifications and improvements of the original method have been proposed. The Stocking and Lewis (1998) version of the method uses a multinomial experiment to select items. For severely constrained…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Methods
