van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022
Two independent statistical tests of item compromise are presented, one based on the test takers' responses and the other on their response times (RTs) on the same items. The tests can be used to monitor an item in real time during online continuous testing but are also applicable as part of post hoc forensic analysis. The two test statistics are…
Descriptors: Test Items, Item Analysis, Item Response Theory, Computer Assisted Testing
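
The abstract is truncated before the two statistics are specified. Purely as an illustrative sketch (not the paper's actual tests), the response-time side of such a check can be framed under the lognormal RT model common in this literature, where preknowledge of a compromised item shows up as systematically faster-than-expected responses; the function and parameter names below are assumptions.

```python
import numpy as np
from scipy import stats

def rt_compromise_z(log_rts, time_intensity, speed, alpha):
    """One-sided check for item preknowledge from response times.

    Assumes the lognormal RT model log T_j ~ N(beta - tau_j, 1/alpha^2),
    with item time intensity beta, person speeds tau_j, and time
    discrimination alpha. Systematically negative residuals (responses
    faster than the model predicts) suggest the item has leaked.
    """
    resid = np.asarray(log_rts) - (time_intensity - np.asarray(speed))
    z = alpha * resid.sum() / np.sqrt(len(resid))  # ~ N(0, 1) under the model
    return z, stats.norm.cdf(z)                    # small p => suspiciously fast

# Example: 50 log-RTs on one item with known person speeds
rng = np.random.default_rng(0)
tau = rng.normal(0.0, 0.3, size=50)
log_t = rng.normal(1.8 - tau, 1 / 2.0)             # beta = 1.8, alpha = 2.0
print(rt_compromise_z(log_t, time_intensity=1.8, speed=tau, alpha=2.0))
```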
Jewsbury, Paul A.; van Rijn, Peter W. – Journal of Educational and Behavioral Statistics, 2020
In large-scale educational assessment data consistent with a simple-structure multidimensional item response theory (MIRT) model, where every item measures only one latent variable, separate unidimensional item response theory (UIRT) models for each latent variable are often calibrated for practical reasons. While this approach can be valid for…
Descriptors: Item Response Theory, Computation, Test Items, Adaptive Testing
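
In a simple-structure MIRT model, each item loads on exactly one latent variable. A common 2PL formulation (the notation and parameterization here are assumptions, not quoted from the paper) makes the abstract's setup concrete:

```latex
% Item i measures only dimension d(i):
\[
  P(X_i = 1 \mid \boldsymbol\theta)
    \;=\; \frac{1}{1 + \exp\!\bigl[-a_i\bigl(\theta_{d(i)} - b_i\bigr)\bigr]},
  \qquad
  \boldsymbol\theta \sim N(\boldsymbol 0, \boldsymbol\Sigma).
\]
% Calibrating a separate UIRT model per dimension fits each marginal of
% \boldsymbol\theta and discards the correlations in \Sigma, which is
% why the approach can be valid for some purposes but not others.
```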
Passos, Valeria Lima; Berger, Martijn P. F.; Tan, Frans E. S. – Journal of Educational and Behavioral Statistics, 2008
During the early stage of computerized adaptive testing (CAT), item selection criteria based on Fisher's information often produce less stable latent trait estimates than the Kullback-Leibler global information criterion. Robustness against early-stage instability has been reported for the D-optimality criterion in a polytomous CAT with the…
Descriptors: Computer Assisted Testing, Adaptive Testing, Evaluation Criteria, Item Analysis
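
The paper concerns polytomous items and D-optimality; the two criteria it contrasts are easiest to see in a dichotomous 2PL sketch (an assumption for illustration, not the paper's setup):

```python
import numpy as np

def p2pl(theta, a, b):
    """2PL response probability."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def fisher_info(theta_hat, a, b):
    """Fisher information of a 2PL item, evaluated at the point estimate."""
    p = p2pl(theta_hat, a, b)
    return a**2 * p * (1 - p)

def kl_info(theta_hat, a, b, delta=1.0, n_grid=101):
    """Kullback-Leibler global information: average divergence between the
    response distributions at theta_hat and at theta, over a window
    [theta_hat - delta, theta_hat + delta]."""
    thetas = np.linspace(theta_hat - delta, theta_hat + delta, n_grid)
    p0, p = p2pl(theta_hat, a, b), p2pl(thetas, a, b)
    kl = p0 * np.log(p0 / p) + (1 - p0) * np.log((1 - p0) / (1 - p))
    return kl.mean()

# Early in a CAT the estimate theta_hat is unstable; the KL criterion
# spreads credit over a neighborhood instead of trusting the point value.
theta_hat = 0.2
for a, b in [(1.8, 0.0), (1.2, -1.0), (0.9, 1.5)]:
    print(fisher_info(theta_hat, a, b), kl_info(theta_hat, a, b))
```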

Bradlow, Eric T.; Weiss, Robert E. – Journal of Educational and Behavioral Statistics, 2001
Compares four methods that map outlier statistics to a familiarity probability scale (a "P" value). Explores these methods in the context of computerized adaptive test data from a 1995 nationally administered computerized examination for professionals in the medical industry. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Probability, Test Construction
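
The four specific mappings are not recoverable from this abstract; one generic way to put an outlier statistic on a probability scale, offered only as an assumption-laden sketch, is a parametric bootstrap against its null distribution:

```python
import numpy as np

def p_value_by_simulation(stat_obs, simulate_null, n_sim=2000, seed=None):
    """Map an outlier statistic onto a probability scale by parametric
    bootstrap: the P value is the fraction of statistics simulated under
    the null (no unusual familiarity) at least as extreme as observed."""
    rng = np.random.default_rng(seed)
    null = np.array([simulate_null(rng) for _ in range(n_sim)])
    return float((null >= stat_obs).mean())

# Example with a toy null: the statistic is a standard-normal deviate.
print(p_value_by_simulation(2.5, lambda rng: rng.normal()))
```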

van Krimpen-Stoop, Edith M. L. A.; Meijer, Rob R. – Journal of Educational and Behavioral Statistics, 2001
Proposed person-fit statistics that are designed for use in a computerized adaptive test (CAT) and derived critical values for these statistics using cumulative sum (CUSUM) procedures so that item-score patterns can be classified as fitting or misfitting. Compared nominal Type I errors with empirical Type I errors through simulation studies. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Simulation, Test Construction
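
The CUSUM idea translates directly into code. A minimal sketch, with critical values taken as inputs (the paper derives them so empirical Type I error matches the nominal rate):

```python
def cusum_person_fit(responses, probs, h_upper, h_lower):
    """CUSUM-style person-fit check for a CAT: accumulate the residuals
    X_k - P_k(theta_hat) item by item and classify the item-score
    pattern as misfitting as soon as either cumulative sum crosses its
    critical value (e.g., values set by simulation)."""
    c_plus, c_minus = 0.0, 0.0
    for x, p in zip(responses, probs):
        t = x - p                       # residual for item k
        c_plus = max(0.0, c_plus + t)   # run of better-than-expected scores
        c_minus = min(0.0, c_minus + t) # run of worse-than-expected scores
        if c_plus > h_upper or c_minus < h_lower:
            return "misfitting"
    return "fitting"

print(cusum_person_fit([1, 1, 0, 0, 0, 0], [0.6, 0.55, 0.5, 0.5, 0.45, 0.4],
                       h_upper=1.5, h_lower=-1.5))
```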

Bradlow, Eric T. – Journal of Educational and Behavioral Statistics, 1996
The three-parameter logistic (3-PL) model is described and a derivation of the 3-PL observed information function is presented for a single binary response from one examinee with known item parameters. Formulas are presented for the probability of negative information and for the expected information (always nonnegative). (SLD)
Descriptors: Ability, Adaptive Testing, Computer Assisted Testing, Item Response Theory
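
The 3-PL model and the observed information the abstract refers to are standard and can be written out (the notation is mine):

```latex
% Three-parameter logistic (3-PL) model for item i:
\[
  P_i(\theta) \;=\; c_i + (1 - c_i)\,\frac{1}{1 + e^{-a_i(\theta - b_i)}}
\]
% Observed information for a single binary response x \in \{0, 1\}:
\[
  J(\theta) \;=\; -\,\frac{\partial^2}{\partial \theta^2}\,
    \log L(\theta \mid x),
  \qquad
  L(\theta \mid x) \;=\; P_i(\theta)^{x}\,\bigl(1 - P_i(\theta)\bigr)^{1 - x}.
\]
% The abstract's point: when the guessing parameter c_i > 0, J(\theta)
% can be negative for a correct response, whereas the expected (Fisher)
% information E[J(\theta)] is always nonnegative.
```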
van der Linden, Wim J.; Veldkamp, Bernard P. – Journal of Educational and Behavioral Statistics, 2004
Item-exposure control in computerized adaptive testing is implemented by imposing item-ineligibility constraints on the assembly process of the shadow tests. The method resembles Sympson and Hetter's (1985) method of item-exposure control in that the decisions to impose the constraints are probabilistic. The method does not, however, require…
Descriptors: Probability, Law Schools, Admission (School), Adaptive Testing
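
The paper's exact probabilistic rule is not reproduced in the snippet; the following is only a plausible stand-in to show where eligibility probabilities sit in the workflow:

```python
import numpy as np

def update_eligibility(p_elig, exposure_rate, r_max):
    """Illustrative update for item-eligibility probabilities (an
    assumption, not the paper's rule): items exposed above r_max become
    less likely to be declared eligible for the next test taker, while
    under-exposed items drift back toward full eligibility."""
    return np.minimum(1.0, p_elig * r_max / np.maximum(exposure_rate, 1e-12))

# Before each administration, Bernoulli(p_elig) draws decide which items
# receive an ineligibility constraint (x_i = 0) in the shadow-test model.
p = update_eligibility(np.array([1.0, 0.9]), np.array([0.35, 0.10]), r_max=0.25)
print(p)  # over-exposed item is throttled; the other returns to 1.0
```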
Nandakumar, Ratna; Roussos, Louis – Journal of Educational and Behavioral Statistics, 2004
A new procedure, CATSIB, for assessing differential item functioning (DIF) on computerized adaptive tests (CATs) is proposed. CATSIB, a modified SIBTEST procedure, matches test takers on estimated ability and controls for impact-induced Type I error inflation by employing a CAT version of the SIBTEST "regression correction." The…
Descriptors: Evaluation, Adaptive Testing, Computer Assisted Testing, Pretesting
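
A skeleton of the SIBTEST-style statistic CATSIB builds on, with the regression correction itself omitted (so this is background, not the paper's procedure):

```python
import numpy as np

def sibtest_beta(y_ref, y_foc, k_ref, k_foc):
    """SIBTEST-style DIF statistic: after matching test takers on
    ability strata k (CATSIB matches on regression-corrected CAT
    ability estimates), beta-hat is the focal-weighted mean difference
    in expected item scores. Inputs are NumPy arrays of item scores and
    stratum labels per group; beta far from 0 signals DIF."""
    beta = 0.0
    for k in np.union1d(k_ref, k_foc):
        r, f = y_ref[k_ref == k], y_foc[k_foc == k]
        if len(r) and len(f):
            beta += (len(f) / len(y_foc)) * (r.mean() - f.mean())
    return beta
```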
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2003
The Hetter and Sympson (1985, 1997) method provides probabilistic item-exposure control in computerized adaptive testing. Setting its control parameters to admissible values requires an iterative process of computer simulations that has been found to be time-consuming, particularly if the parameters have to be set conditional on a realistic…
Descriptors: Law Schools, Adaptive Testing, Admission (School), Computer Assisted Testing
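
The iterative process the abstract calls time-consuming alternates CAT simulation with an adjustment of the control parameters; a sketch of the classical adjustment step (my simplification):

```python
import numpy as np

def sympson_hetter_update(p_selected, r_max):
    """One round of the classical Sympson-Hetter adjustment: given the
    item-selection rates P(S_i) estimated from a simulated CAT run, set
    each control parameter k_i (the probability that a selected item is
    actually administered) so the administration rate k_i * P(S_i) is
    capped at r_max. Changing k alters the next round's selection
    rates, so simulation and update must be iterated until the exposure
    rates stabilize."""
    p_selected = np.maximum(np.asarray(p_selected, dtype=float), 1e-12)
    return np.where(p_selected <= r_max, 1.0, r_max / p_selected)

print(sympson_hetter_update([0.05, 0.30, 0.60], r_max=0.20))
# -> [1.0, 0.666..., 0.333...]: frequently selected items are throttled
```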

Stocking, Martha L. – Journal of Educational and Behavioral Statistics, 1996
An alternative method for scoring adaptive tests, based on number-correct scores, is explored and compared with a method that relies more directly on item response theory. Using the number-correct score with necessary adjustment for intentional differences in adaptive test difficulty is a statistically viable scoring method. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Difficulty Level, Item Response Theory
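
The adjustment for adaptive test difficulty can be realized by inverting the test characteristic curve of the items each examinee actually took; a minimal 2PL sketch of that idea:

```python
import numpy as np
from scipy.optimize import brentq

def p2pl(theta, a, b):
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def theta_from_number_correct(x, a, b):
    """Score an adaptive test from its number-correct score x by
    inverting the test characteristic curve of the items administered:
    solve sum_i P_i(theta) = x. Each examinee takes a different item
    set, so the same x maps to a different theta; that is the
    adjustment for intentional differences in test difficulty.
    (Requires 0 < x < number of items.)"""
    return brentq(lambda t: p2pl(t, a, b).sum() - x, -6.0, 6.0)

a = np.array([1.2, 0.8, 1.5, 1.0])
b = np.array([-0.5, 0.0, 0.6, 1.2])
print(theta_from_number_correct(3, a, b))  # harder set => higher theta for same x
```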

Spray, Judith A.; Reckase, Mark D. – Journal of Educational and Behavioral Statistics, 1996
Two procedures for classifying examinees into categories, one based on the sequential probability ratio test (SPRT) and the other on sequential Bayes methodology, were compared to determine which required fewer items for classification. Results showed that the SPRT procedure requires fewer items to achieve the same accuracy level. (SLD)
Descriptors: Ability, Bayesian Statistics, Classification, Comparative Analysis
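
The SPRT side of the comparison is Wald's classic sequential test; a sketch for a mastery decision under a 3PL model (item parameters here are made up for the example):

```python
import numpy as np

def p3pl(theta, a, b, c):
    return c + (1 - c) / (1 + np.exp(-a * (theta - b)))

def sprt_classify(responses, items, theta0, theta1, alpha=0.05, beta=0.05):
    """Wald's sequential probability ratio test for classification:
    theta0 and theta1 bracket the cut score; the log-likelihood ratio of
    'above' vs 'below' accumulates item by item, and testing stops as
    soon as either bound is crossed. `items` is a list of (a, b, c)
    parameter tuples."""
    upper = np.log((1 - beta) / alpha)   # accept 'above cutoff'
    lower = np.log(beta / (1 - alpha))   # accept 'below cutoff'
    llr = 0.0
    for x, (a, b, c) in zip(responses, items):
        p0, p1 = p3pl(theta0, a, b, c), p3pl(theta1, a, b, c)
        llr += np.log(p1 / p0) if x else np.log((1 - p1) / (1 - p0))
        if llr >= upper:
            return "above cutoff"
        if llr <= lower:
            return "below cutoff"
    return "undecided"  # keep administering items

items = [(1.2, 0.0, 0.2), (1.0, 0.3, 0.2), (1.4, -0.2, 0.2), (1.1, 0.1, 0.2)]
print(sprt_classify([1, 1, 1, 1], items, theta0=-0.3, theta1=0.3))
```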

Berger, Martijn P. F.; Veerkamp, Wim J. J. – Journal of Educational and Behavioral Statistics, 1997
Some alternative criteria for item selection in adaptive testing are proposed that take into account uncertainty in the ability estimates. A simulation study shows that the likelihood weighted information criterion is a good alternative to the maximum information criterion. Another good alternative uses a Bayesian expected a posteriori estimator.…
Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing
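
The likelihood-weighted criterion replaces point evaluation of Fisher information with a weighted average over the current likelihood; a 2PL sketch under assumed parameter names:

```python
import numpy as np

def p2pl(theta, a, b):
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def likelihood_weighted_info(responses, a_seen, b_seen, a_cand, b_cand):
    """Likelihood-weighted information criterion: rather than evaluating
    Fisher information only at the point estimate of ability, weight it
    by the likelihood of the responses so far over a theta grid,
    acknowledging the estimate's uncertainty. Returns one value per
    candidate item; select the argmax. (A Bayesian variant would weight
    by the posterior and score with an EAP estimator instead.)"""
    grid = np.linspace(-4.0, 4.0, 161)
    like = np.ones_like(grid)
    for x, a, b in zip(responses, a_seen, b_seen):
        p = p2pl(grid, a, b)
        like *= p if x else (1.0 - p)
    w = like / like.sum()                      # grid weights (argmax-invariant)
    p_c = p2pl(grid[:, None], np.asarray(a_cand), np.asarray(b_cand))
    info = np.asarray(a_cand) ** 2 * p_c * (1.0 - p_c)
    return (w[:, None] * info).sum(axis=0)

crit = likelihood_weighted_info([1, 0], [1.0, 1.3], [-0.5, 0.4],
                                a_cand=[1.8, 1.2, 0.9], b_cand=[0.0, -1.0, 1.5])
print(crit, "-> administer item", int(np.argmax(crit)))
```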