Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
Computerized adaptive tests (CAT) apply an adaptive process in which items are tailored to individuals' ability scores. Multidimensional CAT (MCAT) designs differ in the item selection, ability estimation, and termination methods they use. This study aims to investigate the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
Svetina, Dubravka; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2019
This study investigates the effect of several design and administration choices on item exposure and person/item parameter recovery under a multistage test (MST) design. In a simulation study, we examine whether number-correct (NC) or item response theory (IRT) methods are differentially effective at routing students to the correct next stage(s)…
Descriptors: Measurement, Item Analysis, Test Construction, Item Response Theory
Zheng, Chunmei – ProQuest LLC, 2013
Educational and psychological constructs are typically multifaceted: the measured construct is defined and measured through a set of related subdomains. A bifactor model can accurately describe such data with both the measured construct and the related subdomains. However, a limitation of the bifactor model is the orthogonality…
Descriptors: Educational Testing, Measurement Techniques, Test Items, Models
Straat, J. Hendrik; van der Ark, L. Andries; Sijtsma, Klaas – Educational and Psychological Measurement, 2014
An automated item selection procedure in Mokken scale analysis partitions a set of items into one or more Mokken scales, if the data allow. Two algorithms are available that pursue the same goal of selecting Mokken scales of maximum length: Mokken's original automated item selection procedure (AISP) and a genetic algorithm (GA). Minimum…
Descriptors: Sampling, Test Items, Effect Size, Scaling
Gnambs, Timo; Batinic, Bernad – Educational and Psychological Measurement, 2011
Computer-adaptive classification tests focus on classifying respondents in different proficiency groups (e.g., for pass/fail decisions). To date, adaptive classification testing has been dominated by research on dichotomous response formats and classifications in two groups. This article extends this line of research to polytomous classification…
Descriptors: Test Length, Computer Assisted Testing, Classification, Test Items

Fitzpatrick, Anne R.; Yen, Wendy M. – Applied Measurement in Education, 2001
Examined the effects of test length and sample size on the alternate forms reliability and equating of simulated mathematics tests composed of constructed response items scaled using the two-parameter partial credit model. Results suggest that, to obtain acceptable reliabilities and accurate equated scores, tests should have at least 8 6-point…
Descriptors: Constructed Response, Equated Scores, Mathematics Tests, Reliability
Davey, Tim; Pommerich, Mary; Thompson, Tony D. – 1999
In computerized adaptive testing (CAT), new or experimental items are frequently administered alongside operational tests to gather the pretest data needed to replenish and replace item pools. The two basic strategies used to combine pretest and operational items are embedding and appending. Variable-length CATs are preferred because of the…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Measurement Techniques
Hambleton, Ronald K.; Cook, Linda L. – 1978
The purpose of the present research was to study, systematically, the "goodness-of-fit" of the one-, two-, and three-parameter logistic models. We studied, using computer-simulated test data, the effects of four variables: variation in item discrimination parameters, the average value of the pseudo-chance level parameters, test length,…
Descriptors: Career Development, Difficulty Level, Goodness of Fit, Item Analysis
Eignor, Daniel R.; Hambleton, Ronald K. – 1979
The purpose of the investigation was to examine the relationships among (1) test lengths, (2) the shape of domain-score distributions, (3) advancement scores, and (4) several criterion-referenced test score reliability and validity indices. The study was conducted using computer simulation methods. The values of the variables under study were set to be…
Descriptors: Comparative Analysis, Computer Assisted Testing, Criterion Referenced Tests, Cutting Scores
Hambleton, Ronald K. – 1995
Performance assessments in education and credentialing are becoming popular. At the same time, no well-established and validated methods exist for setting standards on performance assessments. This paper describes several of the new standard-setting methods that are emerging for use with performance assessments and considers their…
Descriptors: Achievement Tests, Cutting Scores, Holistic Evaluation, Licensing Examinations (Professions)
Spray, Judith A.; Reckase, Mark D. – 1994
The issue of test-item selection in support of decision making in adaptive testing is considered. The number of items needed to make a decision is compared for two approaches: selecting items from an item pool that are most informative at the decision point or selecting items that are most informative at the examinee's ability level. The first…
Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing