Showing all 13 results
Peer reviewed
Direct link
Ersen, Rabia Karatoprak; Lee, Won-Chan – Journal of Educational Measurement, 2023
The purpose of this study was to compare calibration and linking methods for placing pretest item parameter estimates on the item pool scale in a 1-3 computerized multistage adaptive testing design in terms of item parameter recovery. Two models were used: embedded-section, in which pretest items were administered within a separate module, and…
Descriptors: Pretesting, Test Items, Computer Assisted Testing, Adaptive Testing
Peer reviewed
Direct link
Kang, Hyeon-Ah; Zhang, Susu; Chang, Hua-Hua – Journal of Educational Measurement, 2017
The development of cognitive diagnostic-computerized adaptive testing (CD-CAT) has provided a new perspective for gaining information about examinees' mastery on a set of cognitive attributes. This study proposes a new item selection method within the framework of dual-objective CD-CAT that simultaneously addresses examinees' attribute mastery…
Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Test Items
Peer reviewed
PDF on ERIC
Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015
The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…
Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement
Peer reviewed
van Krimpen-Stoop, Edith M. L. A.; Meijer, Rob R. – Applied Psychological Measurement, 2002
Compared the nominal and empirical null distributions of the standardized log-likelihood statistic for polytomous items for paper-and-pencil (P&P) and computerized adaptive tests (CATs). Results show that the empirical distribution of the statistic differed from the assumed standard normal distribution for both P&P tests and CATs. Also…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Response Theory, Statistical Distributions
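The standardized log-likelihood statistic examined here has a well-known dichotomous form (l_z); a minimal sketch of that dichotomous analogue (the article itself treats the polytomous extension, which standardizes in the same way) might look like:

```python
import math

def lz_statistic(responses, probs):
    """Standardized log-likelihood person-fit statistic (l_z) for
    dichotomous items. `responses` are 0/1 item scores; `probs` are the
    model-implied probabilities of a correct response."""
    l0 = sum(u * math.log(p) + (1 - u) * math.log(1 - p)
             for u, p in zip(responses, probs))
    expected = sum(p * math.log(p) + (1 - p) * math.log(1 - p)
                   for p in probs)
    variance = sum(p * (1 - p) * math.log(p / (1 - p)) ** 2
                   for p in probs)
    return (l0 - expected) / math.sqrt(variance)

# A pattern consistent with the model gives l_z at or above zero; an
# aberrant pattern (missing items the model calls easy) pushes it negative.
consistent = lz_statistic([1] * 20, [0.8] * 20)
aberrant = lz_statistic([0] * 20, [0.8] * 20)
```

Under the usual assumption, l_z is referred to a standard normal null distribution; the finding above is precisely that this reference distribution can be inaccurate, especially in CAT.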
Peer reviewed
Samejima, Fumiko – Applied Psychological Measurement, 1994
The reliability coefficient is predicted from the test information function (TIF) or two modified TIF formulas and a specific trait distribution. Examples illustrate the variability of the reliability coefficient across different trait distributions, and results are compared with empirical reliability coefficients. (SLD)
Descriptors: Adaptive Testing, Error of Measurement, Estimation (Mathematics), Reliability
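The prediction works from the test information function and an assumed trait distribution. A minimal numerical sketch of the general relation (the classical rho = var(theta) / (var(theta) + mean error variance), with conditional error variance 1/I(theta); not the article's specific modified-TIF formulas) could be:

```python
import math

def predicted_reliability(info, trait_pdf, grid):
    """Predict a reliability coefficient from a test information
    function `info` and a trait density `trait_pdf`, integrating
    numerically over `grid`. Error variance at theta is 1 / I(theta)."""
    weights = [trait_pdf(t) for t in grid]
    total = sum(weights)
    weights = [w / total for w in weights]            # normalize on the grid
    mean = sum(w * t for w, t in zip(weights, grid))
    var_theta = sum(w * (t - mean) ** 2 for w, t in zip(weights, grid))
    err_var = sum(w / info(t) for w, t in zip(weights, grid))  # E[1/I(theta)]
    return var_theta / (var_theta + err_var)

# Example: flat information of 10 under a standard normal trait distribution
grid = [i / 10 for i in range(-40, 41)]
std_normal = lambda t: math.exp(-t * t / 2) / math.sqrt(2 * math.pi)
rho = predicted_reliability(lambda t: 10.0, std_normal, grid)
```

Swapping in a different trait density shows the variability the article illustrates: the same TIF yields different predicted reliabilities under different trait distributions.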
Peer reviewed
van Krimpen-Stoop, Edith M. L. A.; Meijer, Rob – Applied Psychological Measurement, 1999
Theoretical null distributions of several fit statistics have been derived for paper-and-pencil tests. Examined through simulation whether these distributions also hold for computerized adaptive tests. Rates for the two statistics studied were found to be similar in most cases. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Goodness of Fit, Item Response Theory
Peer reviewed
Nering, Michael L. – Applied Psychological Measurement, 1997
Evaluated the distribution of person-fit indexes within the computerized adaptive testing (CAT) environment through simulation. Found that, within the CAT environment, these indexes tend not to follow a standard normal distribution. Person-fit indexes had means and standard deviations that were quite different from those expected. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Error of Measurement, Item Response Theory
Peer reviewed
Berger, Martijn P. F. – Applied Psychological Measurement, 1994
This paper focuses on similarities of optimal design of fixed-form tests, adaptive tests, and testlets within the framework of the general theory of optimal designs. A sequential design procedure is proposed that uses these similarities to obtain consistent estimates for the trait level distribution. (SLD)
Descriptors: Achievement Tests, Adaptive Testing, Algorithms, Estimation (Mathematics)
Peer reviewed
Dodd, Barbara G.; Koch, William R. – Educational and Psychological Measurement, 1994
Simulated data were used to investigate the impact of characteristics of threshold values (number, symmetry, and distance between adjacent threshold values) and delta values on the distribution of item information in the successive intervals Rasch model. Implications for computerized adaptive attitude measurement are discussed. (SLD)
Descriptors: Adaptive Testing, Attitude Measures, Computer Assisted Testing, Item Response Theory
Peer reviewed
Lin, Miao-Hsiang; Hsiung, Chao A. – Psychometrika, 1994
Two simple empirical approximate Bayes estimators are introduced for estimating domain scores under binomial and hypergeometric distributions, respectively. Criteria are established for preferring these estimators over their maximum likelihood counterparts. (SLD)
Descriptors: Adaptive Testing, Bayesian Statistics, Computation, Equations (Mathematics)
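For the binomial case, an empirical approximate Bayes estimator of this general flavor can be sketched as beta-binomial shrinkage, with the prior fit by method of moments across examinees. This is an illustrative assumption about the construction, not necessarily the authors' exact estimator:

```python
def eb_domain_scores(xs, n):
    """Empirical Bayes domain-score estimates under a binomial model:
    each examinee answers n items and gets x_i correct. A beta(a, b)
    prior is fit by method of moments across examinees, and each score
    is shrunk toward the group mean: (x_i + a) / (n + a + b).
    Falls back to the ML estimate x/n when the moment fit is degenerate."""
    m = len(xs)
    pbar = sum(xs) / (m * n)
    s2 = sum((x / n - pbar) ** 2 for x in xs) / (m - 1)
    # Remove binomial sampling variance to isolate prior variance
    prior_var = s2 - pbar * (1 - pbar) / n
    if prior_var <= 0:
        return [x / n for x in xs]                    # ML fallback
    k = pbar * (1 - pbar) / prior_var - 1
    a, b = pbar * k, (1 - pbar) * k
    return [(x + a) / (n + a + b) for x in xs]

estimates = eb_domain_scores([2, 5, 8], 10)
```

Relative to the ML estimates x/n, the shrunken scores pull extreme examinees toward the group mean, which is the usual argument for preferring the empirical Bayes version when examinees take short tests.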
Peer reviewed
Hankins, Janette A. – Educational and Psychological Measurement, 1990
The effects of fixed and variable entry procedures on the bias and information of a Bayesian adaptive test were compared. Neither procedure produced biased ability estimates on average. Bias at the distribution extremes, efficiency curves, item subsets generated for administration, and the number of items required to reach termination are discussed. (TJH)
Descriptors: Adaptive Testing, Aptitude Tests, Bayesian Statistics, Comparative Analysis
Peer reviewed
Wainer, Howard; And Others – Journal of Educational Measurement, 1992
Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation
Peer reviewed
Dodd, Barbara G.; And Others – Educational and Psychological Measurement, 1993
Effects of the following variables on performance of computerized adaptive testing (CAT) procedures for the partial credit model (PCM) were studied: (1) stopping rule for terminating CAT; (2) item pool size; and (3) distribution of item difficulties. Implications of findings for CAT systems based on the PCM are discussed. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Simulation, Difficulty Level