ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	8

Descriptor

Ability	18
Test Items	18
Test Length	18
Item Response Theory	9
Adaptive Testing	7
Computer Assisted Testing	6
Accuracy	5
Bayesian Statistics	5
Computation	5
Sample Size	5
Simulation	5
Test Format	5
Error of Measurement	4
Estimation (Mathematics)	4
Item Banks	4
Test Construction	4
Comparative Analysis	3
Differences	3
Difficulty Level	3
Maximum Likelihood Statistics	3
Test Bias	3
Achievement Tests	2
Classification	2
Computer Simulation	2
Higher Education	2
More ▼

Source

Educational and Psychological…	3
Applied Measurement in…	1
Applied Psychological…	1
ETS Research Report Series	1
International Journal of…	1
International Journal of…	1
Measurement:…	1
ProQuest LLC	1
Psychometrika	1

Publication Type

Journal Articles	10
Reports - Research	10
Reports - Evaluative	7
Speeches/Meeting Papers	6
Dissertations/Theses -…	1
Reports - Descriptive	1

Education Level

Early Childhood Education	1
Preschool Education	1
Secondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	1
COMPASS (Computer Assisted…	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

The Effect of Polytomous Item Ratio on Ability Estimation in Multistage Tests

Peer reviewed
PDF on ERIC

Download full text

Hasibe Yahsi Sari; Hulya Kelecioglu – International Journal of Assessment Tools in Education, 2025

The aim of the study is to examine the effect of polytomous item ratio on ability estimation in different conditions in multistage tests (MST) using mixed tests. The study is simulation-based research. In the PISA 2018 application, the ability parameters of the individuals and the item pool were created by using the item parameters estimated from…

Descriptors: Test Items, Test Format, Accuracy, Test Length

The Matching Criterion Purification for Differential Item Functioning Analyses in a Large-Scale Assessment

Peer reviewed

Direct link

Lee, HyeSun; Geisinger, Kurt F. – Educational and Psychological Measurement, 2016

The current study investigated the impact of matching criterion purification on the accuracy of differential item functioning (DIF) detection in large-scale assessments. The three matching approaches for DIF analyses (block-level matching, pooled booklet matching, and equated pooled booklet matching) were employed with the Mantel-Haenszel…

Descriptors: Test Bias, Measurement, Accuracy, Statistical Analysis

Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

Peer reviewed

Direct link

Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…

Descriptors: Test Bias, Test Reliability, Performance, Scores

The Influence of Item Calibration Error on Variable-Length Computerized Adaptive Testing

Peer reviewed

Direct link

Patton, Jeffrey M.; Cheng, Ying; Yuan, Ke-Hai; Diao, Qi – Applied Psychological Measurement, 2013

Variable-length computerized adaptive testing (VL-CAT) allows both items and test length to be "tailored" to examinees, thereby achieving the measurement goal (e.g., scoring precision or classification) with as few items as possible. Several popular test termination rules depend on the standard error of the ability estimate, which in turn depends…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Length, Ability

Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

Direct link

Wang, Wei – ProQuest LLC, 2013

Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…

Descriptors: Equated Scores, Test Format, Test Items, Test Length

Item Pool Design for an Operational Variable-Length Computerized Adaptive Test

Peer reviewed

Direct link

He, Wei; Reckase, Mark D. – Educational and Psychological Measurement, 2014

For computerized adaptive tests (CATs) to work well, they must have an item pool with sufficient numbers of good quality items. Many researchers have pointed out that, in developing item pools for CATs, not only is the item pool size important but also the distribution of item parameters and practical considerations such as content distribution…

Descriptors: Item Banks, Test Length, Computer Assisted Testing, Adaptive Testing

Treatment of Not-Administered Items on Individually Administered Intelligence Tests

Peer reviewed

Direct link

He, Wei; Wolfe, Edward W. – Educational and Psychological Measurement, 2012

In administration of individually administered intelligence tests, items are commonly presented in a sequence of increasing difficulty, and test administration is terminated after a predetermined number of incorrect answers. This practice produces stochastically censored data, a form of nonignorable missing data. By manipulating four factors…

Descriptors: Individual Testing, Intelligence Tests, Test Items, Test Length

Comparing Different Approaches of Bias Correction for Ability Estimation in IRT Models. Research Report. ETS RR-08-13

Peer reviewed
PDF on ERIC

Download full text

Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2008

The method of maximum-likelihood is typically applied to item response theory (IRT) models when the ability parameter is estimated while conditioning on the true item parameters. In practice, the item parameters are unknown and need to be estimated first from a calibration sample. Lewis (1985) and Zhang and Lu (2007) proposed the expected response…

Descriptors: Item Response Theory, Comparative Analysis, Computation, Ability

The Classification Accuracy of Shortened versus Full Length Tests with Number Correct Scoring.

Download full text

Schulz, E. Matthew; Wang, Lin – 2001

In this study, items were drawn from a full-length test of 30 items in order to construct shorter tests for the purpose of making accurate pass/fail classifications with regard to a specific criterion point on the latent ability metric. A three-item parameter Item Response Theory (IRT) framework was used. The criterion point on the latent ability…

Descriptors: Ability, Classification, Item Response Theory, Pass Fail Grading

An Investigation of Hierarchical Bayes Procedures in Item Response Theory.

Peer reviewed

Kim, Seock-Ho; And Others – Psychometrika, 1994

Hierarchical Bayes procedures for the two-parameter logistic item response model were compared for estimating item and ability parameters through two joint and two marginal Bayesian procedures. Marginal procedures yielded smaller root mean square differences for item and ability, but results for larger sample size and test length were similar.…

Descriptors: Ability, Bayesian Statistics, Computer Simulation, Estimation (Mathematics)

An Analytical Evaluation of Two Common-Odds Ratios as Population Indicators of DIF.

Download full text

Pommerich, Mary; And Others – 1995

The Mantel-Haenszel (MH) statistic for identifying differential item functioning (DIF) commonly conditions on the observed test score as a surrogate for conditioning on latent ability. When the comparison group distributions are not completely overlapping (i.e., are incongruent), the observed score represents different levels of latent ability…

Descriptors: Ability, Comparative Analysis, Difficulty Level, Item Bias

Inferring Examinee Ability When Some Item Responses Are Missing.

Download full text

Mislevy, Robert J.; Wu, Pao-Kuei – 1988

The basic equations of item response theory provide a foundation for inferring examinees' abilities and items' operating characteristics from observed responses. In practice, though, examinees will usually not have provided a response to every available item--for reasons that may or may not have been intended by the test administrator, and that…

Descriptors: Ability, Adaptive Testing, Equations (Mathematics), Estimation (Mathematics)

Testing the Robustness of DIMTEST on Nonnormal Ability Distributions.

Download full text

Nandakumar, Ratna; Yu, Feng – 1994

DIMTEST is a statistical test procedure for assessing essential unidimensionality of binary test item responses. The test statistic T used for testing the null hypothesis of essential unidimensionality is a nonparametric statistic. That is, there is no particular parametric distribution assumed for the underlying ability distribution or for the…

Descriptors: Ability, Content Validity, Correlation, Nonparametric Statistics

Monte Carlo Simulation Comparison of Two-Stage Testing and Computerized Adaptive Testing.

Download full text

Kim, Haeok; Plake, Barbara S. – 1993

A two-stage testing strategy is one method of adapting the difficulty of a test to an individual's ability level in an effort to achieve more precise measurement. A routing test provides an initial estimate of ability level, and a second-stage measurement test then evaluates the examinee further. The measurement accuracy and efficiency of item…

Descriptors: Ability, Adaptive Testing, Comparative Testing, Computer Assisted Testing

A Short and Simple Introduction to Tailored Testing.

Download full text

Rudner, Lawrence M. – 1978

Tailored testing provides the same information as group-administered standardized tests, but can do so using fewer items because the items administered are selected for the ability of the individual student. Thus, tailored testing offers several advantages over traditional methods. Because individual tailored tests are not timed, anxiety is…

Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing

Previous Page | Next Page »

Pages: 1 | 2

He, Wei	2
Lee, Yi-Hsuan	2
Reckase, Mark D.	2
Zhang, Jinming	2
Bergstrom, Betty A.	1
Cheng, Ying	1
Diao, Qi	1
Embretson, Susan E.	1
Geisinger, Kurt F.	1
Hasibe Yahsi Sari	1
Hulya Kelecioglu	1
Kim, Haeok	1
Kim, Seock-Ho	1
Lee, HyeSun	1
Mislevy, Robert J.	1
Nandakumar, Ratna	1
Patton, Jeffrey M.	1
Plake, Barbara S.	1
Pommerich, Mary	1
Rudner, Lawrence M.	1
Schulz, E. Matthew	1
Spray, Judith A.	1
Wang, Lin	1
Wang, Wei	1
More ▼