ERIC - Search Results

Showing all 9 results

Number of Response Categories and Sample Size Requirements in Polytomous IRT Models

Peer reviewed

Dubravka Svetina Valdivia; Shenghai Dai – Journal of Experimental Education, 2024

Applications of polytomous IRT models in applied fields (e.g., health, education, psychology) abound. However, little is known about the impact of the number of categories and sample size requirements for precise parameter recovery. In a simulation study, we investigated the impact of the number of response categories and required sample size…

Descriptors: Item Response Theory, Sample Size, Models, Classification

The Effects of Test Length and Sample Size on Item Parameters in Item Response Theory

Peer reviewed

Sahin, Alper; Anil, Duygu – Educational Sciences: Theory and Practice, 2017

This study investigates the effects of sample size and test length on item-parameter estimation in test development utilizing three unidimensional dichotomous models of item response theory (IRT). For this purpose, a real language test composed of 50 items was administered to 6,288 students. Data from this test were used to obtain data sets of…

Descriptors: Test Length, Sample Size, Item Response Theory, Test Construction

Examination of the Parameter Estimate Bias When Violating the Orthogonality Assumption of the Bifactor Model

Zheng, Chunmei – ProQuest LLC, 2013

Educational and psychological constructs are normally measured by multifaceted dimensions. The measured construct is defined and measured by a set of related subdomains. A bifactor model can accurately describe such data with both the measured construct and the related subdomains. However, a limitation of the bifactor model is the orthogonality…

Descriptors: Educational Testing, Measurement Techniques, Test Items, Models

Minimum Sample Size Requirements for Mokken Scale Analysis

Peer reviewed

Straat, J. Hendrik; van der Ark, L. Andries; Sijtsma, Klaas – Educational and Psychological Measurement, 2014

An automated item selection procedure in Mokken scale analysis partitions a set of items into one or more Mokken scales, if the data allow. Two algorithms are available that pursue the same goal of selecting Mokken scales of maximum length: Mokken's original automated item selection procedure (AISP) and a genetic algorithm (GA). Minimum…

Descriptors: Sampling, Test Items, Effect Size, Scaling

Parallel Test Construction Using Classical Item Parameters.

Peer reviewed

Sanders, Piet F.; Verschoor, Alfred J. – Applied Psychological Measurement, 1998

Presents minimization and maximization models for parallel test construction under constraints. The minimization model constructs weakly and strongly parallel tests of minimum length, while the maximization model constructs weakly and strongly parallel tests with maximum test reliability. (Author/SLD)

Descriptors: Algorithms, Models, Reliability, Test Construction

Multidimensional Test Assembly Based on Lagrangian Relaxation Techniques. Research Report 98-08.

Veldkamp, Bernard P. – 1998

In this paper, a mathematical programming approach is presented for the assembly of ability tests measuring multiple traits. The values of the variance functions of the estimators of the traits are minimized, while test specifications are met. The approach is based on Lagrangian relaxation techniques and provides good results for the two…

Descriptors: Ability, Estimation (Mathematics), Foreign Countries, Item Banks

Second-Order Confirmatory Factor Analysis of the "Reactions to Tests" Scale with Cross-Validation.

Peer reviewed

Benson, Jeri; Bandalos, Deborah L. – Multivariate Behavioral Research, 1992

Factor structure of the Reactions to Tests (RTT) scale measuring test anxiety was studied by testing a series of confirmatory factor models including a second-order structure with 636 college students. Results support a shorter 20-item RTT but also raise questions about the cross-validation of covariance models. (SLD)

Descriptors: College Students, Factor Analysis, Factor Structure, Higher Education

A Comparison of an Expert Systems Approach to Computerized Adaptive Testing and an Item Response Theory Model.

Frick, Theodore W. – 1991

Expert systems can be used to aid decisionmaking. A computerized adaptive test is one kind of expert system, although not commonly recognized as such. A new approach, termed EXSPRT, was devised that combines expert systems reasoning and sequential probability ratio test stopping rules. Two versions of EXSPRT were developed, one with random…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Expert Systems

An Adaptive Algebra Test: A Testlet-Based, Hierarchically-Structured Test with Validity-Based Scoring. Technical Report No. 90-92.

Wainer, Howard; And Others – 1990

The initial development of a testlet-based algebra test was previously reported (Wainer and Lewis, 1990). This account provides the details of this excursion into the use of hierarchical testlets and validity-based scoring. A pretest of two 15-item hierarchical testlets was carried out in which examinees' performance on a 4-item subset of each…

Descriptors: Adaptive Testing, Algebra, Comparative Analysis, Computer Assisted Testing