ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	13
Since 2006 (last 20 years)	22

Descriptor

Test Construction	65
Test Items	65
Test Length	65
Computer Assisted Testing	21
Adaptive Testing	18
Test Reliability	17
Item Banks	15
Test Validity	14
Item Analysis	13
Test Format	13
Testing Problems	11
Difficulty Level	10
Item Response Theory	9
Comparative Analysis	8
Latent Trait Theory	8
Simulation	8
Higher Education	7
Mastery Tests	7
Psychometrics	7
Accuracy	6
Achievement Tests	6
Cutting Scores	6
Mathematical Models	6
Sample Size	6
Bayesian Statistics	5
More ▼

Publication Type

Reports - Research	43
Journal Articles	25
Speeches/Meeting Papers	13
Reports - Evaluative	11
Numerical/Quantitative Data	5
Reports - Descriptive	5
Guides - Non-Classroom	4
Dissertations/Theses -…	3
Information Analyses	2
Opinion Papers	2
Historical Materials	1
Tests/Questionnaires	1
More ▼

Education Level

Elementary Education	2
Elementary Secondary Education	2
Grade 3	1
Grade 6	1
High Schools	1
Higher Education	1
Intermediate Grades	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Researchers	4
Administrators	1

Location

Asia	1
Australia	1
Israel	1
New Jersey	1

Laws, Policies, & Programs

Race to the Top

Assessments and Surveys

Test of English as a Foreign…	2
COMPASS (Computer Assisted…	1
Program for International…	1
Raven Advanced Progressive…	1
School and College Ability…	1
Trends in International…	1
Wechsler Intelligence Scale…	1
Wechsler Intelligence Scales…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 65 results Save | Export

The Impact of Scoring Later on Mixed Format Adaptive Testing

Direct link

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

The Effect of Ratio of Items Indicating Differential Item Functioning on Computer Adaptive and Multi-Stage Tests

Peer reviewed
PDF on ERIC

Download full text

Erdem-Kara, Basak; Dogan, Nuri – International Journal of Assessment Tools in Education, 2022

Recently, adaptive test approaches have become a viable alternative to traditional fixed-item tests. The main advantage of adaptive tests is that they reach desired measurement precision with fewer items. However, fewer items mean that each item has a more significant effect on ability estimation and therefore those tests are open to more…

Descriptors: Item Analysis, Computer Assisted Testing, Test Items, Test Construction

A Comparison of Metaheuristic Optimization Algorithms for Scale Short-Form Development

Peer reviewed

Direct link

Raborn, Anthony W.; Leite, Walter L.; Marcoulides, Katerina M. – Educational and Psychological Measurement, 2020

This study compares automated methods to develop short forms of psychometric scales. Obtaining a short form that has both adequate internal structure and strong validity with respect to relationships with other variables is difficult with traditional methods of short-form development. Metaheuristic algorithms can select items for short forms while…

Descriptors: Test Construction, Automation, Heuristics, Mathematics

An Empirical Research on Identifiability and Q-Matrix Design for DINA Model

Peer reviewed
PDF on ERIC

Download full text

Xu, Peng; Desmarais, Michel C. – International Educational Data Mining Society, 2018

In most contexts of student skills assessment, whether the test material is administered by the teacher or within a learning environment, there is a strong incentive to minimize the number of questions or exercises administered in order to get an accurate assessment. This minimization objective can be framed as a Q-matrix design problem: given a…

Descriptors: Test Items, Accuracy, Test Construction, Skills

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Dynamic Multistage Testing: A Highly Efficient and Regulated Adaptive Testing Method

Peer reviewed

Direct link

Luo, Xiao; Wang, Xinrui – International Journal of Testing, 2019

This study introduced dynamic multistage testing (dy-MST) as an improvement to existing adaptive testing methods. dy-MST combines the advantages of computerized adaptive testing (CAT) and computerized adaptive multistage testing (ca-MST) to create a highly efficient and regulated adaptive testing method. In the test construction phase, multistage…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Construction, Psychometrics

Routing Strategies and Optimizing Design for Multistage Testing in International Large-Scale Assessments

Peer reviewed

Direct link

Svetina, Dubravka; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2019

This study investigates the effect of several design and administration choices on item exposure and person/item parameter recovery under a multistage test (MST) design. In a simulation study, we examine whether number-correct (NC) or item response theory (IRT) methods are differentially effective at routing students to the correct next stage(s)…

Descriptors: Measurement, Item Analysis, Test Construction, Item Response Theory

Different Methods of Adjusting for Form Difficulty under the Rasch Model: Impact on Consistency of Assessment Results. Research Report. ETS RR-19-08

Peer reviewed
PDF on ERIC

Download full text

Manna, Venessa F.; Gu, Lixiong – ETS Research Report Series, 2019

When using the Rasch model, equating with a nonequivalent groups anchor test design is commonly achieved by adjustment of new form item difficulty using an additive equating constant. Using simulated 5-year data, this report compares 4 approaches to calculating the equating constants and the subsequent impact on equating results. The 4 approaches…

Descriptors: Item Response Theory, Test Items, Test Construction, Sample Size

Illustration of a Survey Refinement Process Using Psychometric Analysis

Peer reviewed

Direct link

Smith, William Zachary; Dickenson, Tammiee S.; Rogers, Bradley David – AERA Online Paper Repository, 2017

Questionnaire refinement and a process for selecting items for elimination are important tools for survey developers. One of the major obstacles in questionnaire refinement and elimination in surveys lies in one's ability to adequately and appropriately reconstruct a survey. Often times, surveys can be long and strenuous on the respondent,…

Descriptors: Surveys, Psychometrics, Test Construction, Test Reliability

The Impact of Q-Matrix Designs on Diagnostic Classification Accuracy in the Presence of Attribute Hierarchies

Peer reviewed

Direct link

Liu, Ren; Huggins-Manley, Anne Corinne; Bradshaw, Laine – Educational and Psychological Measurement, 2017

There is an increasing demand for assessments that can provide more fine-grained information about examinees. In response to the demand, diagnostic measurement provides students with feedback on their strengths and weaknesses on specific skills by classifying them into mastery or nonmastery attribute categories. These attributes often form a…

Descriptors: Matrices, Classification, Accuracy, Diagnostic Tests

Designing CAT MOCCA: Guiding Principles and Simulation Research. MOCCA Technical Report MTR-2021-1

Peer reviewed
PDF on ERIC

Download full text

Mark L. Davison; David J. Weiss; Ozge Ersan; Joseph N. DeWeese; Gina Biancarosa; Patrick C. Kennedy – Grantee Submission, 2021

MOCCA is an online assessment of inferential reading comprehension for students in 3rd through 6th grades. It can be used to identify good readers and, for struggling readers, identify those who overly rely on either a Paraphrasing process or an Elaborating process when their comprehension is incorrect. Here a propensity to over-rely on…

Descriptors: Reading Tests, Computer Assisted Testing, Reading Comprehension, Elementary School Students

The Effects of Test Length and Sample Size on Item Parameters in Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Sahin, Alper; Anil, Duygu – Educational Sciences: Theory and Practice, 2017

This study investigates the effects of sample size and test length on item-parameter estimation in test development utilizing three unidimensional dichotomous models of item response theory (IRT). For this purpose, a real language test comprised of 50 items was administered to 6,288 students. Data from this test was used to obtain data sets of…

Descriptors: Test Length, Sample Size, Item Response Theory, Test Construction

An Information-Correction Method for Testlet-Based Test Analysis: From the Perspectives of Item Response Theory and Generalizability Theory. Research Report. ETS RR-17-27

Peer reviewed
PDF on ERIC

Download full text

Li, Feifei – ETS Research Report Series, 2017

An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…

Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement

Examination of the Parameter Estimate Bias When Violating the Orthogonality Assumption of the Bifactor Model

Direct link

Zheng, Chunmei – ProQuest LLC, 2013

Educational and psychological constructs are normally measured by multifaceted dimensions. The measured construct is defined and measured by a set of related subdomains. A bifactor model can accurately describe such data with both the measured construct and the related subdomains. However, a limitation of the bifactor model is the orthogonality…

Descriptors: Educational Testing, Measurement Techniques, Test Items, Models

Minimum Sample Size Requirements for Mokken Scale Analysis

Peer reviewed

Direct link

Straat, J. Hendrik; van der Ark, L. Andries; Sijtsma, Klaas – Educational and Psychological Measurement, 2014

An automated item selection procedure in Mokken scale analysis partitions a set of items into one or more Mokken scales, if the data allow. Two algorithms are available that pursue the same goal of selecting Mokken scales of maximum length: Mokken's original automated item selection procedure (AISP) and a genetic algorithm (GA). Minimum…

Descriptors: Sampling, Test Items, Effect Size, Scaling

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Educational and Psychological…	9
Journal of Educational…	5
ProQuest LLC	3
ETS Research Report Series	2
Grantee Submission	2
AERA Online Paper Repository	1
Applied Measurement in…	1
Education Week	1
Education and Information…	1
Educational Sciences: Theory…	1
International Educational…	1
International Journal of…	1
International Journal of…	1
Journal of Experimental…	1
Journal of Technology,…	1
Measurement:…	1
OECD Publishing (NJ1)	1
Office of Education, United…	1
Research in the Schools	1
More ▼

Wainer, Howard	5
Hambleton, Ronald K.	4
Berk, Ronald A.	3
Reckase, Mark D.	3
Wilcox, Rand R.	2
Anil, Duygu	1
Arthur, Winfred, Jr.	1
Batinic, Bernad	1
Bergstrom, Betty	1
Boyd, Thomas A.	1
Bradshaw, Laine	1
Bruce, K.	1
Budescu, David V.	1
Byars, Alvin Gregg	1
Catts, Ralph	1
Changas, Paul S.	1
Cleary, T. Anne	1
Clements, Andrea D.	1
Conger, Anthony J.	1
Cook, Linda L.	1
Davey, Tim	1
David J. Weiss	1
Day, David V.	1
De Gruijter, Dato N. M.	1
More ▼