ERIC - Search Results

Showing 1 to 15 of 35 results

Online Calibration in Multidimensional Computerized Adaptive Testing with Polytomously Scored Items

Peer reviewed

Yuan, Lu; Huang, Yingshi; Li, Shuhang; Chen, Ping – Journal of Educational Measurement, 2023

Online calibration is a key technology for item calibration in computerized adaptive testing (CAT) and has been widely used in various forms of CAT, including unidimensional CAT, multidimensional CAT (MCAT), CAT with polytomously scored items, and cognitive diagnostic CAT. However, as multidimensional and polytomous assessment data become more…

Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Test Items

Technology-Enhanced Items and Model-Data Misfit. Research Report. ETS RR-22-11

Peer reviewed

Carol Eckerly; Yue Jia; Paul Jewsbury – ETS Research Report Series, 2022

Testing programs have explored the use of technology-enhanced items alongside traditional item types (e.g., multiple-choice and constructed-response items) as measurement evidence of latent constructs modeled with item response theory (IRT). In this report, we discuss considerations in applying IRT models to a particular type of adaptive testlet…

Descriptors: Computer Assisted Testing, Test Items, Item Response Theory, Scoring

Evaluating Different Scoring Methods for Multiple Response Items Providing Partial Credit

Peer reviewed

Betts, Joe; Muntean, William; Kim, Doyoung; Kao, Shu-chuan – Educational and Psychological Measurement, 2022

The multiple response structure can underlie several different technology-enhanced item types. With the increased use of computer-based testing, multiple response items are becoming more common. This response type holds the potential for being scored polytomously for partial credit. However, there are several possible methods for computing raw…

Descriptors: Scoring, Test Items, Test Format, Raw Scores

Developing Multistage Tests Using "D"-Scoring Method

Peer reviewed

Han, Kyung T.; Dimitrov, Dimiter M.; Al-Mashary, Faisal – Educational and Psychological Measurement, 2019

The "D"-scoring method for scoring and equating tests with binary items proposed by Dimitrov offers some of the advantages of item response theory, such as item-level difficulty information and score computation that reflects the item difficulties, while retaining the merits of classical test theory such as the simplicity of number…

Descriptors: Test Construction, Scoring, Test Items, Adaptive Testing

Imputation Methods to Deal with Missing Responses in Computerized Adaptive Multistage Testing

Peer reviewed

Cetin-Berber, Dee Duygu; Sari, Halil Ibrahim; Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2019

Routing examinees to modules based on their ability level is a very important aspect in computerized adaptive multistage testing. However, the presence of missing responses may complicate estimation of examinee ability, which may result in misrouting of individuals. Therefore, missing responses should be handled carefully. This study investigated…

Descriptors: Computer Assisted Testing, Adaptive Testing, Error of Measurement, Research Problems

Computer Adaptive Language Testing According to NATO STANAG 6001 Requirements

Peer reviewed

Gawliczek, Piotr; Krykun, Viktoriia; Tarasenko, Nataliya; Tyshchenko, Maksym; Shapran, Oleksandr – Advanced Education, 2021

The article deals with the innovative, cutting-edge solution within the language testing realm, namely computer adaptive language testing (CALT) in accordance with the NATO Standardization Agreement 6001 (NATO STANAG 6001) requirements for further implementation in foreign language training of personnel of the Armed Forces of Ukraine (AF of…

Descriptors: Computer Assisted Testing, Adaptive Testing, Language Tests, Second Language Instruction

PIAAC: A New Design for a New Era

Peer reviewed

Kirsch, Irwin; Lennon, Mary Louise – Large-scale Assessments in Education, 2017

As the largest and most innovative international assessment of adults, PIAAC marks an inflection point in the evolution of large-scale comparative assessments. PIAAC grew from the foundation laid by surveys that preceded it, and introduced innovations that have shifted the way we conceive and implement large-scale assessments. As the first fully…

Descriptors: International Assessment, Adults, Measurement, Surveys

Accuracy of a Classical Test Theory-Based Procedure for Estimating the Reliability of a Multistage Test. Research Report. ETS RR-17-02

Peer reviewed

Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017

The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…

Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing

The New Computer Adaptive Test of Size and Strength (CATSS): Development and Validation

Peer reviewed

Aviad-Levitzky, Tami; Laufer, Batia; Goldstein, Zahava – Language Assessment Quarterly, 2019

This article describes the development and validation of the new CATSS (Computer Adaptive Test of Size and Strength), which measures vocabulary knowledge in four modalities -- productive recall, receptive recall, productive recognition, and receptive recognition. In the first part of the paper we present the assumptions that underlie the test --…

Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability

Effectiveness of Item Response Theory (IRT) Proficiency Estimation Methods under Adaptive Multistage Testing. Research Report. ETS RR-15-11

Peer reviewed

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015

The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…

Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement

Improving Personality Facet Scores with Multidimensional Computer Adaptive Testing: An Illustration with the NEO PI-R

Peer reviewed

Makransky, Guido; Mortensen, Erik Lykke; Glas, Cees A. W. – Assessment, 2013

Narrowly defined personality facet scores are commonly reported and used for making decisions in clinical and organizational settings. Although these facets are typically related, scoring is usually carried out for a single facet at a time. This method can be ineffective and time consuming when personality tests contain many highly correlated…

Descriptors: Computer Assisted Testing, Adaptive Testing, Personality Measures, Accuracy

An Item-Driven Adaptive Design for Calibrating Pretest Items. Research Report. ETS RR-14-38

Peer reviewed

Ali, Usama S.; Chang, Hua-Hua – ETS Research Report Series, 2014

Adaptive testing is advantageous in that it provides more efficient ability estimates with fewer items than linear testing does. Item-driven adaptive pretesting may also offer similar advantages, and verification of such a hypothesis about item calibration was the main objective of this study. A suitability index (SI) was introduced to adaptively…

Descriptors: Adaptive Testing, Simulation, Pretests Posttests, Test Items

Curtailment and Stochastic Curtailment to Shorten the CES-D

Peer reviewed

Finkelman, Matthew D.; Smits, Niels; Kim, Wonsuk; Riley, Barth – Applied Psychological Measurement, 2012

The Center for Epidemiologic Studies-Depression (CES-D) scale is a well-known self-report instrument that is used to measure depressive symptomatology. Respondents who take the full-length version of the CES-D are administered a total of 20 items. This article investigates the use of curtailment and stochastic curtailment (SC), two sequential…

Descriptors: Measures (Individuals), Depression (Psychology), Test Length, Computer Assisted Testing

Computerized Adaptive Testing with the Zinnes and Griggs Pairwise Preference Ideal Point Model

Peer reviewed

Stark, Stephen; Chernyshenko, Oleksandr S. – International Journal of Testing, 2011

This article delves into a relatively unexplored area of measurement by focusing on adaptive testing with unidimensional pairwise preference items. The use of such tests is becoming more common in applied non-cognitive assessment because research suggests that this format may help to reduce certain types of rater error and response sets commonly…

Descriptors: Test Length, Simulation, Adaptive Testing, Item Analysis

Potential Impact of Context Effects on the Scoring and Equating of the Multistage GRE® Revised General Test. ETS GRE® Board Research Report. ETS GRE® GREB-08-01. ETS Research Report. RR-11-26

Peer reviewed

Davey, Tim; Lee, Yi-Hsuan – ETS Research Report Series, 2011

Both theoretical and practical considerations have led the revision of the Graduate Record Examinations® (GRE®) revised General Test, here called the rGRE, to adopt a multistage adaptive design that will be continuously or nearly continuously administered and that can provide immediate score reporting. These circumstances sharply constrain the…

Descriptors: Context Effect, Scoring, Equated Scores, College Entrance Examinations
