ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	8
Since 2017 (last 10 years)	13
Since 2007 (last 20 years)	22

Descriptor

Simulation	34
Test Format	34
Test Items	34
Item Response Theory	18
Comparative Analysis	14
Computer Assisted Testing	10
Adaptive Testing	9
Sample Size	7
Scores	7
Equated Scores	6
Models	6
Statistical Analysis	6
Test Bias	6
Test Construction	6
Ability	5
Difficulty Level	5
Error of Measurement	5
Guidelines	4
Item Analysis	4
Test Length	4
Bayesian Statistics	3
Computation	3
Foreign Countries	3
Psychometrics	3
Test Reliability	3
More ▼

Source

Educational and Psychological…	4
ProQuest LLC	4
Journal of Educational…	3
ETS Research Report Series	2
International Journal of…	2
Academic Medicine	1
Applied Measurement in…	1
Applied Psychological…	1
Assessment & Evaluation in…	1
Education and Information…	1
Educational Measurement:…	1
Eurasian Journal of…	1
Grantee Submission	1
Journal of Psychoeducational…	1
Practical Assessment,…	1
Quality Assurance in…	1
More ▼

Publication Type

Journal Articles	21
Reports - Research	21
Reports - Evaluative	6
Speeches/Meeting Papers	6
Dissertations/Theses -…	4
Opinion Papers	1
Reports - Descriptive	1

Education Level

Higher Education	3
Postsecondary Education	2
Grade 9	1
High Schools	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Administrators	1
Practitioners	1
Teachers	1

Location

Germany	1
Netherlands	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Advanced Placement…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 34 results Save | Export

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Modeling Slipping Effects in a Large-Scale Assessment with Innovative Item Formats

Peer reviewed

Direct link

Cuhadar, Ismail; Binici, Salih – Educational Measurement: Issues and Practice, 2022

This study employs the 4-parameter logistic item response theory model to account for the unexpected incorrect responses or slipping effects observed in a large-scale Algebra 1 End-of-Course assessment, including several innovative item formats. It investigates whether modeling the misfit at the upper asymptote has any practical impact on the…

Descriptors: Item Response Theory, Measurement, Student Evaluation, Algebra

Impacts of Differences in Group Abilities and Anchor Test Features on Three Non-IRT Test Equating Methods

Peer reviewed
PDF on ERIC

Download full text

Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024

The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Postratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…

Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests

The Impact of Local Item Dependence on Computer Adaptive Testing Given between and within Testlet Adaptivity

Direct link

Ozge Ersan Cinar – ProQuest LLC, 2022

In educational tests, a group of questions related to a shared stimulus is called a testlet (e.g., a reading passage with multiple related questions). Use of testlets is very common in educational tests. Additionally, computerized adaptive testing (CAT) is a mode of testing where the test forms are created in real time tailoring to the test…

Descriptors: Test Items, Computer Assisted Testing, Adaptive Testing, Educational Testing

Examining of Internal Consistency Coefficients in Mixed-Format Tests in Different Simulation Conditions

Peer reviewed
PDF on ERIC

Download full text

Gurdil Ege, Hatice; Demir, Ergul – Eurasian Journal of Educational Research, 2020

Purpose: The present study aims to evaluate how the reliabilities computed using a, Stratified a, Angoff-Feldt, and Feldt-Raju estimators may differ when sample size (500, 1000, and 2000) and item type ratio of dichotomous to polytomous items (2:1; 1:1, 1:2) included in the scale are varied. Research Methods: In this study, Cronbach's a,…

Descriptors: Test Format, Simulation, Test Reliability, Sample Size

Diagnostic Classification Model for Forced-Choice Items and Noncognitive Tests

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2023

The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs)…

Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making

Closed Formula of Test Length Required for Adaptive Testing with Medium Probability of Solution

Peer reviewed

Direct link

Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023

Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents and the number of ability levels, this study aims to provide a closed formula for the adaptive tests with medium difficulty (probability of solution is p = 1/2) to determine the accuracy of the parameters for each item and in…

Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Item Order and Speededness: Implications for Test Fairness in Higher Educational High-Stakes Testing

Peer reviewed

Direct link

Becker, Benjamin; van Rijn, Peter; Molenaar, Dylan; Debeer, Dries – Assessment & Evaluation in Higher Education, 2022

A common approach to increase test security in higher educational high-stakes testing is the use of different test forms with identical items but different item orders. The effects of such varied item orders are relatively well studied, but findings have generally been mixed. When multiple test forms with different item orders are used, we argue…

Descriptors: Information Security, High Stakes Tests, Computer Security, Test Items

A Bayesian Random Block Item Response Theory Model for Forced-Choice Formats

Peer reviewed

Direct link

Lee, HyeSun; Smith, Weldon Z. – Educational and Psychological Measurement, 2020

Based on the framework of testlet models, the current study suggests the Bayesian random block item response theory (BRB IRT) model to fit forced-choice formats where an item block is composed of three or more items. To account for local dependence among items within a block, the BRB IRT model incorporated a random block effect into the response…

Descriptors: Bayesian Statistics, Item Response Theory, Monte Carlo Methods, Test Format

Extension of Caution Indices to Mixed-Format Tests

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip – Grantee Submission, 2018

Tatsuoka (1984) suggested several extended caution indices and their standardized versions that have been used as person-fit statistics by researchers such as Drasgow, Levine, and McLaughlin (1987), Glas and Meijer (2003), and Molenaar and Hoijtink (1990). However, these indices are only defined for tests with dichotomous items. This paper extends…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Error Patterns

Parameter Estimation in Rasch Models for Examinee-Selected Items

Peer reviewed

Direct link

Liu, Chen-Wei; Wang, Wen-Chung – Journal of Educational Measurement, 2017

The examinee-selected-item (ESI) design, in which examinees are required to respond to a fixed number of items in a given set of items (e.g., choose one item to respond from a pair of items), always yields incomplete data (i.e., only the selected items are answered and the others have missing data) that are likely nonignorable. Therefore, using…

Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Data Analysis

On Using Simulations to Inform Decision Making during Instrument Development

Peer reviewed

Direct link

Morgan, Grant B.; Moore, Courtney A.; Floyd, Harlee S. – Journal of Psychoeducational Assessment, 2018

Although content validity--how well each item of an instrument represents the construct being measured--is foundational in the development of an instrument, statistical validity is also important to the decisions that are made based on the instrument. The primary purpose of this study is to demonstrate how simulation studies can be used to assist…

Descriptors: Simulation, Decision Making, Test Construction, Validity

Effects of Design Properties on Parameter Estimation in Large-Scale Assessments

Peer reviewed

Direct link

Hecht, Martin; Weirich, Sebastian; Siegle, Thilo; Frey, Andreas – Educational and Psychological Measurement, 2015

The selection of an appropriate booklet design is an important element of large-scale assessments of student achievement. Two design properties that are typically optimized are the "balance" with respect to the positions the items are presented and with respect to the mutual occurrence of pairs of items in the same booklet. The purpose…

Descriptors: Measurement, Computation, Test Format, Test Items

Multistage Computerized Adaptive Testing with Uniform Item Exposure

Peer reviewed

Direct link

Edwards, Michael C.; Flora, David B.; Thissen, David – Applied Measurement in Education, 2012

This article describes a computerized adaptive test (CAT) based on the uniform item exposure multi-form structure (uMFS). The uMFS is a specialization of the multi-form structure (MFS) idea described by Armstrong, Jones, Berliner, and Pashley (1998). In an MFS CAT, the examinee first responds to a small fixed block of items. The items comprising…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Test Items

Previous Page | Next Page »

Pages: 1 | 2 | 3

Pommerich, Mary	3
Nicewander, W. Alan	2
Wang, Wen-Chung	2
Algina, James	1
Ali, Usama S.	1
Becker, Benjamin	1
Binici, Salih	1
Blackwell, Thomas A.	1
Burden, Timothy	1
Chang, Hua-Hua	1
Cuhadar, Ismail	1
De Champlain, Andre	1
DeMars, Christine E.	1
Debeer, Dries	1
Demir, Ergul	1
Edwards, Michael C.	1
Flora, David B.	1
Floyd, Harlee S.	1
Frey, Andreas	1
Frierson, Henry H., Jr.	1
Gelbal, Selahattin	1
Gessaroli, Marc E.	1
Gurdil Ege, Hatice	1
Hecht, Martin	1
More ▼