Showing all 9 results
Peer reviewed
Direct link
Natalja Menold; Vera Toepoel – Sociological Methods & Research, 2024
Research on mixed devices in web surveys is in its infancy. Using a randomized experiment, we investigated device effects (desktop PC, tablet, and mobile phone) for six response formats and four different numbers of scale points. N = 5,077 members of an online access panel participated in the experiment. An exact test of measurement invariance and…
Descriptors: Online Surveys, Handheld Devices, Telecommunications, Test Reliability
Peer reviewed
Direct link
Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023
Careful consideration is necessary when choosing an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…
Descriptors: Test Format, Equated Scores, Best Practices, Test Construction
Peer reviewed
Direct link
Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024
To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…
Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement
Peer reviewed
PDF on ERIC: Download full text
Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024
The overall aim was to examine the effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE), using both real and simulated data. Chained kernel equating, poststratification kernel equating, and circle-arc equating were studied. A college admissions test with four different…
Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests
Peer reviewed
PDF on ERIC: Download full text
Uysal, Ibrahim; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021
Scoring constructed-response items can be highly difficult, time-consuming, and costly in practice. Improvements in computer technology have enabled automated scoring of constructed-response items. However, applying automated scoring without investigating test equating can lead to serious problems. The goal of this study was to…
Descriptors: Computer Assisted Testing, Scoring, Item Response Theory, Test Format
Takehiro Iizuka – ProQuest LLC, 2024
This study examined the significance of the mode of delivery (aural versus written) in second language (L2) vocabulary knowledge and L2 comprehension skills. One of the unique aspects of listening comprehension that sets it apart from reading comprehension is the mode of delivery: language input is delivered not visually but aurally. Somewhat…
Descriptors: Reading Comprehension, Listening Comprehension, Language Skills, Error of Measurement
Peer reviewed
Direct link
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
Computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. Multidimensional CAT (MCAT) designs differ in the item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
Peer reviewed
PDF on ERIC: Download full text
ALKursheh, Taha Okleh; Al-zboon, Habis Saad; AlNasraween, Mo'en Salman – International Journal of Instruction, 2022
This study aimed at comparing the effect of two test item formats (multiple-choice and completion) on estimating examinees' ability, item parameters, and the test information function (TIF). To achieve this aim, two formats of a Mathematics (1) test were created: multiple-choice and completion. In its final form, the test consisted of 31 items. The…
Descriptors: Comparative Analysis, Test Items, Item Response Theory, Test Format
Peer reviewed
Direct link
Karakolidis, Anastasios; O'Leary, Michael; Scully, Darina – International Journal of Testing, 2021
The linguistic complexity of many text-based tests can be a source of construct-irrelevant variance, as test-takers' performance may be affected by factors beyond the focus of the assessment itself, such as reading comprehension skills. This experimental study examined the extent to which the use of animated videos, as opposed to written…
Descriptors: Animation, Vignettes, Video Technology, Test Format