ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	14

Descriptor

Error of Measurement	21
Test Format	21
Test Items	21
Test Construction	10
Item Response Theory	9
Difficulty Level	7
Equated Scores	6
Comparative Analysis	5
Item Analysis	5
Mathematics Tests	5
Simulation	5
Foreign Countries	4
Multiple Choice Tests	4
Scores	4
Test Length	4
Test Reliability	4
Test Validity	4
English (Second Language)	3
Language Tests	3
Statistical Analysis	3
Ability	2
Achievement Tests	2
Bayesian Statistics	2
College Entrance Examinations	2
Correlation	2
More ▼

Source

Applied Measurement in…	2
Educational and Psychological…	2
American Institutes for…	1
Applied Psychological…	1
ETS Research Report Series	1
Education and Information…	1
Grantee Submission	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Language Teaching Research	1
Measurement:…	1
Practical Assessment,…	1
ProQuest LLC	1
More ▼

Publication Type

Reports - Research	15
Journal Articles	14
Reports - Evaluative	4
Speeches/Meeting Papers	2
Dissertations/Theses -…	1
Information Analyses	1
Reports - Descriptive	1

Education Level

Higher Education	4
Postsecondary Education	4
Elementary Education	2
Grade 4	2
Early Childhood Education	1
Grade 3	1
Grade 8	1
High Schools	1
Intermediate Grades	1
Middle Schools	1
Primary Education	1
Secondary Education	1
More ▼

Audience

Location

Iran	1
Japan	1
Saudi Arabia	1
United Kingdom (Wales)	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Advanced Placement…	1
National Assessment of…	1
SAT (College Admission Test)	1
Test of English as a Foreign…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Direct link

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

Impacts of Differences in Group Abilities and Anchor Test Features on Three Non-IRT Test Equating Methods

Peer reviewed
PDF on ERIC

Download full text

Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024

The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Postratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…

Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

The Effect of Item Form on Estimating Person's Ability, Item Parameters, and Information Function According to Item Response Theory (IRT)

Peer reviewed
PDF on ERIC

Download full text

ALKursheh, Taha Okleh; Al-zboon, Habis Saad; AlNasraween, Mo'en Salman – International Journal of Instruction, 2022

This study aimed at comparing the effect of two test item formats (multiple-choice and complete) on estimating person's ability, item parameters and the test information function (TIF).To achieve the aim of the study, two format of mathematics(1) test have been created: multiple-choice and complete, In its final format consisted of (31) items. The…

Descriptors: Comparative Analysis, Test Items, Item Response Theory, Test Format

A Bayesian Random Block Item Response Theory Model for Forced-Choice Formats

Peer reviewed

Direct link

Lee, HyeSun; Smith, Weldon Z. – Educational and Psychological Measurement, 2020

Based on the framework of testlet models, the current study suggests the Bayesian random block item response theory (BRB IRT) model to fit forced-choice formats where an item block is composed of three or more items. To account for local dependence among items within a block, the BRB IRT model incorporated a random block effect into the response…

Descriptors: Bayesian Statistics, Item Response Theory, Monte Carlo Methods, Test Format

FIPC Linking across Multidimensional Test Forms: Effects of Confounding Difficulty within Dimensions

Peer reviewed

Direct link

Kim, Sohee; Cole, Ki Lynn; Mwavita, Mwarumba – International Journal of Testing, 2018

This study investigated the effects of linking potentially multidimensional test forms using the fixed item parameter calibration. Forms had equal or unequal total test difficulty with and without confounding difficulty. The mean square errors and bias of estimated item and ability parameters were compared across the various confounding tests. The…

Descriptors: Test Items, Item Response Theory, Test Format, Difficulty Level

On the Dimensionality of Reading Comprehension Tests Composed of Text Comprehension Items and Cloze Test Items

Peer reviewed
PDF on ERIC

Download full text

Sheybani, Elias; Zeraatpishe, Mitra – International Journal of Language Testing, 2018

Test method is deemed to affect test scores along with examinee ability (Bachman, 1996). In this research the role of method facet in reading comprehension tests is studied. Bachman divided method facet into five categories, one category is the nature of input and the nature of expected response. This study examined the role of method effect in…

Descriptors: Reading Comprehension, Reading Tests, Test Items, Test Format

Psychometric Report for the Early Fractions Test (Version 2.2) Administered with Third- and Fourth-Grade Students in Spring 2017. Research Report No. 2017-11

Download full text

Schoen, Robert C.; Yang, Xiaotong; Liu, Sicong; Paek, Insu – Grantee Submission, 2017

The Early Fractions Test v2.2 is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test v2.2 is to serve as a measure of student outcomes in a randomized trial designed to estimate the effect of an educational…

Descriptors: Psychometrics, Mathematics Tests, Mathematics Achievement, Fractions

Exploring Alternative Test Form Linking Designs with Modified Equating Sample Size and Anchor Test Length. Research Report. ETS RR-13-02

Peer reviewed
PDF on ERIC

Download full text

Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013

The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…

Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation

Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

Direct link

Wang, Wei – ProQuest LLC, 2013

Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…

Descriptors: Equated Scores, Test Format, Test Items, Test Length

The Creation and Validation of a Listening Vocabulary Levels Test

Peer reviewed

Direct link

McLean, Stuart; Kramer, Brandon; Beglar, David – Language Teaching Research, 2015

An important gap in the field of second language vocabulary assessment concerns the lack of validated tests measuring aural vocabulary knowledge. The primary purpose of this study is to introduce and provide preliminary validity evidence for the Listening Vocabulary Levels Test (LVLT), which has been designed as a diagnostic tool to measure…

Descriptors: Test Construction, Test Validity, English (Second Language), Second Language Learning

Study of the Feasibility of a NAEP Mathematics Accessible Block Alternative

Download full text

DeStefano, Lizanne; Johnson, Jeremiah – American Institutes for Research, 2013

This paper describes one of the first efforts by the National Assessment of Educational Progress (NAEP) to improve measurement at the lower end of the distribution, including measurement for students with disabilities (SD) and English language learners (ELLs). One way to improve measurement at the lower end is to introduce one or more…

Descriptors: National Competency Tests, Measures (Individuals), Disabilities, English Language Learners

On Bias in Linear Observed-Score Equating

Peer reviewed

Direct link

van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010

The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…

Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends

Creating IRT-Based Parallel Test Forms Using the Genetic Algorithm Method

Peer reviewed

Direct link

Sun, Koun-Tem; Chen, Yu-Jen; Tsai, Shu-Yen; Cheng, Chien-Fen – Applied Measurement in Education, 2008

In educational measurement, the construction of parallel test forms is often a combinatorial optimization problem that involves the time-consuming selection of items to construct tests having approximately the same test information functions (TIFs) and constraints. This article proposes a novel method, genetic algorithm (GA), to construct parallel…

Descriptors: Test Format, Measurement Techniques, Equations (Mathematics), Item Response Theory

Checking the Equivalence of Nearly Identical Test Editions.

Download full text

Dorans, Neil J.; Lawrence, Ida M. – 1988

A procedure for checking the score equivalence of nearly identical editions of a test is described. The procedure employs the standard error of equating (SEE) and utilizes graphical representation of score conversion deviation from the identity function in standard error units. Two illustrations of the procedure involving Scholastic Aptitude Test…

Descriptors: Equated Scores, Error of Measurement, Test Construction, Test Format

Previous Page | Next Page »

Pages: 1 | 2

ALKursheh, Taha Okleh	1
Al-zboon, Habis Saad	1
AlNasraween, Mo'en Salman	1
Beglar, David	1
Catts, Ralph M.	1
Chen, Yu-Jen	1
Cheng, Chien-Fen	1
Cole, Ki Lynn	1
Colton, Dean A.	1
DeStefano, Lizanne	1
Dorans, Neil J.	1
Gelbal, Selahattin	1
Hanson, Bradley A.	1
Henning, Grant	1
Henriksen, L. W.	1
Inga Laukaityte	1
Johnson, Jeremiah	1
Kim, Sohee	1
Kramer, Brandon	1
Lawrence, Ida M.	1
Lee, HyeSun	1
Lee, Yi-Hsuan	1
Liu, Sicong	1
Lixin Yuan	1
More ▼