ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	11
Since 2006 (last 20 years)	21

Descriptor

Error of Measurement	26
Item Response Theory	26
Test Format	26
Equated Scores	10
Comparative Analysis	9
Test Items	9
Simulation	8
Models	6
Test Construction	6
Item Analysis	5
Accuracy	4
Computer Assisted Testing	4
Evaluation Methods	4
Mathematics Tests	4
Scores	4
Test Theory	4
Correlation	3
Difficulty Level	3
Elementary School Students	3
Monte Carlo Methods	3
Multiple Choice Tests	3
Psychometrics	3
Test Reliability	3
True Scores	3
Ability	2
More ▼

Source

Applied Measurement in…	3
Educational and Psychological…	3
ETS Research Report Series	2
International Journal of…	2
Journal of Educational…	2
Journal of Educational and…	2
Measurement:…	2
ProQuest LLC	2
Applied Psychological…	1
Grantee Submission	1
International Journal of…	1
International Journal of…	1
More ▼

Publication Type

Journal Articles	19
Reports - Research	17
Reports - Evaluative	5
Speeches/Meeting Papers	4
Dissertations/Theses -…	2
Reports - Descriptive	2

Education Level

Early Childhood Education	1
Elementary Education	1
Grade 3	1
Grade 4	1
Higher Education	1
Intermediate Grades	1
Postsecondary Education	1
Primary Education	1

Audience

Location

Saudi Arabia	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

Iowa Tests of Basic Skills

What Works Clearinghouse Rating

Showing 1 to 15 of 26 results Save | Export

Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design

Peer reviewed

Direct link

Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023

Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…

Descriptors: Test Format, Equated Scores, Best Practices, Test Construction

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Direct link

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

Automated Essay Scoring Effect on Test Equating Errors in Mixed-Format Test

Peer reviewed
PDF on ERIC

Download full text

Uysal, Ibrahim; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021

Scoring constructed-response items can be highly difficult, time-consuming, and costly in practice. Improvements in computer technology have enabled automated scoring of constructed-response items. However, the application of automated scoring without an investigation of test equating can lead to serious problems. The goal of this study was to…

Descriptors: Computer Assisted Testing, Scoring, Item Response Theory, Test Format

IRT Approaches to Modeling Scores on Mixed-Format Tests

Peer reviewed

Direct link

Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020

This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…

Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests

Efficient Standard Errors in Item Response Theory Models for Short Tests

Peer reviewed

Direct link

Ippel, Lianne; Magis, David – Educational and Psychological Measurement, 2020

In dichotomous item response theory (IRT) framework, the asymptotic standard error (ASE) is the most common statistic to evaluate the precision of various ability estimators. Easy-to-use ASE formulas are readily available; however, the accuracy of some of these formulas was recently questioned and new ASE formulas were derived from a general…

Descriptors: Item Response Theory, Error of Measurement, Accuracy, Standards

The Effect of Item Form on Estimating Person's Ability, Item Parameters, and Information Function According to Item Response Theory (IRT)

Peer reviewed
PDF on ERIC

Download full text

ALKursheh, Taha Okleh; Al-zboon, Habis Saad; AlNasraween, Mo'en Salman – International Journal of Instruction, 2022

This study aimed at comparing the effect of two test item formats (multiple-choice and complete) on estimating person's ability, item parameters and the test information function (TIF).To achieve the aim of the study, two format of mathematics(1) test have been created: multiple-choice and complete, In its final format consisted of (31) items. The…

Descriptors: Comparative Analysis, Test Items, Item Response Theory, Test Format

A Bayesian Random Block Item Response Theory Model for Forced-Choice Formats

Peer reviewed

Direct link

Lee, HyeSun; Smith, Weldon Z. – Educational and Psychological Measurement, 2020

Based on the framework of testlet models, the current study suggests the Bayesian random block item response theory (BRB IRT) model to fit forced-choice formats where an item block is composed of three or more items. To account for local dependence among items within a block, the BRB IRT model incorporated a random block effect into the response…

Descriptors: Bayesian Statistics, Item Response Theory, Monte Carlo Methods, Test Format

FIPC Linking across Multidimensional Test Forms: Effects of Confounding Difficulty within Dimensions

Peer reviewed

Direct link

Kim, Sohee; Cole, Ki Lynn; Mwavita, Mwarumba – International Journal of Testing, 2018

This study investigated the effects of linking potentially multidimensional test forms using the fixed item parameter calibration. Forms had equal or unequal total test difficulty with and without confounding difficulty. The mean square errors and bias of estimated item and ability parameters were compared across the various confounding tests. The…

Descriptors: Test Items, Item Response Theory, Test Format, Difficulty Level

Development of a Computerized Adaptive Version of the Turkish Driving Licence Exam

Peer reviewed
PDF on ERIC

Download full text

Cikrikci, Nukhet; Yalcin, Seher; Kalender, Ilker; Gul, Emrah; Ayan, Cansu; Uyumaz, Gizem; Sahin-Kursad, Merve; Kamis, Omer – International Journal of Assessment Tools in Education, 2020

This study tested the applicability of the theoretical Examination for Candidates of Driving License (ECODL) in Turkey as a computerized adaptive test (CAT). Firstly, various simulation conditions were tested for the live CAT through an item response theory-based calibrated item bank. The application of the simulated CAT was based on data from…

Descriptors: Motor Vehicles, Traffic Safety, Computer Assisted Testing, Item Response Theory

An Extension of IRT-Based Equating to the Dichotomous Testlet Response Theory Model

Peer reviewed

Direct link

Tao, Wei; Cao, Yi – Applied Measurement in Education, 2016

Current procedures for equating number-correct scores using traditional item response theory (IRT) methods assume local independence. However, when tests are constructed using testlets, one concern is the violation of the local item independence assumption. The testlet response theory (TRT) model is one way to accommodate local item dependence.…

Descriptors: Item Response Theory, Equated Scores, Test Format, Models

Psychometric Report for the Early Fractions Test (Version 2.2) Administered with Third- and Fourth-Grade Students in Spring 2017. Research Report No. 2017-11

Download full text

Schoen, Robert C.; Yang, Xiaotong; Liu, Sicong; Paek, Insu – Grantee Submission, 2017

The Early Fractions Test v2.2 is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test v2.2 is to serve as a measure of student outcomes in a randomized trial designed to estimate the effect of an educational…

Descriptors: Psychometrics, Mathematics Tests, Mathematics Achievement, Fractions

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

Exploring Alternative Test Form Linking Designs with Modified Equating Sample Size and Anchor Test Length. Research Report. ETS RR-13-02

Peer reviewed
PDF on ERIC

Download full text

Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013

The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…

Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation

Assessing First- and Second-Order Equity for the Common-Item Nonequivalent Groups Design Using Multidimensional IRT

Direct link

Andrews, Benjamin James – ProQuest LLC, 2011

The equity properties can be used to assess the quality of an equating. The degree to which expected scores conditional on ability are similar between test forms is referred to as first-order equity. Second-order equity is the degree to which conditional standard errors of measurement are similar between test forms after equating. The purpose of…

Descriptors: Test Format, Advanced Placement, Simulation, True Scores

Data Collection Design for Equivalent Groups Equating: Using a Matrix Stratification Framework for Mixed-Format Assessment

Direct link

Mbella, Kinge Keka – ProQuest LLC, 2012

Mixed-format assessments are increasingly being used in large scale standardized assessments to measure a continuum of skills ranging from basic recall to higher order thinking skills. These assessments are usually comprised of a combination of (a) multiple-choice items which can be efficiently scored, have stable psychometric properties, and…

Descriptors: Educational Assessment, Test Format, Evaluation Methods, Multiple Choice Tests

Previous Page | Next Page »

Pages: 1 | 2

Sykes, Robert C.	2
van der Linden, Wim J.	2
ALKursheh, Taha Okleh	1
Al-zboon, Habis Saad	1
AlNasraween, Mo'en Salman	1
Andrews, Benjamin James	1
Ayan, Cansu	1
Cao, Yi	1
Chen, Yu-Jen	1
Cheng, Chien-Fen	1
Choi, Jiwon	1
Cikrikci, Nukhet	1
Cole, Ki Lynn	1
Cui, Zhongmin	1
Dogan, Nuri	1
Fitzpatrick, Steven J.	1
Griffith, William D.	1
Gul, Emrah	1
Hanson, Brad	1
He, Yong	1
Hou, Liling	1
Ippel, Lianne	1
Kalender, Ilker	1
Kamata, Akihito	1
Kamis, Omer	1
More ▼