Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024
To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…
Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement
Dongmei Li; Shalini Kapoor; Ann Arthur; Chi-Yu Huang; YoungWoo Cho; Chen Qiu; Hongling Wang – ACT Education Corp., 2025
Starting in April 2025, ACT will introduce enhanced forms of the ACT® test for national online testing, with a full rollout to all paper and online test takers in national, state and district, and international test administrations by Spring 2026. ACT introduced major updates by changing the test lengths and testing times, providing more time per…
Descriptors: College Entrance Examinations, Testing, Change, Scoring
Erdem-Kara, Basak; Dogan, Nuri – International Journal of Assessment Tools in Education, 2022
Recently, adaptive test approaches have become a viable alternative to traditional fixed-item tests. The main advantage of adaptive tests is that they reach desired measurement precision with fewer items. However, fewer items mean that each item has a more significant effect on ability estimation and therefore those tests are open to more…
Descriptors: Item Analysis, Computer Assisted Testing, Test Items, Test Construction
Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023
Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents and the number of ability levels, this study aims to provide a closed formula for the adaptive tests with medium difficulty (probability of solution is p = 1/2) to determine the accuracy of the parameters for each item and in…
Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level
Benton, Tom – Research Matters, 2021
Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…
Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level
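The adaptive-testing idea Benton describes, tailoring item difficulty to the examinee's current ability estimate, is commonly operationalized by selecting the unused item with maximum Fisher information at the provisional ability estimate. A minimal sketch under the 2PL IRT model (the item pool, parameter values, and function names here are illustrative, not taken from the study):

```python
import math

def p_2pl(theta, a, b):
    """Probability of a correct response under the 2PL IRT model."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def fisher_info(theta, a, b):
    """Item information I(theta) = a^2 * p * (1 - p) for the 2PL model."""
    p = p_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

def pick_next_item(theta_hat, pool, administered):
    """Return the index of the not-yet-administered item with
    maximum information at the current ability estimate."""
    return max((i for i in range(len(pool)) if i not in administered),
               key=lambda i: fisher_info(theta_hat, *pool[i]))

# Hypothetical pool of (discrimination a, difficulty b) pairs:
pool = [(1.0, -1.0), (1.5, 0.0), (0.8, 1.0)]
print(pick_next_item(0.0, pool, set()))  # 1: difficulty matches theta, highest a
```

Because each administered item is maximally informative at the current estimate, fewer items are needed to reach a target measurement precision, which is the shortening benefit the entry refers to.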
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
Lance M. Kruse – ProQuest LLC, 2019
This study explores six item-reduction methodologies used to shorten an existing complex problem-solving non-objective test by evaluating how each shortened form performs across three sources of validity evidence (i.e., test content, internal structure, and relationships with other variables). Two concerns prompted the development of the present…
Descriptors: Educational Assessment, Comparative Analysis, Test Format, Test Length
Isbell, Dan; Winke, Paula – Language Testing, 2019
The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…
Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning
Kabasakal, Kübra Atalay; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2015
This study examines the effect of differential item functioning (DIF) items on test equating through multilevel item response models (MIRMs) and traditional IRMs. The performances of three different equating models were investigated under 24 different simulation conditions, and the variables whose effects were examined included sample size, test…
Descriptors: Test Bias, Equated Scores, Item Response Theory, Simulation
Wu, Yi-Fang – ProQuest LLC, 2015
Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…
Descriptors: Item Response Theory, Test Items, Accuracy, Computation
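The three-parameter logistic (3PL) model named in this entry has a standard closed form: the probability of a correct response is the guessing parameter c plus (1 - c) times a logistic function of ability. A minimal sketch of that formula (parameter values below are illustrative only):

```python
import math

def p_3pl(theta, a, b, c):
    """Probability of a correct response under the 3PL IRT model
    (Birnbaum, 1968): c + (1 - c) / (1 + exp(-a * (theta - b))).
    a = discrimination, b = difficulty, c = lower asymptote (guessing)."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# When ability equals item difficulty (theta = b), the logistic term is 0.5,
# so the probability is midway between c and 1:
print(p_3pl(0.0, 1.2, 0.0, 0.2))  # 0.6
```

Setting c = 0 recovers the 2PL model, and additionally fixing a = 1 gives the Rasch/1PL model, which is why the 3PL is often treated as the general dichotomous case.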
Wang, Wei – ProQuest LLC, 2013
Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…
Descriptors: Equated Scores, Test Format, Test Items, Test Length
Sunnassee, Devdass – ProQuest LLC, 2011
Small sample equating remains a largely unexplored area of research. This study attempts to fill in some of the research gaps via a large-scale, IRT-based simulation study that evaluates the performance of seven small-sample equating methods under various test characteristic and sampling conditions. The equating methods considered are typically…
Descriptors: Test Length, Test Format, Sample Size, Simulation
Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007
Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…
Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models

Silverstein, A. B. – Perceptual and Motor Skills, 1983
Formulas for estimating the validity of random short forms were applied to the standardization data for the Wechsler Adult Intelligence Scale-Revised, the Minnesota Multiphasic Personality Inventory, and the Marlowe-Crowne Social Desirability Scale. These formulas demonstrated how much "better than random" the best short forms of these…
Descriptors: Comparative Analysis, Intelligence Tests, Measures (Individuals), Test Format

Eisenstein, Norman; Engelhart, Charles I. – Psychological Assessment, 1997
The Kaufman Brief Intelligence Test (K-BIT) (A. S. Kaufman and N. L. Kaufman, 1990) was compared with short forms of the Wechsler Adult Intelligence Scale--Revised (WAIS-R) using results from 64 referrals to a neuropsychology service. Advantages of each test are noted and their use discussed. (SLD)
Descriptors: Adults, Comparative Analysis, Intelligence Tests, Neuropsychology