Showing 1 to 15 of 23 results
Peer reviewed
PDF on ERIC
Tom Benton – Practical Assessment, Research & Evaluation, 2025
This paper proposes an extension of linear equating that may be useful in one of two fairly common assessment scenarios. One is where different students have taken different combinations of test forms. This might occur, for example, where students have some free choice over the exam papers they take within a particular qualification. In this…
Descriptors: Equated Scores, Test Format, Test Items, Computation
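The linear equating that Benton's extension builds on can be sketched in its standard textbook form (this is not the paper's proposed method, and the score data below are made up for illustration): form-X scores are mapped onto the form-Y scale by matching means and standard deviations.

```python
import statistics

def linear_equate(scores_x, scores_y):
    """Return a function mapping form-X scores onto the form-Y scale.

    Classical linear equating matches the first two moments:
    y(x) = mu_y + (sigma_y / sigma_x) * (x - mu_x).
    """
    mu_x, mu_y = statistics.mean(scores_x), statistics.mean(scores_y)
    sd_x, sd_y = statistics.pstdev(scores_x), statistics.pstdev(scores_y)
    return lambda x: mu_y + (sd_y / sd_x) * (x - mu_x)

# Toy example: form X is harder (lower mean), so X scores adjust upward.
form_x = [10, 12, 14, 16, 18]
form_y = [12, 14, 16, 18, 20]
equate = linear_equate(form_x, form_y)
print(equate(14))  # midpoint of X maps to midpoint of Y: 16.0
```

The scenario the abstract describes (different students taking different combinations of forms) requires extending this basic two-form setup, which is what the paper addresses.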
Jiajing Huang – ProQuest LLC, 2022
The nonequivalent-groups anchor-test (NEAT) data-collection design is commonly used in large-scale assessments. Under this design, different test groups take different test forms. Each test form has its own unique items and all test forms share a set of common items. If item response theory (IRT) models are applied to analyze the test data, the…
Descriptors: Item Response Theory, Test Format, Test Items, Test Construction
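The NEAT design the abstract refers to can be illustrated with a minimal, entirely hypothetical data layout: each group takes its own form's unique items plus a shared anchor set, and the anchor is the only direct link for separating group-ability differences from form-difficulty differences.

```python
# Hypothetical NEAT-design data: item proportion-correct values per group.
# Each group takes its own form, but both forms embed anchors A1..A3.
group_p = {  # took Form X: unique items X1..X3 + anchor A1..A3
    "unique": {"X1": 0.72, "X2": 0.65, "X3": 0.58},
    "anchor": {"A1": 0.70, "A2": 0.62, "A3": 0.55},
}
group_q = {  # took Form Y: unique items Y1..Y3 + the same anchor
    "unique": {"Y1": 0.80, "Y2": 0.74, "Y3": 0.66},
    "anchor": {"A1": 0.78, "A2": 0.71, "A3": 0.63},
}

def anchor_mean(group):
    """Mean proportion correct on the common (anchor) items."""
    vals = group["anchor"].values()
    return sum(vals) / len(vals)

# The anchor items are administered to both groups, so the gap in anchor
# performance estimates how much the groups differ in ability, which
# equating must disentangle from differences in form difficulty.
ability_gap = anchor_mean(group_q) - anchor_mean(group_p)
print(round(ability_gap, 3))  # → 0.083
```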
Peer reviewed
PDF on ERIC
Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024
The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Postratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…
Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests
Peer reviewed
Direct link
Lim, Euijin; Lee, Won-Chan – Applied Measurement in Education, 2020
The purpose of this study is to address the necessity of subscore equating and to evaluate the performance of various equating methods for subtests. Assuming the random groups design and number-correct scoring, this paper analyzed real data and simulated data with four study factors including test dimensionality, subtest length, form difference in…
Descriptors: Equated Scores, Test Length, Test Format, Difficulty Level
Wang, Lu; Steedle, Jeffrey – ACT, Inc., 2020
In recent ACT mode comparability studies, students testing on laptop or desktop computers earned slightly higher scores on average than students who tested on paper, especially on the ACT® reading and English tests (Li et al., 2017). Equating procedures adjust for such "mode effects" to make ACT scores comparable regardless of testing…
Descriptors: Test Format, Reading Tests, Language Tests, English
Peer reviewed
PDF on ERIC
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
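The IRT true-score equating (TSE) named in this abstract can be sketched as follows, using hypothetical 2PL item parameters assumed to be already on a common scale (the study itself compares scale-linking methods; this sketch takes that step as given): a form-X score is traced back to the ability theta where form X's test characteristic curve (TCC) equals that score, and the equated value is form Y's TCC at the same theta.

```python
import math

# Hypothetical 2PL item parameters (a = discrimination, b = difficulty),
# assumed already linked to a common scale.
form_x = [(1.0, -0.5), (1.2, 0.0), (0.8, 0.5)]
form_y = [(0.9, -0.3), (1.1, 0.2), (1.0, 0.6)]

def true_score(theta, items):
    """Test characteristic curve: expected number-correct score at theta."""
    return sum(1.0 / (1.0 + math.exp(-a * (theta - b))) for a, b in items)

def tse(x_score, items_x, items_y, lo=-6.0, hi=6.0):
    """Equate a form-X true score to the form-Y scale: solve
    TCC_X(theta) = x_score by bisection (TCCs are monotone in theta),
    then evaluate TCC_Y at that theta."""
    for _ in range(60):
        mid = (lo + hi) / 2.0
        if true_score(mid, items_x) < x_score:
            lo = mid
        else:
            hi = mid
    return true_score((lo + hi) / 2.0, items_y)

print(round(tse(1.5, form_x, form_y), 2))
```

Observed-score equating (OSE), the other method in the study, instead equates the model-implied observed-score distributions rather than inverting the TCC.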
Peer reviewed
PDF on ERIC
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
Duong, Minh Quang – ProQuest LLC, 2011
Testing programs often use multiple test forms of the same test to control item exposure and to ensure test security. Although test forms are constructed to be as similar as possible, they often differ. Test equating techniques are statistical methods used to adjust scores obtained on different test forms of the same test so that they are…
Descriptors: Equated Scores, Statistical Analysis, Item Response Theory, Evaluation Criteria
Walker, Michael E.; Kim, Sooyeon – Educational Testing Service, 2010
This study examined the use of an all multiple-choice (MC) anchor for linking mixed format tests containing both MC and constructed-response (CR) items, in a nonequivalent groups design. An MC-only anchor could effectively link two such test forms if either (a) the MC and CR portions of the test measured the same construct, so that the MC anchor…
Descriptors: Equated Scores, Test Format, Multiple Choice Tests, Statistical Analysis
Tian, Feng – ProQuest LLC, 2011
There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…
Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis
Peer reviewed
Direct link
Steinmetz, Jean-Paul; Brunner, Martin; Loarer, Even; Houssemand, Claude – Psychological Assessment, 2010
The Wisconsin Card Sorting Test (WCST) assesses executive and frontal lobe function and can be administered manually or by computer. Despite the widespread application of the 2 versions, the psychometric equivalence of their scores has rarely been evaluated and only a limited set of criteria has been considered. The present experimental study (N =…
Descriptors: Computer Assisted Testing, Psychometrics, Test Theory, Scores
Powers, Sonya; Turhan, Ahmet; Binici, Salih – Pearson, 2012
The population sensitivity of vertical scaling results was evaluated for a state reading assessment spanning grades 3-10 and a state mathematics test spanning grades 3-8. Subpopulations considered included males and females. The 3-parameter logistic model was used to calibrate math and reading items and a common item design was used to construct…
Descriptors: Scaling, Equated Scores, Standardized Tests, Reading Tests
Hendrickson, Amy; Patterson, Brian; Ewing, Maureen – College Board, 2010
The psychometric considerations and challenges associated with including constructed response items on tests are discussed along with how these issues affect the form assembly specifications for mixed-format exams. Reliability and validity, security and fairness, pretesting, content and skills coverage, test length and timing, weights, statistical…
Descriptors: Multiple Choice Tests, Test Format, Test Construction, Test Validity
Peer reviewed
PDF on ERIC
Liu, Jinghua; Zhu, Xiaowen – ETS Research Report Series, 2008
The purpose of this paper is to explore methods to approximate population invariance without conducting multiple linkings for subpopulations. Under the single group or equivalent groups design, no linking needs to be performed for the parallel-linear system linking functions. The unequated raw score information can be used as an approximation. For…
Descriptors: Raw Scores, Test Format, Comparative Analysis, Test Construction
Peer reviewed
PDF on ERIC
Liao, Chi-Wen; Livingston, Samuel A. – ETS Research Report Series, 2008
Randomly equivalent forms (REF) of tests in listening and reading for nonnative speakers of English were created by stratified random assignment of items to forms, stratifying on item content and predicted difficulty. The study included 50 replications of the procedure for each test. Each replication generated 2 REFs. The equivalence of those 2…
Descriptors: Equated Scores, Item Analysis, Test Items, Difficulty Level