ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	7

Descriptor

Item Response Theory	13
Test Format	13
Testing Programs	13
Equated Scores	5
Test Items	5
Comparative Analysis	3
Context Effect	3
Elementary Secondary Education	3
Foreign Countries	3
State Programs	3
Test Construction	3
Achievement Tests	2
College Entrance Examinations	2
Difficulty Level	2
French	2
Grade 8	2
International Programs	2
Language Tests	2
Mathematics Tests	2
National Programs	2
Reading Achievement	2
Reading Comprehension	2
Reading Tests	2
Sample Size	2
Scaling	2
More ▼

Source

Applied Psychological…	2
Journal of Educational…	2
Applied Measurement in…	1
ETS Research Report Series	1
Educational Measurement:…	1
Educational and Psychological…	1
Language Testing	1
Pearson	1

Publication Type

Reports - Research	10
Journal Articles	9
Speeches/Meeting Papers	4
Reports - Evaluative	2
Collected Works - General	1
Collected Works - Serials	1
Numerical/Quantitative Data	1

Education Level

Secondary Education	2
Elementary Education	1
Elementary Secondary Education	1
Grade 8	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1

Audience

Location

Australia	1
Canada	1
Hong Kong	1
Illinois	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Graduate Record Examinations	1
National Assessment of…	1
North Carolina End of Course…	1
Program for International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Impact of Accumulated Error on Item Response Theory Pre-Equating with Mixed Format Tests

Peer reviewed

Direct link

Keller, Lisa A.; Keller, Robert; Cook, Robert J.; Colvin, Kimberly F. – Applied Measurement in Education, 2016

The equating of tests is an essential process in high-stakes, large-scale testing conducted over multiple forms or administrations. By adjusting for differences in difficulty and placing scores from different administrations of a test on a common scale, equating allows scores from these different forms and administrations to be directly compared…

Descriptors: Item Response Theory, Equated Scores, Test Format, Testing Programs

Item Response Theory Models for Wording Effects in Mixed-Format Scales

Peer reviewed

Direct link

Wang, Wen-Chung; Chen, Hui-Fang; Jin, Kuan-Yu – Educational and Psychological Measurement, 2015

Many scales contain both positively and negatively worded items. Reverse recoding of negatively worded items might not be enough for them to function as positively worded items do. In this study, we commented on the drawbacks of existing approaches to wording effect in mixed-format scales and used bi-factor item response theory (IRT) models to…

Descriptors: Item Response Theory, Test Format, Language Usage, Test Items

Exploring Alternative Test Form Linking Designs with Modified Equating Sample Size and Anchor Test Length. Research Report. ETS RR-13-02

Peer reviewed
PDF on ERIC

Download full text

Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013

The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…

Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation

Multilevel Modeling of Item Position Effects

Peer reviewed

Direct link

Albano, Anthony D. – Journal of Educational Measurement, 2013

In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…

Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques

The Potential Impact of Not Being Able to Create Parallel Tests on Expected Classification Accuracy

Peer reviewed

Direct link

Wyse, Adam E. – Applied Psychological Measurement, 2011

In many practical testing situations, alternate test forms from the same testing program are not strictly parallel to each other and instead the test forms exhibit small psychometric differences. This article investigates the potential practical impact that these small psychometric differences can have on expected classification accuracy. Ten…

Descriptors: Test Format, Test Construction, Testing Programs, Psychometrics

Do Questions Written in the Target Language Make Foreign Language Listening Comprehension Tests More Difficult?

Peer reviewed

Direct link

Filipi, Anna – Language Testing, 2012

The Assessment of Language Competence (ALC) certificates is an annual, international testing program developed by the Australian Council for Educational Research to test the listening and reading comprehension skills of lower to middle year levels of secondary school. The tests are developed for three levels in French, German, Italian and…

Descriptors: Listening Comprehension Tests, Item Response Theory, Statistical Analysis, Foreign Countries

The Impact of Item Position Change on Item Parameters and Common Equating Results under the 3PL Model

Direct link

Meyers, Jason L.; Murphy, Stephen; Goodman, Joshua; Turhan, Ahmet – Pearson, 2012

Operational testing programs employing item response theory (IRT) applications benefit from of the property of item parameter invariance whereby item parameter estimates obtained from one sample can be applied to other samples (when the underlying assumptions are satisfied). In theory, this feature allows for applications such as computer-adaptive…

Descriptors: Equated Scores, Test Items, Test Format, Item Response Theory

A Comparison of Developmental Scales Based on Thurstone Methods and Item Response Theory.

Peer reviewed

Williams, Valerie S. L.; Pommerich, Mary; Thissen, David – Journal of Educational Measurement, 1998

Created a developmental scale for the North Carolina End-of-Grade Mathematics Tests using a subset of identical test forms administered to adjacent grade levels with Thurstone scaling and Item Response Theory methods. Discusses differences in patterns produced. (Author/SLD)

Descriptors: Achievement Tests, Child Development, Comparative Analysis, Elementary Secondary Education

Curriculum and Translation Differential Item Functioning: A Comparison of Two DIF Detection Techniques.

Download full text

Emenogu, Barnabas; Childs, Ruth A. – 2003

This study investigated the possible impacts of language and curriculum differences on the performance of test items by subpopulations of students. Focusing on Measurement and Geometry items completed by students in French- and English-language schools in Ontario made it possible to explore the differences and to compare the item response theory…

Descriptors: Curriculum, English, Foreign Countries, French

Effects of Passage and Item Scrambling on Equating Relationships.

Peer reviewed

Harris, Deborah J. – Applied Psychological Measurement, 1991

Effects of passage and item-scrambling on equipercentile and item-response theory equating were investigated using 2 scrambled versions of the American College Testing Program Assessment for approximately 25,000 examinees. Results indicate that using a base-form conversion table with a scrambled form affects the individual examinee level. (SLD)

Descriptors: College Entrance Examinations, Comparative Testing, Context Effect, Equated Scores

Effects of Item Order and Context on Estimation of NAEP Reading Proficiency.

Peer reviewed

Zwick, Rebecca – Educational Measurement: Issues and Practice, 1991

Item parameter estimates derived through item response theory methods have been considered relatively robust to changes in item position and context, but the anomaly in reading scores from the 1986 National Assessment of Educational Progress (NAEP) illustrates problems with common population equating procedures when there are test form changes.…

Descriptors: Achievement Tests, Context Effect, Equated Scores, Estimation (Mathematics)

An Examination of the Influence of Expository and Narrative Passages on the Dimensionality of the IGAP Reading Test.

Download full text

Bolt, Daniel; Ackerman, Terry – 1994

The 1993 Illinois Goal Assessment Program (IGAP) Reading Tests measured reading comprehension using both narrative and expository reading passages. Noticeable differences in mean scaled scores occurred depending on whether the 1993 results were equated back to the 1992 narrative test or the 1993 expository test (Hsu and Ackerman, 1994). In an…

Descriptors: Achievement, Context Effect, Correlation, Educational Objectives

Current Developments in Language Testing. Anthology Series 25.

Download full text

Anivan, Sarinee, Ed. – 1991

The selection of papers on language testing includes: "Language Testing in the 1990s: How Far Have We Come? How Much Further Have We To Go?" (J. Charles Alderson); "Current Research/Development in Language Testing" (John W. Oller, Jr.); "The Difficulties of Difficulty: Prompts in Writing Assessment" (Liz Hamp-Lyons,…

Descriptors: Communicative Competence (Languages), Comparative Analysis, Computer Assisted Testing, Cues

Ackerman, Terry	1
Albano, Anthony D.	1
Anivan, Sarinee, Ed.	1
Bolt, Daniel	1
Chen, Hui-Fang	1
Childs, Ruth A.	1
Colvin, Kimberly F.	1
Cook, Robert J.	1
Emenogu, Barnabas	1
Filipi, Anna	1
Goodman, Joshua	1
Harris, Deborah J.	1
Jin, Kuan-Yu	1
Keller, Lisa A.	1
Keller, Robert	1
Lee, Yi-Hsuan	1
Meyers, Jason L.	1
Murphy, Stephen	1
Pommerich, Mary	1
Qian, Jiahe	1
Thissen, David	1
Turhan, Ahmet	1
Wang, Lin	1
Wang, Wen-Chung	1
Williams, Valerie S. L.	1
More ▼