Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 7 |
Descriptor
Item Response Theory | 13 |
Test Format | 13 |
Testing Programs | 13 |
Equated Scores | 5 |
Test Items | 5 |
Comparative Analysis | 3 |
Context Effect | 3 |
Elementary Secondary Education | 3 |
Foreign Countries | 3 |
State Programs | 3 |
Test Construction | 3 |
Source
Applied Psychological… | 2 |
Journal of Educational… | 2 |
Applied Measurement in… | 1 |
ETS Research Report Series | 1 |
Educational Measurement:… | 1 |
Educational and Psychological… | 1 |
Language Testing | 1 |
Pearson | 1 |
Author
Ackerman, Terry | 1 |
Albano, Anthony D. | 1 |
Anivan, Sarinee, Ed. | 1 |
Bolt, Daniel | 1 |
Chen, Hui-Fang | 1 |
Childs, Ruth A. | 1 |
Colvin, Kimberly F. | 1 |
Cook, Robert J. | 1 |
Emenogu, Barnabas | 1 |
Filipi, Anna | 1 |
Goodman, Joshua | 1 |
Publication Type
Reports - Research | 10 |
Journal Articles | 9 |
Speeches/Meeting Papers | 4 |
Reports - Evaluative | 2 |
Collected Works - General | 1 |
Collected Works - Serials | 1 |
Numerical/Quantitative Data | 1 |
Education Level
Secondary Education | 2 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 8 | 1 |
Higher Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Postsecondary Education | 1 |
Location
Australia | 1 |
Canada | 1 |
Hong Kong | 1 |
Illinois | 1 |
United States | 1 |
Assessments and Surveys
ACT Assessment | 1 |
Graduate Record Examinations | 1 |
National Assessment of… | 1 |
North Carolina End of Course… | 1 |
Program for International… | 1 |
Trends in International… | 1 |
Keller, Lisa A.; Keller, Robert; Cook, Robert J.; Colvin, Kimberly F. – Applied Measurement in Education, 2016
The equating of tests is an essential process in high-stakes, large-scale testing conducted over multiple forms or administrations. By adjusting for differences in difficulty and placing scores from different administrations of a test on a common scale, equating allows scores from these different forms and administrations to be directly compared…
Descriptors: Item Response Theory, Equated Scores, Test Format, Testing Programs
Wang, Wen-Chung; Chen, Hui-Fang; Jin, Kuan-Yu – Educational and Psychological Measurement, 2015
Many scales contain both positively and negatively worded items. Reverse recoding of negatively worded items might not be enough for them to function as positively worded items do. In this study, we commented on the drawbacks of existing approaches to wording effect in mixed-format scales and used bi-factor item response theory (IRT) models to…
Descriptors: Item Response Theory, Test Format, Language Usage, Test Items
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques
Wyse, Adam E. – Applied Psychological Measurement, 2011
In many practical testing situations, alternate test forms from the same testing program are not strictly parallel to each other and instead the test forms exhibit small psychometric differences. This article investigates the potential practical impact that these small psychometric differences can have on expected classification accuracy. Ten…
Descriptors: Test Format, Test Construction, Testing Programs, Psychometrics
Filipi, Anna – Language Testing, 2012
The Assessment of Language Competence (ALC) certificates is an annual, international testing program developed by the Australian Council for Educational Research to test the listening and reading comprehension skills of lower to middle year levels of secondary school. The tests are developed for three levels in French, German, Italian and…
Descriptors: Listening Comprehension Tests, Item Response Theory, Statistical Analysis, Foreign Countries
Meyers, Jason L.; Murphy, Stephen; Goodman, Joshua; Turhan, Ahmet – Pearson, 2012
Operational testing programs employing item response theory (IRT) applications benefit from the property of item parameter invariance whereby item parameter estimates obtained from one sample can be applied to other samples (when the underlying assumptions are satisfied). In theory, this feature allows for applications such as computer-adaptive…
Descriptors: Equated Scores, Test Items, Test Format, Item Response Theory
Williams, Valerie S. L.; Pommerich, Mary; Thissen, David – Journal of Educational Measurement, 1998
Created a developmental scale for the North Carolina End-of-Grade Mathematics Tests using a subset of identical test forms administered to adjacent grade levels with Thurstone scaling and Item Response Theory methods. Discusses differences in patterns produced. (Author/SLD)
Descriptors: Achievement Tests, Child Development, Comparative Analysis, Elementary Secondary Education
Emenogu, Barnabas; Childs, Ruth A. – 2003
This study investigated the possible impacts of language and curriculum differences on the performance of test items by subpopulations of students. Focusing on Measurement and Geometry items completed by students in French- and English-language schools in Ontario made it possible to explore the differences and to compare the item response theory…
Descriptors: Curriculum, English, Foreign Countries, French
Harris, Deborah J. – Applied Psychological Measurement, 1991
Effects of passage and item-scrambling on equipercentile and item-response theory equating were investigated using 2 scrambled versions of the American College Testing Program Assessment for approximately 25,000 examinees. Results indicate that using a base-form conversion table with a scrambled form affects the individual examinee level. (SLD)
Descriptors: College Entrance Examinations, Comparative Testing, Context Effect, Equated Scores
Zwick, Rebecca – Educational Measurement: Issues and Practice, 1991
Item parameter estimates derived through item response theory methods have been considered relatively robust to changes in item position and context, but the anomaly in reading scores from the 1986 National Assessment of Educational Progress (NAEP) illustrates problems with common population equating procedures when there are test form changes.…
Descriptors: Achievement Tests, Context Effect, Equated Scores, Estimation (Mathematics)
Bolt, Daniel; Ackerman, Terry – 1994
The 1993 Illinois Goal Assessment Program (IGAP) Reading Tests measured reading comprehension using both narrative and expository reading passages. Noticeable differences in mean scaled scores occurred depending on whether the 1993 results were equated back to the 1992 narrative test or the 1993 expository test (Hsu and Ackerman, 1994). In an…
Descriptors: Achievement, Context Effect, Correlation, Educational Objectives
Anivan, Sarinee, Ed. – 1991
The selection of papers on language testing includes: "Language Testing in the 1990s: How Far Have We Come? How Much Further Have We To Go?" (J. Charles Alderson); "Current Research/Development in Language Testing" (John W. Oller, Jr.); "The Difficulties of Difficulty: Prompts in Writing Assessment" (Liz Hamp-Lyons,…
Descriptors: Communicative Competence (Languages), Comparative Analysis, Computer Assisted Testing, Cues