Jiajing Huang – ProQuest LLC, 2022
The nonequivalent-groups anchor-test (NEAT) data-collection design is commonly used in large-scale assessments. Under this design, different test groups take different test forms. Each test form has its own unique items and all test forms share a set of common items. If item response theory (IRT) models are applied to analyze the test data, the…
Descriptors: Item Response Theory, Test Format, Test Items, Test Construction
Woodcock, Stuart; Howard, Steven J.; Ehrich, John – School Psychology, 2020
Standardized testing is ubiquitous in educational assessment, but questions have been raised about the extent to which these test scores accurately reflect students' genuine knowledge and skills. To more rigorously investigate this issue, the current study employed a within-subject experimental design to examine item format effects on primary…
Descriptors: Elementary School Students, Grade 3, Test Items, Test Format
Shin, Sun-Young; Lee, Senyung; Lidster, Ryan – Language Testing, 2021
In this study we investigated the potential for a shared-first-language (shared-L1) effect on second language (L2) listening test scores using differential item functioning (DIF) analyses. We did this in order to understand how accented speech may influence performance at the item level, while controlling for key variables including listening…
Descriptors: Listening Comprehension Tests, Language Tests, Native Language, Scores
Keller, Lisa A.; Keller, Robert R. – Applied Measurement in Education, 2015
Equating test forms is an essential activity in standardized testing, and its importance has grown with the accountability systems established through the mandate of Adequate Yearly Progress. It is through equating that scores from different test forms become comparable, which allows for the tracking of changes in the performance of students from…
Descriptors: Item Response Theory, Rating Scales, Standardized Tests, Scoring Rubrics
Meyers, Jason L.; Murphy, Stephen; Goodman, Joshua; Turhan, Ahmet – Pearson, 2012
Operational testing programs employing item response theory (IRT) applications benefit from the property of item parameter invariance, whereby item parameter estimates obtained from one sample can be applied to other samples (when the underlying assumptions are satisfied). In theory, this feature allows for applications such as computer-adaptive…
Descriptors: Equated Scores, Test Items, Test Format, Item Response Theory
Dorans, Neil J.; Liu, Jinghua; Hammond, Shelby – Applied Psychological Measurement, 2008
This exploratory study was built on research spanning three decades. Petersen, Marco, and Stewart (1982) conducted a major empirical investigation of the efficacy of different equating methods. The studies reported in Dorans (1990) examined how different equating methods performed across samples selected in different ways. Recent population…
Descriptors: Test Format, Equated Scores, Sampling, Evaluation Methods
Yi, Hyun Sook; Kim, Seonghoon; Brennan, Robert L. – Applied Psychological Measurement, 2007
Large-scale testing programs involving classification decisions typically have multiple forms available and conduct equating to ensure cut-score comparability across forms. A test developer might be interested in the extent to which an examinee who happens to take a particular form would have a consistent classification decision if he or she had…
Descriptors: Classification, Reliability, Indexes, Computation
Ackerman, Terry – 1990
The issue of parallel forms is of paramount importance for producers of standardized tests. With increasing emphasis being placed on standardized test results, it is necessary that each student achieve the same standard score regardless of the form he or she was administered. In the case of the American College Testing (ACT) Assessment Program,…
Descriptors: Achievement Tests, College Bound Students, College Entrance Examinations, High School Students
Emenogu, Barnabas; Childs, Ruth A. – 2003
This study investigated the possible impacts of language and curriculum differences on the performance of test items by subpopulations of students. Focusing on Measurement and Geometry items completed by students in French- and English-language schools in Ontario made it possible to explore the differences and to compare the item response theory…
Descriptors: Curriculum, English, Foreign Countries, French
Valley, John R. – 1992
From 1970 to 1985, the Scholastic Aptitude Test (SAT) underwent major modifications caused by: (1) the addition of the Test of Standard Written English (TSWE) to the College Board's Admissions Testing Program (ATP); (2) the passage of test disclosure legislation; (3) the institution of test sensitivity reviews; and (4) the use of item response…
Descriptors: Achievement Tests, College Entrance Examinations, Educational History, Equated Scores