Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 8 |
Descriptor
Test Items | 8 |
Test Reliability | 8 |
Test Validity | 7 |
Test Construction | 5 |
Foreign Countries | 3 |
Item Analysis | 3 |
Language Tests | 3 |
Receptive Language | 3 |
Scores | 3 |
Second Language Learning | 3 |
Comparative Analysis | 2 |
More ▼ |
Source
Language Assessment Quarterly | 8 |
Author
Aviad-Levitzky, Tami | 1 |
Claudia Helena… | 1 |
Duyen Thi Bich Nguyen | 1 |
Goldstein, Zahava | 1 |
Greenberg, Daphne | 1 |
Hayama, Megumi | 1 |
Hung Tan Ha | 1 |
Ido, Takahiro | 1 |
Jeffrey Stewart | 1 |
Ji-young Shin | 1 |
Koizumi, Rie | 1 |
More ▼ |
Publication Type
Journal Articles | 8 |
Reports - Research | 6 |
Reports - Descriptive | 1 |
Reports - Evaluative | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 2 |
Postsecondary Education | 2 |
Grade 9 | 1 |
Secondary Education | 1 |
Audience
Location
Israel | 1 |
Sweden | 1 |
United States | 1 |
Vietnam | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Peabody Picture Vocabulary… | 1 |
What Works Clearinghouse Rating
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Pablo Robles-García; Stuart McLean; Jeffrey Stewart; Ji-young Shin; Claudia Helena Sánchez-Gutiérrez – Language Assessment Quarterly, 2024
Recent literature in the field of L2 vocabulary assessment has advocated for the development of written receptive vocabulary tests such as Vocabulary Levels Tests (VLTs) that use: (a) meaning-recall item formats, (b) a minimum of 40 item counts per 1,000-frequency band to improve level estimates, and (c) lemmas (not word-families) as the lexical…
Descriptors: Spanish, Test Validity, Test Construction, Vocabulary Development
Tengberg, Michael – Language Assessment Quarterly, 2018
Reading comprehension is often treated as a multidimensional construct. In many reading tests, items are distributed over reading process categories to represent the subskills expected to constitute comprehension. This study explores (a) the extent to which specified subskills of reading comprehension tests are conceptually conceivable to…
Descriptors: Reading Tests, Reading Comprehension, Scores, Test Results
Aviad-Levitzky, Tami; Laufer, Batia; Goldstein, Zahava – Language Assessment Quarterly, 2019
This article describes the development and validation of the new CATSS (Computer Adaptive Test of Size and Strength), which measures vocabulary knowledge in four modalities -- productive recall, receptive recall, productive recognition, and receptive recognition. In the first part of the paper we present the assumptions that underlie the test --…
Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability
Winke, Paula – Language Assessment Quarterly, 2011
In this study, I investigated the reliability of the U.S. Naturalization Test's civics component by asking 414 individuals to take a mock U.S. citizenship test comprising civics test questions. Using an incomplete block design of six forms with 16 nonoverlapping items and four anchor items on each form (the anchors connected the six subsets of…
Descriptors: Test Items, Citizenship, Civics, Test Validity
Pae, Hye K.; Greenberg, Daphne; Morris, Robin D. – Language Assessment Quarterly, 2012
The aim of this study was to apply the Rasch model to an analysis of the psychometric properties of the Peabody Picture Vocabulary Test--III Form A (PPVT--IIIA) items with struggling adult readers. The PPVT--IIIA was administered to 229 African American adults whose isolated word reading skills were between third and fifth grades. Conformity of…
Descriptors: African Americans, Test Items, Construct Validity, Test Validity
Koizumi, Rie; Sakai, Hideki; Ido, Takahiro; Ota, Hiroshi; Hayama, Megumi; Sato, Masatoshi; Nemoto, Akiko – Language Assessment Quarterly, 2011
This article reports on the development and validation of the English Diagnostic Test of Grammar (EDiT Grammar) for Japanese learners of English. From among the many aspects of grammar, this test focuses on the knowledge of basic English noun phrases (NPs), especially their internal structures, because previous research has indicated the…
Descriptors: Nouns, Diagnostic Tests, English (Second Language), Second Language Learning
Spaan, Mary – Language Assessment Quarterly, 2007
This article follows the development of test items (see "Language Assessment Quarterly", Volume 3 Issue 1, pp. 71-79 for the article "Test and Item Specifications Development"), beginning with a review of test and item specifications, then proceeding to writing and editing of items, pretesting and analysis, and finally selection of an item for a…
Descriptors: Test Items, Test Construction, Responses, Test Content