Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 537 |
Since 2006 (last 20 years) | 1060 |
Descriptor
Language Tests | 1170 |
Statistical Analysis | 1170 |
Second Language Learning | 778 |
Foreign Countries | 763 |
English (Second Language) | 744 |
Second Language Instruction | 536 |
Language Proficiency | 385 |
Teaching Methods | 327 |
Scores | 323 |
Pretests Posttests | 305 |
Correlation | 286 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Practitioners | 6 |
Teachers | 5 |
Students | 1 |
Location
Iran | 206 |
Japan | 60 |
China | 52 |
Turkey | 39 |
Taiwan | 35 |
Malaysia | 21 |
Spain | 21 |
Canada | 19 |
California | 18 |
Saudi Arabia | 18 |
Germany | 17 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Pell Grant Program | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 4 |
Razavipour, Kioumars; Raji, Behnaz – Language Testing in Asia, 2022
The credibility of conclusions arrived at in quantitative research depends, to a large extent, on the quality of data collection instruments used to quantify language and non-language constructs. Despite this, research into data collection instruments used in Applied Linguistics and particularly in the thesis genre remains limited. This study…
Descriptors: Applied Linguistics, Test Reliability, Language Tests, Credibility
Li, Hongli; Hunter, Charles Vincent; Bialo, Jacquelyn Anne – Language Assessment Quarterly, 2022
The purpose of this study is to review the status of differential item functioning (DIF) research in language testing, particularly as it relates to the investigation of sources (or causes) of DIF, which is a defining characteristic of the third generation DIF. This review included 110 DIF studies of language tests dated from 1985 to 2019. We…
Descriptors: Test Bias, Language Tests, Statistical Analysis, Evaluation Research
Wang, Lu; Steedle, Jeffrey – ACT, Inc., 2020
In recent ACT mode comparability studies, students testing on laptop or desktop computers earned slightly higher scores on average than students who tested on paper, especially on the ACT® reading and English tests (Li et al., 2017). Equating procedures adjust for such "mode effects" to make ACT scores comparable regardless of testing…
Descriptors: Test Format, Reading Tests, Language Tests, English
Khoshsima, Hooshang; Saed, Amin; Mousaei, Fatemeh – Advances in Language and Literary Studies, 2018
Language proficiency tests have become common instruments to judge people based on their performance. Thus, the scores on language proficiency tests, such as the International English Language Testing System (IELTS) or Teaching English as a Foreign Language (TOEFL), play a crucial role in the test-takers' lives. Because of increasing demands on…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Test Wiseness
Putwain, David W.; Aveyard, Ben – School Psychology Quarterly, 2018
A well established finding is that the cognitive component of test anxiety (worry) is negatively related to examination performance. The present study examined how 3 self-beliefs (academic buoyancy, perceived control, and test competence) moderated the strength of the relationship between worry and examination performance in a sample of 270 final…
Descriptors: Test Anxiety, Secondary School Students, Self Concept, Correlation
Kalkan, Ömür Kaya; Kara, Yusuf; Kelecioglu, Hülya – International Journal of Assessment Tools in Education, 2018
Missing data is a common problem in datasets that are obtained by administration of educational and psychological tests. It is widely known that existence of missing observations in data can lead to serious problems such as biased parameter estimates and inflation of standard errors. Most of the missing data imputation methods are focused on…
Descriptors: Item Response Theory, Statistical Analysis, Data, Test Items
Gampe, Anja; Kurthen, Ira; Daum, Moritz M. – First Language, 2018
The current study describes the development and validation of a novel scale (BILEX) designed to assess young bilingual children's receptive vocabulary in both languages, their conceptual vocabulary, and translational equivalents. BILEX was developed to facilitate the assessment of vocabulary size for both of the children's languages within one…
Descriptors: Language Tests, Bilingualism, Bilingual Students, Preschool Children
McCray, Gareth; Brunfaut, Tineke – Language Testing, 2018
This study investigates test-takers' processing while completing banked gap-fill tasks, designed to test reading proficiency, in order to test theoretically based expectations about the variation in cognitive processes of test-takers across levels of performance. Twenty-eight test-takers' eye traces on 24 banked gap-fill items (on six tasks) were…
Descriptors: Language Tests, Test Items, Item Analysis, Eye Movements
Alonzo, Julie; Anderson, Daniel – Behavioral Research and Teaching, 2018
In response to a request for additional analyses, in particular reporting confidence intervals around the results, we re-analyzed the data from prior studies. This supplementary report presents the results of the additional analyses addressing classification accuracy, reliability, and criterion-related validity evidence. For ease of reference, we…
Descriptors: Curriculum Based Assessment, Computation, Statistical Analysis, Classification
Wang, Chun; Lu, Jing – Journal of Educational and Behavioral Statistics, 2021
In cognitive diagnostic assessment, multiple fine-grained attributes are measured simultaneously. Attribute hierarchies are considered important structural features of cognitive diagnostic models (CDMs) that provide useful information about the nature of attributes. Templin and Bradshaw first introduced a hierarchical diagnostic classification…
Descriptors: Cognitive Measurement, Models, Vertical Organization, Classification
Wei, Youhua; Low, Albert – ETS Research Report Series, 2017
In most large-scale programs of tests that aid in making high-stakes decisions, such as the "TOEIC"® family of products and service, it is not unusual for a significant portion of test takers to retake the test at multiple times.The study reported here used multilevel growth modeling to explore the score change patterns of nearly 20,000…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores
Liu, Xueman Lucy; de Villiers, Jill; Ning, Chunyan; Rolfhus, Eric; Hutchings, Teresa; Lee, Wendy; Jiang, Fan; Zhang, Yi Wen – Journal of Speech, Language, and Hearing Research, 2017
Purpose: With no existing gold standard for comparison, challenges arise for establishing the validity of a new standardized Mandarin language assessment normed in mainland China. Method: A new assessment, Diagnostic Receptive and Expressive Assessment of Mandarin (DREAM), was normed with a stratified sample of 969 children ages 2;6 (years;months)…
Descriptors: Mandarin Chinese, Correlation, Language Tests, Diagnostic Tests
Winke, Paula; Lim, Hyojung – Language Assessment Quarterly, 2017
To examine the effects of listening test preparation, we had three groups, two experimental and one control (63 learners total), partake in three types of instruction sandwiched between two equally difficult listening tests (pretests and posttests). The first experimental group took four practice tests and received "explicit"…
Descriptors: Listening Comprehension Tests, Test Preparation, Pretests Posttests, Experimental Groups
Ajeigbe, Taiwo Oluwafemi; Afolabi, Eyitayo Rufus Ifedayo – World Journal of Education, 2017
This study assessed unidimensionality and occurrence of Differential Item Functioning (DIF) in Mathematics and English Language items of Osun State Qualifying Examination. The study made use of secondary data. The results showed that OSQ Mathematics (-0.094 = r = 0.236) and English Language items (-0.095 = r = 0.228) were unidimensional. Also,…
Descriptors: Foreign Countries, Test Bias, Secondary School Students, Statistical Analysis
Öz, Hüseyin; Özturan, Tuba – Journal of Language and Linguistic Studies, 2018
This article reports the findings of a study that sought to investigate whether computer-based vs. paper-based test-delivery mode has an impact on the reliability and validity of an achievement test for a pedagogical content knowledge course in an English teacher education program. A total of 97 university students enrolled in the English as a…
Descriptors: Computer Assisted Testing, Testing, Test Format, Teaching Methods