NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Teachers1
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 41 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Ute Knoch; Jason Fan – Language Testing, 2024
While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publically available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…
Descriptors: Language Tests, English, Test Validity, Item Analysis
Gill, Tim – Research Matters, 2022
In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…
Descriptors: Comparative Analysis, Decision Making, Scripts, Standards
Walland, Emma – Research Matters, 2022
In this article, I report on examiners' views and experiences of using Pairwise Comparative Judgement (PCJ) and Rank Ordering (RO) as alternatives to traditional analytical marking for GCSE English Language essays. Fifteen GCSE English Language examiners took part in the study. After each had judged 100 pairs of essays using PCJ and eight packs of…
Descriptors: Essays, Grading, Writing Evaluation, Evaluators
Peer reviewed Peer reviewed
Direct linkDirect link
Marshall, Neil; Shaw, Kirsten; Hunter, Jodie; Jones, Ian – New Zealand Journal of Educational Studies, 2020
There is growing interest in using comparative judgement to assess student work as an alternative to traditional marking. Comparative judgement requires no rubrics and is instead grounded in experts making pairwise judgements about the relative 'quality' of students' work according to a high level criterion. The resulting decision data are fitted…
Descriptors: Comparative Analysis, Decision Making, Student Evaluation, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Winke, Paula; Lee, Shinhye; Ahn, Jieun Irene; Choi, Ina; Cui, Yaqiong; Yoon, Hyung-Jo – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2018
This study investigated the cognitive validity of two child English language tests. Some teachers maintain that these types of tests may be cognitively invalid because native-English-speaking children would not do well on them (Winke, 2011). So the researchers had native speakers and learners of English aged 7 to 9 take sample versions of two…
Descriptors: Language Tests, English, English (Second Language), Second Language Learning
Peer reviewed Peer reviewed
Direct linkDirect link
James, Kate; Hannah, Elizabeth F. S. – International Journal of School & Educational Psychology, 2018
In Ireland, dyslexic students can apply for reasonable accommodations in Leaving Certificate examinations. One such accommodation is the Spelling and Grammar Waiver (SGW). Questions have been raised regarding its validity, and it has been suggested that it gives an unfair advantage. Mock Leaving Certificate English paper scripts were collected…
Descriptors: Foreign Countries, Spelling, Grammar, Dyslexia
Peer reviewed Peer reviewed
Direct linkDirect link
Aryadoust, Vahid; Foo, Stacy; Ng, Li Ying – Language Testing, 2022
The aim of this study was to investigate how test methods affect listening test takers' performance and cognitive load. Test methods were defined and operationalized as while-listening performance (WLP) and post-listening performance (PLP) formats. To achieve the goal of the study, we examined test takers' (N = 80) brain activity patterns…
Descriptors: Listening Comprehension Tests, Language Tests, Eye Movements, Brain Hemisphere Functions
Peer reviewed Peer reviewed
Direct linkDirect link
Bakhtiar, Mehdi; Wong, Min Ney; Tsui, Emily Ka Yin; McNeil, Malcolm R. – Journal of Speech, Language, and Hearing Research, 2020
Purpose: This study reports the psychometric development of the Cantonese versions of the English Computerized Revised Token Test (CRTT) for persons with aphasia (PWAs) and healthy controls (HCs). Method: The English CRTT was translated into standard Chinese for the Reading--Word Fade version (CRTT-R-[subscript WF]-Cantonese) and into formal…
Descriptors: Psychometrics, Sino Tibetan Languages, Computer Assisted Testing, Aphasia
Peer reviewed Peer reviewed
Direct linkDirect link
Krell, Moritz; Mathesius, Sabrina; van Driel, Jan; Vergara, Claudia; Krüger, Dirk – International Journal of Science Education, 2020
Scientific reasoning competencies are relevant science competencies and therefore the development of assessment instruments for scientific reasoning competencies has become an integral part of science education research. However, some authors have questioned the validity of the instruments available so far, since their psychometric quality has not…
Descriptors: Preservice Teachers, Science Teachers, Science Instruction, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Sandilands, Debra; Oliveri, Maria Elena; Zumbo, Bruno D.; Ercikan, Kadriye – International Journal of Testing, 2013
International large-scale assessments of achievement often have a large degree of differential item functioning (DIF) between countries, which can threaten score equivalence and reduce the validity of inferences based on comparisons of group performances. It is important to understand potential sources of DIF to improve the validity of future…
Descriptors: Validity, Measures (Individuals), International Studies, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Spada, Nina; Shiu, Julie Li-Ju; Tomita, Yasuyo – Language Learning, 2015
This study builds on research investigating the construct validity of elicited imitation (EI) as a measure of implicit second language (L2) grammatical knowledge. It differs from previous studies in that the EI task focuses on a single grammatical feature and time on task is strictly controlled. Seventy-three EFL learners and 20 native English…
Descriptors: Construct Validity, Task Analysis, Native Speakers, Second Language Learning
Dikici, Ayhan; Soh, Kaycheng – Online Submission, 2015
Many measurement tools on creativity are available in the literature. One of these scales is Creativity Fostering Teacher Behaviour Index (CFTIndex) developed for Singaporean teacher originally. It was then translated into Turkish and trialled on teachers in Nigde province with acceptable reliability and factorial validity. The main purpose of…
Descriptors: Creativity, Teacher Behavior, Comparative Analysis, Turkish
Peer reviewed Peer reviewed
Direct linkDirect link
Polikoff, Morgan S. – Educational Assessment, 2016
As state tests of student achievement are used for an increasingly wide array of high- and low-stakes purposes, evaluating their instructional sensitivity is essential. This article uses data from the Bill and Melinda Gates Foundation's Measures of Effective Project to examine the instructional sensitivity of 4 states' mathematics and English…
Descriptors: High Stakes Tests, Achievement Tests, Mathematics Tests, English
Peer reviewed Peer reviewed
Direct linkDirect link
Bagiati, Aikaterini; Yoon, So Yoon; Evangelou, Demetra; Magana, Alejandra; Kaloustian, Garene; Zhu, Jiabin – International Journal of STEM Education, 2015
Background: The newly formed discipline of engineering education is addressing the need to (a) enhance STEM education for precollege students and (b) identify optimum ways to introduce engineering content starting, perhaps, from the early ages. Introducing engineering at the Prekindergarten through 12th grade (PreK-12) education level requires…
Descriptors: Elementary Secondary Education, Engineering Education, Faculty Development, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Tse, Chi-Shing; Altarriba, Jeanette – Psychological Record, 2012
English speakers use horizontal spatial metaphors (e.g., before/after) to talk about time relative to vertical spatial metaphors (e.g., up/down), so they may be faster in verifying temporal targets (e.g., June comes after April) that are preceded by primes that activate horizontal, relative to vertical, spatial metaphors. We examined this…
Descriptors: Figurative Language, Spatial Ability, Time, Comprehension
Previous Page | Next Page »
Pages: 1  |  2  |  3