NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
Assessments and Surveys
Program for International…3
What Works Clearinghouse Rating
Showing 1 to 15 of 20 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Drackert, Anastasia; Timukova, Anna – Language Testing, 2020
In view of the ubiquitous increase in the use of C-tests, which are almost unanimously believed to measure general language proficiency, this study investigates whether the aspects of language proficiency tapped into by the C-test format are the same when the test is taken by a learner population other than that of foreign language learners.…
Descriptors: Cloze Procedure, Language Tests, Russian, Second Language Learning
Peer reviewed Peer reviewed
Direct linkDirect link
Ludewig, Ulrich; Schwerter, Jakob; McElvany, Nele – Journal of Psychoeducational Assessment, 2023
A better understanding of how distractor features influence the plausibility of distractors is essential for an efficient multiple-choice (MC) item construction in educational assessment. The plausibility of distractors has a major influence on the psychometric characteristics of MC items. Our analysis utilizes the nominal categories model to…
Descriptors: Vocabulary, Language Tests, German, Grade 4
Peer reviewed Peer reviewed
Direct linkDirect link
Gombert, Sebastian; Di Mitri, Daniele; Karademir, Onur; Kubsch, Marcus; Kolbe, Hannah; Tautz, Simon; Grimm, Adrian; Bohm, Isabell; Neumann, Knut; Drachsler, Hendrik – Journal of Computer Assisted Learning, 2023
Background: Formative assessments are needed to enable monitoring how student knowledge develops throughout a unit. Constructed response items which require learners to formulate their own free-text responses are well suited for testing their active knowledge. However, assessing such constructed responses in an automated fashion is a complex task…
Descriptors: Coding, Energy, Scientific Concepts, Formative Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Plassmann, Sibylle; Zeidler, Beate – Language Learning in Higher Education, 2014
Language testing means taking decisions: about the test taker's results, but also about the test construct and the measures taken in order to ensure quality. This article takes the German test "telc Deutsch C1 Hochschule" as an example to illustrate this decision-making process in an academic context. The test is used for university…
Descriptors: Language Tests, Test Wiseness, Test Construction, Decision Making
Peer reviewed Peer reviewed
Direct linkDirect link
Haag, Nicole; Heppt, Birgit; Roppelt, Alexander; Stanat, Petra – European Journal of Psychology of Education, 2015
In large-scale assessment studies, language minority students typically obtain lower test scores in mathematics than native speakers. Although this performance difference was related to the linguistic complexity of test items in some studies, other studies did not find linguistically demanding math items to be disproportionally more difficult for…
Descriptors: Foreign Countries, Language Minorities, Native Speakers, Monolingualism
Peer reviewed Peer reviewed
Direct linkDirect link
Guo, Hongwen; Ercikan, Kadriye – Educational Research and Evaluation, 2020
Rapid response behaviour, a type of test disengagement, cannot be interpreted as a true indicator of the targeted constructs and may compromise score accuracy as well as score validity for interpretation. Rapid responding may be due to multiple factors for diverse populations. In this study, using Programme for International Student Assessment…
Descriptors: Response Style (Tests), Foreign Countries, International Assessment, Achievement Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Zufferey, Sandrine; Gygax, Pascal M. – Discourse Processes: A multidisciplinary journal, 2016
Previous research has suggested that some discourse relations are easier to convey implicitly than others due to cognitive biases in the interpretation of discourse. In this article we argue that relations involving a perspective shift, such as confirmation relations, are difficult to convey implicitly. We assess this claim with two empirical…
Descriptors: Role, Perspective Taking, Discourse Analysis, French
Peer reviewed Peer reviewed
Direct linkDirect link
Zehner, Fabian; Goldhammer, Frank; Lubaway, Emily; Sälzer, Christine – Education Inquiry, 2019
In 2015, the "Programme for International Student Assessment" (PISA) introduced multiple changes in its study design, the most extensive being the transition from paper- to computer-based assessment. We investigated the differences between German students' text responses to eight reading items from the paper-based study in 2012 to text…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Eckes, Thomas – Language Testing, 2014
Testlets are subsets of test items that are based on the same stimulus and are administered together. Tests that contain testlets are in widespread use in language testing, but they also share a fundamental problem: Items within a testlet are locally dependent with possibly adverse consequences for test score interpretation and use. Building on…
Descriptors: Test Items, Language Tests, Listening Comprehension Tests, German
Peer reviewed Peer reviewed
Direct linkDirect link
Ring, Malte; Brahm, Taiga; Randler, Christoph – International Journal of Science Education, 2019
In science, graphs are used to visualise data, relationships and scientific principles. Basic graph reading operations, i.e. reading single data points, recognising trends as well as conducting small extrapolations from the data are important skills for pupils and lay the groundwork for a comprehensive understanding of data visualisations across…
Descriptors: Graphs, Difficulty Level, Data Interpretation, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Schindler, Julia; Richter, Tobias; Isberner, Maj-Britt; Naumann, Johannes; Neeb, Yvonne – Language Assessment Quarterly, 2018
Reading comprehension is based on the efficient accomplishment of several cognitive processes at the word, sentence, and text level. To the extent that each of these processes contributes to reading comprehension, it can cause reading difficulties if it is deficient. To identify individual sources of reading difficulties, tools are required that…
Descriptors: Construct Validity, Language Tests, Grammar, Task Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Tiffin-Richards, Simon P.; Pant, Hans Anand; Koller, Olaf – Educational Measurement: Issues and Practice, 2013
Cut-scores were set by expert judges on assessments of reading and listening comprehension of English as a foreign language (EFL), using the bookmark standard-setting method to differentiate proficiency levels defined by the Common European Framework of Reference (CEFR). Assessments contained stratified item samples drawn from extensive item…
Descriptors: Foreign Countries, English (Second Language), Language Tests, Standard Setting (Scoring)
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Yi-Hsuan; Haberman, Shelby J. – International Journal of Testing, 2016
The use of computer-based assessments makes the collection of detailed data that capture examinees' progress in the tests and time spent on individual actions possible. This article presents a study using process and timing data to aid understanding of an international language assessment and the examinees. Issues regarding test-taking strategies,…
Descriptors: Computer Assisted Testing, Test Wiseness, Language Tests, International Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Zehner, Fabian; Sälzer, Christine; Goldhammer, Frank – Educational and Psychological Measurement, 2016
Automatic coding of short text responses opens new doors in assessment. We implemented and integrated baseline methods of natural language processing and statistical modelling by means of software components that are available under open licenses. The accuracy of automatic text coding is demonstrated by using data collected in the "Programme…
Descriptors: Educational Assessment, Coding, Automation, Responses
Peer reviewed Peer reviewed
Direct linkDirect link
Hartig, Johannes; Frey, Andreas; Nold, Gunter; Klieme, Eckhard – Educational and Psychological Measurement, 2012
The article compares three different methods to estimate effects of task characteristics and to use these estimates for model-based proficiency scaling: prediction of item difficulties from the Rasch model, the linear logistic test model (LLTM), and an LLTM including random item effects (LLTM+e). The methods are applied to empirical data from a…
Descriptors: Item Response Theory, Models, Methods, Computation
Previous Page | Next Page »
Pages: 1  |  2