Ahmet Yildirim; Nizamettin Koç – International Journal of Assessment Tools in Education, 2024
The present research aims to examine whether the questions in the Programme for International Student Assessment (PISA) 2009 reading literacy instrument display differential item functioning (DIF) among the Turkish, French, and American samples based on univariate and multivariate matching techniques before and after the total score, which is…
Descriptors: Test Items, Item Analysis, Correlation, Error of Measurement
Selcuk Acar; Yuyang Shen – Journal of Creative Behavior, 2025
Creativity tests, like creativity itself, vary widely in their structure and use. These differences include instructions, test duration, environments, prompt and response modalities, and the structure of test items. A key factor is task structure, referring to the specificity of the number of responses requested for a given prompt. Classic…
Descriptors: Creativity, Creative Thinking, Creativity Tests, Task Analysis
Gill, Tim – Research Matters, 2022
In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…
Descriptors: Comparative Analysis, Decision Making, Scripts, Standards
Pools, Elodie – Applied Measurement in Education, 2022
Many low-stakes assessments, such as international large-scale surveys, are administered during time-limited testing sessions and some test-takers are not able to endorse the last items of the test, resulting in not-reached (NR) items. However, because the test has no consequence for the respondents, these NR items can also stem from quitting the…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
Rios, Joseph A.; Soland, James – International Journal of Testing, 2022
The objective of the present study was to investigate item-, examinee-, and country-level correlates of rapid guessing (RG) in the context of the 2018 PISA science assessment. Analyzing data from 267,148 examinees across 71 countries showed that over 50% of examinees engaged in RG on an average proportion of one in 10 items. Descriptive…
Descriptors: Foreign Countries, International Assessment, Achievement Tests, Secondary School Students
Stephanie B. Moore – ProQuest LLC, 2024
This three-manuscript dissertation attempts to answer the question: "How does students' English language proficiency (ELP) inform the availability, structure, and use of English language accommodations and intervention to support the academic achievement of English learner (EL) students?" The question is addressed using three independent…
Descriptors: English Language Learners, Language Proficiency, English (Second Language), Second Language Learning
Liu, Yimeng; Wang, Jian – International Journal of Science Education, 2022
The relationship between inquiry-based learning and science self-efficacy was analysed using data from 57 countries and economies participating in the 2015 Programme for International Student Assessment (PISA). This analysis generated a mediating-moderating model, which involved the mediating role of science interest and the moderating role of…
Descriptors: International Assessment, Achievement Tests, Foreign Countries, Secondary School Students
Pools, Elodie; Monseur, Christian – Large-scale Assessments in Education, 2021
Background: The idea of using low-stakes assessment results is often mentioned when designing educational system reforms. However, when tests have no consequences for the students, test takers may not make enough effort when completing the test, and their lack of engagement may negatively affect the validity of the conclusions of the studies that…
Descriptors: Science Tests, Test Validity, Student Motivation, Learner Engagement
Gao, Qiufeng; Wang, Huan; Chang, Fang; An, Qi; Yi, Hongmei; Kenny, Kaleigh; Shi, Yaojiang – Compare: A Journal of Comparative and International Education, 2022
This article reports on research conducted to investigate student confidence in reading by collecting data from 135 primary schools in rural China. In the survey, we adopted the PIRLS scales of confidence in reading and reading skills test items. Our analysis shows that compared to the other countries and regions, rural China ranks last with…
Descriptors: Self Esteem, Foreign Countries, Correlation, Rural Areas
Jamalzadeh, Mehri; Lotfi, Ahmad Reza; Rostami, Masoud – Language Testing in Asia, 2021
The current study sought to examine the validity of a General English Achievement Test (GEAT), administered to university students in the fall semester of the 2018-2019 academic year, by hybridizing differential item functioning (DIF) and differential distractor functioning (DDF) analytical models. Using a purposive sampling method, from the target population…
Descriptors: Language Tests, Achievement Tests, Undergraduate Students, Islam
Johansson, Stefan – Phi Delta Kappan, 2018
Responding to an earlier "Phi Delta Kappan" article, the author rejects the argument that East Asian students' high scores on international educational assessments come at the expense of learning to be creative and entrepreneurial. According to survey research, people in Japan, Korea, and other East Asian nations perceive themselves to…
Descriptors: International Assessment, High Achievement, Scores, Creativity
Wang, Yan; Kim, Eun Sook; Dedrick, Robert F.; Ferron, John M.; Tan, Tony – Educational and Psychological Measurement, 2018
Wording effects associated with positively and negatively worded items have been found in many scales. Such effects may threaten construct validity and introduce systematic bias in the interpretation of results. A variety of models have been applied to address wording effects, such as the correlated uniqueness model and the correlated traits and…
Descriptors: Test Items, Test Format, Correlation, Construct Validity
Dirlik, Ezgi Mor – International Journal of Progressive Education, 2019
Item response theory (IRT) has many advantages over its predecessor, Classical Test Theory (CTT), such as invariant item parameters and ability parameter estimates that do not depend on the items. However, to obtain these advantages, some assumptions should be met: unidimensionality, normality, and local independence. However, it is not…
Descriptors: Comparative Analysis, Nonparametric Statistics, Item Response Theory, Models
Löwenadler, John – Language Testing, 2019
This study aims to investigate patterns of variation in the interplay of L2 language ability and general reading comprehension skills in L2 reading, by comparing item-level effects of test-takers' results on L1 and L2 reading comprehension tests. The material comes from more than 500,000 people tested on L1 (Swedish) and L2 (English) in the…
Descriptors: Swedish, English (Second Language), Second Language Learning, Second Language Instruction
Büyükturan, Esin Bagcan; Sireci, Ayse – Journal of Education and Training Studies, 2018
Item discrimination index, which indicates the ability of the item to distinguish whether or not the individuals have acquired the qualities that are evaluated, is basically a validity measure and it is estimated by examining the fit between item score and the test score. Based on the definition of item discrimination index, classroom observation…
Descriptors: Foreign Countries, Classroom Observation Techniques, Scores, Test Items