Publication Date
In 2025 | 11 |
Since 2024 | 23 |
Since 2021 (last 5 years) | 58 |
Since 2016 (last 10 years) | 94 |
Since 2006 (last 20 years) | 94 |
Descriptor
Test Reliability | 94 |
Foreign Countries | 77 |
Test Validity | 71 |
Test Construction | 49 |
Factor Analysis | 33 |
Measures (Individuals) | 25 |
Psychometrics | 18 |
Test Items | 17 |
Turkish | 17 |
Factor Structure | 15 |
Translation | 15 |
Source
International Journal of… | 94 |
Author
Acikgul, Kubra | 2 |
Alper Uslukaya | 2 |
Dogan, Nuri | 2 |
Duru, Erdinc | 2 |
Müslim Alanoglu | 2 |
Ozer Ozkan, Yesim | 2 |
Songül Karabatak | 2 |
Yakar, Levent | 2 |
Acar Guvendir, Meltem | 1 |
Aciksoz, Arif | 1 |
Adams, Betty A. J. | 1 |
Publication Type
Journal Articles | 94 |
Reports - Research | 90 |
Tests/Questionnaires | 25 |
Information Analyses | 3 |
Reports - Descriptive | 1 |
Location
Turkey | 68 |
Turkey (Ankara) | 4 |
Turkey (Istanbul) | 3 |
Cyprus | 1 |
Ghana | 1 |
Nigeria | 1 |
Philippines | 1 |
United Kingdom (England) | 1 |
Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025
This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…
Descriptors: Artificial Intelligence, Test Items, Automation, Test Format
Yasemin Duygu Esen; Filiz Isnaç; H. Deniz Gülleroglu – International Journal of Assessment Tools in Education, 2025
This study aims to determine the reliability of the Beck Depression Inventory (BDI), a widely used tool for diagnosing depression--a condition that significantly impacts individuals' lives--through meta-analysis, an advanced statistical technique. To achieve this objective, studies conducted in Turkey between 1961 and 2021 that utilized the Beck…
Descriptors: Foreign Countries, Depression (Psychology), Measures (Individuals), Test Reliability
Murat Ermis; Safak Uluçinar Sagir – International Journal of Assessment Tools in Education, 2025
In this study, an attempt was made to develop a valid and reliable measurement tool to determine teachers' self-efficacy levels for teaching metacognitive listening strategies. The study group consisted of 205 teachers for EFA and 248 teachers for CFA. As a result of the analyses, a 16-item scale with four factors was developed. It was…
Descriptors: Test Validity, Test Reliability, Metacognition, Listening Skills
Mehmet Emin Ören; Servet Atik – International Journal of Assessment Tools in Education, 2025
This study aimed to adapt the DigiFuehr 2.0 Scale developed by Claassen et al. (2023) into Turkish and to conduct validity and reliability studies on three groups of teachers. In the study, exploratory and confirmatory factor analyses were performed in line with translation study, linguistic application, and…
Descriptors: Test Reliability, Test Validity, Test Construction, Translation
Hongwei Yang; Müslim Alanoglu; Songül Karabatak; Kelly D. Bradley – International Journal of Assessment Tools in Education, 2025
The study took a Rasch measurement theory approach to validating the 10-item Digital Literacy Scale (DLS) using the unidimensional rating scale model (RSM). To that end, the study used the data from a sample of online Turkish university students. The study began the Rasch analysis with all 10 items in the scale and, to improve in the local…
Descriptors: Digital Literacy, Measures (Individuals), Test Validity, Foreign Countries
Cemile Keski; Ilkay Dogan Tas – International Journal of Assessment Tools in Education, 2025
The purpose of this study was to create a valid and reliable measurement instrument for assessing middle school branch teachers' perceptions of curriculum leadership. A simple random sampling technique was used to choose the participants. The study's sample comprised 343 middle school branch teachers. The researchers…
Descriptors: Test Construction, Test Validity, Test Reliability, Measures (Individuals)
Tugba Ögücü; Aytaç Gögüs – International Journal of Assessment Tools in Education, 2025
School principals should have school technology leadership skills in order to manage technology integration into teaching and learning activities effectively and efficiently. The 'School Technology Leadership Scale' was developed by Grace in 2020 to assess principals' technology leadership skills from the perspective of the teachers. The scale was…
Descriptors: Test Validity, Test Reliability, Measures (Individuals), Educational Technology
Eren Can Aybek; Serkan Arikan; Günes Ertas – International Journal of Assessment Tools in Education, 2024
When item parameters must be estimated for a large item bank, a Multiple Matrix Sampling (MMS) design provides an efficient way to do so while minimizing the test burden on students. The current study exemplifies how to calibrate a large item pool using an MMS design for various purposes, such as developing a CAT administration. The purpose of the…
Descriptors: Elementary School Mathematics, Elementary School Students, Grade 4, Item Banks
Alper Uslukaya; Yilmaz Arslan; Füsun Yükrük; Oktay Ag; Mehmet Akif Evcimik – International Journal of Assessment Tools in Education, 2025
The aim of this study was to develop a measurement tool to determine teachers' perceived emotional job demands. The item pool created as a result of a literature review conducted by researchers was subjected to expert evaluation for content, appearance, and meaning validity, and finally, a draft scale form was created. The draft form was applied…
Descriptors: Test Construction, Test Validity, Test Reliability, Measures (Individuals)
Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024
This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…
Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)
Sümeyye Arkan; Sema Tan – International Journal of Assessment Tools in Education, 2025
Teachers' perceptions, attitudes, and opinions about students, curricula, or evaluation methods contribute to the development of students' talents. Thus, researchers often collect data from teachers to identify gifted students, determine educational practices to meet the students' needs and assess gifted education programs. Researchers often…
Descriptors: Talent Identification, Academically Gifted, Evaluation Methods, Measurement Techniques
Kogar, Hakan – International Journal of Assessment Tools in Education, 2022
The purpose of this study is to identify which scale short-form development method produces better findings in different factor structures. A simulation study was designed based on this purpose. Three different factor structures and three simulation conditions were selected. As the findings of this simulation study, the model-data fit and…
Descriptors: Test Construction, Measures (Individuals), Factor Structure, Test Reliability
Ozalp, Ugur; Cetin, Munevver – International Journal of Assessment Tools in Education, 2022
The aim of this study was to develop a scale for measuring academic intellectual capital in the Turkish higher education context based on student perceptions. The sample consisted of students of higher education institutions in the 2020-2021 academic year. Data were gathered in two stages. Exploratory Factor Analysis (EFA) was…
Descriptors: Measures (Individuals), College Students, Test Validity, Test Reliability
Aybek, Eren Can; Toraman, Cetin – International Journal of Assessment Tools in Education, 2022
The current study investigates the optimum number of response categories for Likert-type scales under item response theory (IRT). The data were collected from university students, mainly attending the faculty of medicine and the faculty of education. A form of the "Social Gender Equity Scale" developed by Gozutok et al. (2017)…
Descriptors: Likert Scales, Item Response Theory, College Students, Test Reliability
Acar Guvendir, Meltem; Ozer Ozkan, Yesim – International Journal of Assessment Tools in Education, 2022
The aim of this study is to examine how different item removal strategies applied during the exploratory factor analysis (EFA) phase of scale development change the number of factors, factor loadings, explained variance ratio, and reliability values (α and ω). In the study, data obtained from 379 university students were used for the…
Descriptors: Test Items, Factor Analysis, Factor Structure, Test Construction