Publication Date
| In 2026 | 0 |
| Since 2025 | 8 |
| Since 2022 (last 5 years) | 26 |
| Since 2017 (last 10 years) | 61 |
| Since 2007 (last 20 years) | 83 |
Descriptor
| Test Items | 122 |
| Test Reliability | 106 |
| Test Construction | 80 |
| Test Validity | 74 |
| Foreign Countries | 43 |
| Factor Analysis | 29 |
| Item Analysis | 28 |
| Difficulty Level | 24 |
| Correlation | 23 |
| Psychometrics | 22 |
| College Students | 17 |
Location
| Turkey | 17 |
| Florida | 4 |
| China | 3 |
| California | 2 |
| Canada | 2 |
| District of Columbia | 2 |
| Georgia | 2 |
| Illinois | 2 |
| India | 2 |
| Iran | 2 |
| Netherlands | 2 |
Laws, Policies, & Programs
| United Nations Convention on… | 1 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Abdullah Faruk Kiliç; Meltem Acar Güvendir; Gül Güler; Tugay Kaçak – Measurement: Interdisciplinary Research and Perspectives, 2025
In this study, the extent to which wording effects impact structure and factor loadings, internal consistency, and measurement invariance was outlined. The modified form, which includes items that are semantically reversed, explains 21.5% more variance than the original form. Also, reversed items' factor loadings are higher. As a result of CFA, indexes…
Descriptors: Test Items, Factor Structure, Test Reliability, Semantics
Eyüp Yurt – International Journal of Education in Mathematics, Science and Technology, 2025
This study aimed to develop and validate the Creative Problem-Solving Skills Test (CPSS-T), grounded in Torrance's creativity theory, to assess these skills in university students. The CPSS-T consists of five open-ended question types, each designed to measure different aspects of creative problem-solving: Alternative Use, Hypothetical Scenario,…
Descriptors: Creativity Tests, Creativity, Creative Thinking, Problem Solving
Kaharu, Sarintan N.; Mansyur, Jusman – Pegem Journal of Education and Instruction, 2021
This study aims to develop a test that can be used to explore mental models and representation patterns of objects in liquid fluid. The test was developed by adapting Reeves's Development Model in several stages, namely: determining the orientation and test segments; initial survey; preparation of the initial draft; try out;…
Descriptors: Test Construction, Schemata (Cognition), Scientific Concepts, Water
Hsiao, Kuo-Lun; Ku, Ya-Yuan; Lee, Ya-Ting – Education and Information Technologies, 2023
New media literacy is an expected competency for university students. However, few literacy scales can evaluate students' fake news reporting and checking abilities. In the past, the new media literacy framework only included Critical Consuming, Critical Prosumption, Functional Prosumption, and Functional Consuming. Therefore, this study proposes…
Descriptors: Test Construction, Media Literacy, Test Validity, Test Items
José Ventura-León; Cristopher Lino-Cruz; Shirley Tocto-Muñoz; Andy Rick Sánchez-Villena – Journal of Psychoeducational Assessment, 2025
Academic and occupational success requires social intelligence, the ability to comprehend and manage interpersonal connections. This research aims to assess and improve the Tromsø Social Intelligence Scale (TSIS) for Peruvian university students, focusing on cultural adaptability, reliability, and validity. Participants included 973 university…
Descriptors: Factor Analysis, Intelligence Tests, Test Items, Test Length
Balbuena, Sherwin – International Journal of Assessment Tools in Education, 2023
Depression is a latent characteristic that is measured through self-reported or clinician-mediated instruments such as scales and inventories. The precision of depression estimates largely depends on the validity of the items used and on the truthfulness of people responding to these items. The existing methodology in instrumentation based on a…
Descriptors: Depression (Psychology), Test Items, Test Validity, Test Reliability
Sarah K. Cowan; Michael Hout; Stuart Perrett – Sociological Methods & Research, 2024
Long-running surveys need a systematic way to reflect social change and to keep items relevant to respondents, especially when they ask about controversial subjects, or they threaten the items' validity. We propose a protocol for updating measures that preserves content and construct validity. First, substantive experts articulate the current and…
Descriptors: Surveys, Public Opinion, Social Attitudes, Pregnancy
Qilong Zhang; Weiying Wu; Ke Jiang – European Journal of Teacher Education, 2024
Teacher professional standards are a mechanism to safeguard quality teaching. In the context of Chinese early childhood education (ECE), this study developed a scale for self-assessing teacher competence against professional standards. The study adopted a three-phase design. In Phase 1, in accordance with Professional Standards for Kindergarten…
Descriptors: Standards, Self Evaluation (Individuals), Rating Scales, Preschool Teachers
Hande, Vasudha; Jayan, Parvathy; Kishore, M. Thomas; Bhaskarapillai, Binukumar; Kommu, John Vijay Sagar – Journal of Intellectual Disabilities, 2023
Identifying the determinants of positive coping is a critical step in empowering the parents of children with intellectual disability. In this context, this study aims to develop a scale to assess the determinants of positive coping. Accordingly, culturally relevant items were pooled, validated by experts, and refined. The scale was…
Descriptors: Parents, Coping, Intellectual Disability, Children
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Daniel A. DeCino; Steven R. Chesnut; Phillip L. Waalkes; Reed N. Keen – Measurement and Evaluation in Counseling and Development, 2025
Objective: The purpose of this study was to develop and validate the Counselor Self-Reflection Inventory (CSRI) from a Transformative Learning Theory framework for counselors and counselors-in-training to use in clinical and training settings. Method: A sample of 351, mostly female (86.89%), white (85.19%), counselors with MS or MA (88.08%)…
Descriptors: Test Construction, Test Validity, Test Reliability, Attitude Measures
Do-Hong Kim; Chuang Wang; Thi Nhu Ngoc Truong – Language Teaching Research, 2024
Researchers and practitioners in the field of second language acquisition have come to realize the importance of non-cognitive skills such as self-efficacy and self-regulation in students' learning of a second language. However, there has been limited systematic research on such measures in the second language context and the validity and…
Descriptors: Psychometrics, Test Content, Self Efficacy, English Language Learners
Vucaj, Indrit – Journal of Research on Technology in Education, 2022
This study presents the methodological and procedural development process of the Digital Age Teaching Scale (DATS), a summative assessment tool designed to measure application of the ISTE Standards for Educators in K-12 classrooms. The theoretical framework of the ISTE Standards for Educators informed the development of DATS, and an 8-step process…
Descriptors: Elementary Secondary Education, Standards, Test Construction, Test Items
Hidayati Maghfiroh; Siti Zubaidah; Susriyati Mahanal; Hendra Susanto; Chun-Yen Chang – Journal of Baltic Science Education, 2025
So far, instruments to measure genetic literacy that encompass global genetic issues are limited. In addition, instruments that assess practical knowledge and comprehensively evaluate genetic literacy skills have yet to be developed. Therefore, this research aimed to develop, validate, and improve an instrument based on a new conceptual framework…
Descriptors: Genetics, Scientific Concepts, Science Tests, Thinking Skills
Tim Stoeckel; Liang Ye Tan; Hung Tan Ha; Nam Thi Phuong Ho; Tomoko Ishii; Young Ae Kim; Chunmei Huang; Stuart McLean – Vocabulary Learning and Instruction, 2024
Local item dependency (LID) occurs when test-takers' responses to one test item are affected by their responses to another. It can be problematic if it causes inflated reliability estimates or distorted person and item measures. The cued-recall reading comprehension test in Hu and Nation's (2000) well-known and influential coverage-comprehension…
Descriptors: Reading Comprehension, English (Second Language), Second Language Instruction, Second Language Learning

