Publication Date
| In 2026 | 1 |
| Since 2025 | 382 |
| Since 2022 (last 5 years) | 1506 |
| Since 2017 (last 10 years) | 3407 |
| Since 2007 (last 20 years) | 5336 |
Descriptor
| Test Validity | 10272 |
| Test Reliability | 10004 |
| Test Construction | 3400 |
| Foreign Countries | 2994 |
| Psychometrics | 1873 |
| Factor Analysis | 1706 |
| Measures (Individuals) | 1377 |
| Evaluation Methods | 992 |
| Questionnaires | 948 |
| College Students | 887 |
| Factor Structure | 861 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 307 |
| Practitioners | 229 |
| Teachers | 84 |
| Administrators | 61 |
| Policymakers | 27 |
| Counselors | 26 |
| Students | 13 |
| Parents | 9 |
| Community | 5 |
| Support Staff | 5 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 702 |
| China | 178 |
| Australia | 175 |
| Canada | 153 |
| Indonesia | 125 |
| Spain | 107 |
| Taiwan | 91 |
| United States | 91 |
| Germany | 90 |
| United Kingdom | 86 |
| Malaysia | 77 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Pereira, Teresa; Freire, Teresa; Tavares, Dionísia – European Journal of Developmental Psychology, 2023
Promoting positive development may lead to young people's active contributions to their environment through positive attitudes and behaviours. The Climate Change Attitude Survey (15-item version) aims to identify climate change attitudes differences in groups of students and to assess pre- to post-intervention attitude changes. We intended to…
Descriptors: Foreign Countries, Test Validity, Climate, Change
Morley, Alicen; Nissen, Jayson M.; Van Dusen, Ben – Physical Review Physics Education Research, 2023
Instructors and researchers often use research-based assessments to identify the impact of instructional activities. These investigations often focus on issues of diversity, equity, and inclusions by comparing outcomes across social identity groups (e.g., gender, race, and class). Comparisons across groups assume the assessments measure the same…
Descriptors: Error of Measurement, Racial Differences, Gender Differences, Test Validity
Oliveira, João Tiago; Faustino, Divo; Freitas, Fátima; Gonçalves, Miguel M.; Ribeiro, Eugénia; Gonçalves, Sónia; Machado, Paulo P. P. – British Journal of Guidance & Counselling, 2023
Worry is a phenomenon that is present in multiple psychopathologies. Given the widely accepted transdiagnostic role that worry plays in psychopathology, reliable measures for this construct are pivotal for clinical practice. The Penn State Worry Questionnaire (PSWQ) is one of the most widely used and established measures of worry in both clinical…
Descriptors: Anxiety, Questionnaires, Emotional Disturbances, Test Validity
Yavuz, Mehmet; Kayali, Bünyami; Balat, Sener; Çalisan, Mücahit – Journal of Educational Technology, 2023
The aim of this study is to adapt the 15-item Chatbot Usability Scale to the Turkish language and culture and evaluate the validity and reliability of the scale in the Turkish language and culture after the adaptation process. The necessary permissions were obtained, and the process was initiated. Proficient translators in both cultures were…
Descriptors: Artificial Intelligence, Computer Mediated Communication, Usability, Evaluation Methods
Zeng, Yating; Chi, Shaohui; Wang, Zuhao; Zhuang, Xiaosong – Journal of Baltic Science Education, 2023
Online metacognitive skills are the real-time awareness of cognition, which can effectively promote science learning and improve performance in solving scientific problems. Therefore, it is important to enhance and diagnose students' online metacognitive skills in science education. This study aimed to evaluate ninth-grade students' online…
Descriptors: Test Construction, Test Validity, Grade 9, Metacognition
Tsuda, Emi; Ward, Phillip; Atkinson, Obidiah J.; He, Yaohui; Sazama, Deb – International Journal of Kinesiology in Higher Education, 2023
Common content knowledge (CCK) is fundamental for quality instruction and critical to be acquired among physical education preservice teachers (PSTs). The purpose of this study was to develop a soccer CCK test for PSTs and to examine the validity and reliability of the test using Rasch measurement modeling. Two content experts developed a test in…
Descriptors: Physical Education Teachers, Preservice Teachers, Test Construction, Test Validity
Tsuda, Emi; Ward, Phillip; Ressler, James D.; Wyant, James; He, Yaohui; Kim, Insook; Santiago, José A. – International Journal of Kinesiology in Higher Education, 2023
The aim of this study was to develop a valid and reliable basketball common content knowledge (CCK) instrument for preservice teachers (PSTs; BB-CCK-T) in secondary physical education contexts in the United States. The research team used four steps to develop the BB-CCK-T. In the first step, two content experts determined the scope and weight of…
Descriptors: Test Construction, Test Validity, Test Reliability, Team Sports
Ambiel, Rodolfo A. M.; Moreira, Thaline da Cunha; Barros, Leonardo de Oliveira; Martins, Gustavo Henrique; Salvador, Ana Paula; Wille, Bart – International Journal for Educational and Vocational Guidance, 2023
This paper documents the translation and adaptation of the "Career Adapt-Abilities Scale + Cooperation Scale" (CAAS + C; Savickas & Porfeli, 2015) to the Brazilian context and provides initial validity evidence for this instrument by testing its internal structure and exploring its relationships with external variables (i.e., Big…
Descriptors: Foreign Countries, Measures (Individuals), Vocational Adjustment, Factor Structure
Ben Clarke; Marah Sutherland; Christian T. Doabler; Taylor Lesner; David Fainstein; Kelsey Nolan; Britt Landis; Derek Kosty – School Psychology Review, 2023
This study investigated the technical characteristics of four early measurement curriculum-based measures (EM-CBMs) designed to assess concepts related to linear measurement and iteration. The sample consisted of 221 first grade students. Data were collected at two time points approximately 10 weeks apart. Reliability and concurrent and predictive…
Descriptors: Test Construction, Screening Tests, Curriculum Based Assessment, Grade 1
Murat Aygün; Sait Çüm – International Journal of Assessment Tools in Education, 2023
Consuming sports products and services incessantly without being able to restrain oneself is characterized as compulsive sports consumption. The aim of this study is to adapt the Compulsive Sport Consumption Scale (CSCS) developed in English by Aiken et al. (2018) into Turkish utilizing a scientific scale adaptation process. The CSCS consists of…
Descriptors: Foreign Countries, Turkish, Translation, Test Construction
Zyxcban G. Wolfs; Saskia Brand-Gruwel; Henny P. A. Boshuizen – SAGE Open, 2023
The objective of this study was to develop and validate an instrument measuring the perception and interpretation of several distinct musical features (pitch, tonality, timing, loudness, and timbre). Therefore, we developed the Implicit Tonal Ability Test (ITAT), a listening test containing 49 multiple-choice items. A total of 233 children aged 6…
Descriptors: Elementary School Students, Test Validity, Test Reliability, Age Differences
D. Steger; S. Weiss; O. Wilhelm – Creativity Research Journal, 2023
Creativity can be measured with a variety of methods including self-reports, others reports, and ability tests. While typical self-reports are best understood as weak proxies of creativity, biographical reports that assess previous creative activities seem more promising. Drawbacks of such measures -- including skewed item distributions, a lack of…
Descriptors: Creativity, Creativity Tests, Test Construction, Algorithms
María Pilar Aparicio-Flores; Rosa Pilar Esteve-Faubel; Aitana Fernández-Sogorb; Carolina Gonzálvez – Education and Information Technologies, 2025
The use of Information and Communication Technologies (ICT) has been increasing in education. Despite its benefits, not everyone perceives its use with the same ease. This raises the need to observe the perceived ease of use (PEOU) of ICT among future teachers, which requires a valid and reliable instrument to measure this variable for the Spanish…
Descriptors: Spanish, Test Validity, Measures (Individuals), Usability
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Yue Huang; Joshua Wilson – Journal of Computer Assisted Learning, 2025
Background: Automated writing evaluation (AWE) systems, used as formative assessment tools in writing classrooms, are promising for enhancing instruction and improving student performance. Although meta-analytic evidence supports AWE's effectiveness in various contexts, research on its effectiveness in the U.S. K-12 setting has lagged behind its…
Descriptors: Writing Evaluation, Writing Skills, Writing Tests, Writing Instruction

Peer reviewed
Direct link
