Publication Date
| In 2026 | 8 |
| Since 2025 | 2276 |
| Since 2022 (last 5 years) | 12791 |
| Since 2017 (last 10 years) | 33916 |
| Since 2007 (last 20 years) | 68407 |
Descriptor
| Foreign Countries | 30560 |
| Test Validity | 21743 |
| Scores | 18256 |
| Academic Achievement | 16928 |
| Test Construction | 16756 |
| Test Reliability | 15028 |
| Achievement Tests | 14859 |
| Standardized Tests | 14720 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13042 |
| Language Tests | 12551 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3393 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 978 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2822 |
| Australia | 2426 |
| Canada | 2270 |
| California | 1854 |
| United States | 1726 |
| Texas | 1615 |
| China | 1578 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1122 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Shanshan He; Anne-Marie Sénécal; Laura Stansfield; Ruslan Suvorov – Language Testing, 2025
Test preparation has garnered considerable attention in second language (L2) education due to the significant implications that successful performance on a language test may have for academic advancement, future career opportunities, and immigration prospects. Meanwhile, an overemphasis on test preparation has been criticized for encouraging the…
Descriptors: Literature Reviews, Second Language Learning, Language Tests, Study Habits
Atakan Yalcin; Cennet Sanli; Adnan Pinar – Journal of Theoretical Educational Science, 2025
This study aimed to develop a test to measure university students' spatial thinking skills. The research was conducted using a survey design, with a sample of 260 undergraduate students from geography teaching and geography departments. GIS software was used to incorporate maps and satellite images, enhancing the spatial representation in the…
Descriptors: Spatial Ability, Thinking Skills, Geography, Undergraduate Students
Sukru Murat Cebeci; Selcuk Acar – Journal of Creative Behavior, 2025
This study presents the Cebeci Test of Creativity (CTC), a novel computerized assessment tool designed to address the limitations of traditional open-ended paper-and-pencil creativity tests. The CTC is designed to overcome the challenges associated with the administration and manual scoring of traditional paper and pencil creativity tests. In this…
Descriptors: Creativity, Creativity Tests, Test Construction, Test Validity
Kimin Chung; Soohwan Kim; Yeonju Jang; Seongyune Choi; Hyeoncheol Kim – Education and Information Technologies, 2025
As artificial intelligence(AI) is utilised throughout society, the need to improve AI literacy as an essential competency, not only for specific experts but also for general citizens, is increasing. Therefore, several studies are being conducted on AI education, and attempts are being made to introduce it into the regular education curriculum.…
Descriptors: Artificial Intelligence, Technological Literacy, Diagnostic Tests, Elementary School Students
Mümüne Merve Parlak; Özlem Bizpinar Munis; Aysen Köse; Cansu Yildirim; Cemil Arcan Ülker – International Journal of Language & Communication Disorders, 2025
Background: Addenbrooke's Cognitive Examination III (ACE-III) was developed as a screening tool for cognitive disorders. Many countries have proven the cultural adaptation, reliability and validity of ACE-III. Aims: To make cultural adaptations of ACE-III for the Turkish population and to examine its validity and reliability. Methods &…
Descriptors: Foreign Countries, Cognitive Tests, Translation, Turkish
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Raudlah Melinda Sidik; Ana Ratna Wulan; K. Kusnadi – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025
The research developed and validated EKSAI (Epistemic Knowledge Science Assessment Instrument), an assessment tool for epistemic knowledge in science education. The background is that 21st-century challenges demand a transformation in science education, with a focus on understanding how scientific knowledge is developed and evaluated, which is…
Descriptors: Science Tests, Knowledge Level, Biology, Test Validity
Yuriko K. Sosa Paredes; Björn Andersson – Educational Assessment, Evaluation and Accountability, 2025
In international large-scale assessments, student performance comparisons across educational systems are frequently done to assess the state and development in different domains. These results often have a large impact on educational policy and on the perceptions of an educational system's performance. Early assessments, such as the First and…
Descriptors: Test Interpretation, International Assessment, Science Tests, Scores
Shangchao Min; Kyoungwon Bishop – Language Testing, 2024
This paper evaluates the multistage adaptive test (MST) design of a large-scale academic language assessment (ACCESS) for Grades 1-12, with an aim to simplify the current MST design, using both operational and simulated test data. Study 1 explored the operational population data (1,456,287 test-takers) of the listening and reading tests of MST…
Descriptors: Adaptive Testing, Test Construction, Language Tests, English Language Learners
Alaa Eldin A. Ayoub; Muneera R. Ghablan; Eid G. Abo Hamza; Ahmed M. Abdulla Alabbasi – European Journal of STEM Education, 2025
This study describes the development of the science, technology, engineering, and mathematics (STEM) Scale, intended to assess parental attitudes toward school programs designed to deliver STEM, and evaluates its psychometric properties. The study group included 400 parents of students (138 males and 262 females) enrolled in STEM programs…
Descriptors: STEM Education, Test Construction, Parent Attitudes, Psychometrics
Jacqueline Raymond; David Wei Dai; Sue McAllister – Advances in Health Sciences Education, 2025
There is increasing interest in health professions education (HPE) in applying argument-based validity approaches, such as Kane's, to assessment design. The critical first step in employing Kane's approach is to specify the interpretation-use argument (IUA). However, in the HPE literature, this step is often poorly articulated. This article…
Descriptors: Allied Health Occupations Education, Test Interpretation, Test Construction, Inferences
Helen Zhang; Anthony Perry; Irene Lee – International Journal of Artificial Intelligence in Education, 2025
The rapid expansion of Artificial Intelligence (AI) in our society makes it urgent and necessary to develop young students' AI literacy so that they can become informed citizens and critical consumers of AI technology. Over the past decade many efforts have focused on developing curricular materials that make AI concepts accessible and engaging to…
Descriptors: Test Construction, Test Validity, Measures (Individuals), Artificial Intelligence
Hung-Yu Huang – Educational and Psychological Measurement, 2025
The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…
Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability
Jun-ichiro Yasuda; Michael M. Hull; Naohiro Mae; Kentaro Kojima – Physical Review Physics Education Research, 2025
Although conceptual assessment tests are commonly administered at the beginning and end of a semester, this pre-post approach has inherent limitations. Specifically, education researchers and instructors have limited ability to observe the progression of students' conceptual understanding throughout the course. Furthermore, instructors are limited…
Descriptors: Computer Assisted Testing, Adaptive Testing, Science Tests, Scientific Concepts
Filiz Arzu Yalin; Ahmet Özbay; Safak Oguz – European Journal of Education, 2025
This study developed and validated a Decision-Making Skill Test (DMST) for Turkish adolescents to address the lack of culturally appropriate assessment tools for multi-criteria decision-making skills. A cross-sectional design was employed with 427 participants aged 11-17 years from diverse socioeconomic backgrounds across Turkey. Following…
Descriptors: Test Construction, Test Validity, Student Evaluation, Decision Making

Peer reviewed
Direct link
