Publication Date
| In 2026 | 0 |
| Since 2025 | 2142 |
| Since 2022 (last 5 years) | 12652 |
| Since 2017 (last 10 years) | 33777 |
| Since 2007 (last 20 years) | 68268 |
Descriptor
| Foreign Countries | 30502 |
| Test Validity | 21718 |
| Scores | 18245 |
| Academic Achievement | 16904 |
| Test Construction | 16724 |
| Test Reliability | 15006 |
| Achievement Tests | 14836 |
| Standardized Tests | 14707 |
| Comparative Analysis | 14429 |
| Elementary Secondary Education | 13033 |
| Language Tests | 12545 |
Audience
| Practitioners | 5033 |
| Teachers | 3390 |
| Researchers | 2630 |
| Policymakers | 1229 |
| Administrators | 976 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
Location
| Turkey | 2813 |
| Australia | 2425 |
| Canada | 2269 |
| California | 1851 |
| United States | 1725 |
| Texas | 1613 |
| China | 1577 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1120 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Susan Ramlo; Carrie Salmon; Yuan Xue – Journal of College Science Teaching, 2025
Research shows that there are multiple benefits to giving college students oral rather than written exams. However, studies that examine, describe, and differentiate how students view their oral exams were never found in a literature search. The purpose of this study was to use Q methodology [Q] to describe the divergent student views about taking…
Descriptors: Undergraduate Students, Science Instruction, Chemistry, Organic Chemistry
Sarah N. Shakir; Ashley M. Virabouth; Mallory M. Rice – American Biology Teacher, 2025
Exam anxiety has been well-documented to reduce student performance in undergraduate biology courses, especially for students from marginalized groups, which can contribute to achievement gaps. Our exploratory study surveyed 61 undergraduate biology students to better understand how exams affect their anxiety levels, focusing on the impact of exam…
Descriptors: Undergraduate Students, College Science, Biology, Student Attitudes
David A. Klingbeil; Alexander D. Latham; Jessica S. Kim; Madeline C. Schmitt – Psychology in the Schools, 2025
Several researchers have called for schools to interpret universal screening results using posterior probabilities. Following this recommendation could require schools to move away from direct-route, single-measure screening unless base rates of risk fall within a narrow range. In this descriptive study, we investigated two questions surrounding…
Descriptors: Reading Skills, Mathematics Skills, Screening Tests, Test Results
Murat Ermis; Safak Uluçinar Sagir – International Journal of Assessment Tools in Education, 2025
In this study, an attempt was made to develop a valid and reliable measurement tool to determine teachers' self-efficacy levels for teaching metacognitive listening strategies. The study group consisted of 205 teachers for EFA and 248 teachers for CFA. As a result of the analyses, a scale consisting of 16 items with 4 factors was developed. It was…
Descriptors: Test Validity, Test Reliability, Metacognition, Listening Skills
A Two-Tier Multiple-Choice Diagnostic Test to Find Student Misconceptions about the Change of Matter
Rita Arfi Astuti Ningroom; Sri Yamtinah; Riyadi – Journal of Education and Learning (EduLearn), 2025
There are a lot of very interesting scientific concepts to learn in natural and social science. The initial concepts that the student possesses may contradict the actual concepts, which is what causes misconceptions. Misconceptions are identified using misconception detection test tools. In fact, the development of the use of diagnostic test…
Descriptors: Foreign Countries, Test Construction, Diagnostic Tests, Multiple Choice Tests
Antonio García-Vinuesa; José Gutiérrez-Pérez; Pablo Ángel Meira-Cartea; José Antonio Caride-Gómez – International Research in Geographical and Environmental Education, 2025
Considering the crucial role of education in offering mitigation and adaptation strategies for climate change, there is a clear need for objective tools to assess its impact on the understanding of the issue among secondary school students. This paper describes the methodological design used to build and validate an instrument that explores…
Descriptors: Foreign Countries, Test Construction, Test Validity, Climate
Nicolae Florian – Acta Didactica Napocensia, 2025
In this paper we will explore an application, made by us, that can generate physics grid tests using artificial intelligence. The application analyzes the response from the large language model in the required format recognized by the application and writes it to a file that will be accepted by the test builder application. The application creates…
Descriptors: Test Construction, Science Tests, Physics, Artificial Intelligence
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
Paula Elosua – Language Assessment Quarterly, 2024
In sociolinguistic contexts where standardized languages coexist with regional dialects, the study of differential item functioning is a valuable tool for examining certain linguistic uses or varieties as threats to score validity. From an ecological perspective, this paper describes three stages in the study of differential item functioning…
Descriptors: Reading Tests, Reading Comprehension, Scores, Test Validity
B. Goecke; S. Weiss; B. Barbot – Journal of Creative Behavior, 2025
The present paper questions the content validity of the eight creativity-related self-report scales available in PISA 2022's context questionnaire and provides a set of considerations for researchers interested in using these indexes. Specifically, we point out some threats to the content validity of these scales (e.g., "creative thinking…
Descriptors: Creativity, Creativity Tests, Questionnaires, Content Validity
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
S. Kanageswari Suppiah Shanmugam; Arsaythamby Veloo; Suheysen Revindran – Practical Assessment, Research & Evaluation, 2025
Conventional mathematics testing often fails to reflect the diverse cultural backgrounds and lived experiences of Indigenous pupils. While efforts to improve educational access for Indigenous communities have increased, less emphasis has been placed on adapting test development processes to align with Indigenous learners' linguistic backgrounds…
Descriptors: Mathematics Tests, Cultural Relevance, Indigenous Populations, Minority Group Students
Maria Treadaway; John Read – Language Testing, 2024
Standard-setting is an essential component of test development, supporting the meaningfulness and appropriate interpretation of test scores. However, in the high-stakes testing environment of aviation, standard-setting studies are underexplored. To address this gap, we document two stages in the standard-setting procedures for the Overseas Flight…
Descriptors: Standard Setting, Diagnostic Tests, High Stakes Tests, English for Special Purposes
Kent Anderson Seidel – School Leadership Review, 2025
This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since developed in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…
Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention
Deniz Mertkan Gezgin; Tugba Türk Kurtça – Education and Information Technologies, 2025
The purpose of this research is to create a reliable and valid scale to assess AIlessphobia in Education (the fear of being without Artificial Intelligence in education) among university students. In three phases, a sample of 1378 undergraduate students from different faculties at a public university participated in the reliability and validity…
Descriptors: Test Construction, Fear, Artificial Intelligence, Psychometrics

