Rachel Bowns; Jordan E. Loeffelman; Douglas Steinley; Kenneth J. Sher – Journal of American College Health, 2024
Objective: To develop a shortened form of the Young Adult Alcohol Problems Screening Test[superscript 1] (YAAPST; original length = 27 items) using a novel combinatorial approach. Participants: 489 college freshmen, half of whom were above average risk for alcohol use disorder based upon family history, attending a large, Midwestern University…
Descriptors: Test Construction, Screening Tests, Young Adults, Drinking
Melissa Whatley; Dominique Foster; Stephen Paul – Journal of Studies in International Education, 2024
The purpose of this study was to develop a measurement instrument that scholars and practitioners in international education can use as a means of exploring whether and how individuals who come into contact with international education programs develop a greater sense of cultural humility. Specifically, the study described here outlines the four…
Descriptors: Foreign Students, Cultural Awareness, Consciousness Raising, Test Construction
Tom Benton – Research Matters, 2024
Educational assessment is used throughout the world for a range of different formative and summative purposes. Wherever an assessment is developed, whether by a teacher creating a quiz for their class, or by a testing company creating a high stakes assessment, it is necessary to decide how long the test should be. Specifically, how many questions…
Descriptors: Foreign Countries, High Stakes Tests, Test Length, Test Construction
Emily A. Holt; Jessica Duke; Ryan Dunk; Krystal Hinerman – Environmental Education Research, 2024
Student understanding of climate change is an active and growing area of research, but little research has documented undergraduate students' knowledge about the biotic impacts of climate change. Here, we address this literature gap by presenting the Inventory of Biotic Climate Literacy (IBCL), a concept inventory developed to assess undergraduate…
Descriptors: Climate, Undergraduate Students, Knowledge Level, Test Construction
Lisa DaVia Rubenstein; Kathrin Maki; Brianna Quigley; Shanyn Thompson; Lisa M. Ridgley Smith – AERA Online Paper Repository, 2024
The purpose of this systematic review was to survey available measures of creativity for PK-12 students, examining assessment characteristics and the reporting of psychometric properties. Using the PRISMA framework, we identified 42 unique articles with 48 assessments meeting our inclusion criteria. Then, two coders independently coded all articles using a…
Descriptors: Literature Reviews, Meta Analysis, Elementary Secondary Education, Creativity
Carrie L. Bonilla – Hispania, 2024
This article details the challenges and best practices of evaluating second language learners for placement into postsecondary Spanish language courses. The literature on testing for placement purposes in second language acquisition and language testing provides a great deal of insight, but language programs must make many decisions as well that…
Descriptors: Spanish, Language Tests, Placement Tests, Test Validity
Güntay Tasçi – Science Insights Education Frontiers, 2024
This study aimed to develop and validate a protein concept inventory (PCI) consisting of 25 multiple-choice (MC) questions to assess students' understanding of proteins, a fundamental concept across biology disciplines. The development process of the PCI involved a literature review to identify protein-related content,…
Descriptors: Science Instruction, Science Tests, Multiple Choice Tests, Biology
Stefanie A. Wind; Yuan Ge – Measurement: Interdisciplinary Research and Perspectives, 2024
Mixed-format assessments made up of multiple-choice (MC) items and constructed response (CR) items that are scored using rater judgments include unique psychometric considerations. When these item types are combined to estimate examinee achievement, information about the psychometric quality of each component can depend on that of the other. For…
Descriptors: Interrater Reliability, Test Bias, Multiple Choice Tests, Responses
Anna Planas-Lladó; Xavier Úcar – American Journal of Evaluation, 2024
Empowerment is a concept that has become increasingly used over recent years. However, little research has been undertaken into how empowerment can be evaluated, particularly in the case of young people. The aim of this article is to present an inventory of dimensions and indicators of youth empowerment. The article describes the various phases in…
Descriptors: Youth, Empowerment, Test Construction, Test Validity
Fulya Merve Kos; Murat Bektas; Dijle Ayar – Psychology in the Schools, 2025
This study was conducted to develop a tool for measuring epilepsy self-management in teachers and to examine the psychometric properties of its Turkish form. This descriptive, comparative, correlational, and methodological study was conducted between May and August 2022 with 346 teachers between the ages of 24 and 67 working in public schools selected…
Descriptors: Test Construction, Psychometrics, Self Management, Epilepsy
Bomna Ko; Phillip Ward; Han Joo Lee; Yaohui He; Kelsey Higginson; Insook Kim – International Journal of Kinesiology in Higher Education, 2025
Developing valid and reliable instruments to assess common content knowledge (CCK) is a prerequisite for determining and improving the content knowledge of preservice teachers (PSTs) and teachers. We report on the development and psychometric analysis of an instrument for assessing PSTs' gymnastics CCK for secondary physical education teaching…
Descriptors: Athletics, Foreign Countries, Preservice Teachers, Test Validity
Tom Benton – Practical Assessment, Research & Evaluation, 2025
This paper proposes an extension of linear equating that may be useful in one of two fairly common assessment scenarios. One is where different students have taken different combinations of test forms. This might occur, for example, where students have some free choice over the exam papers they take within a particular qualification. In this…
Descriptors: Equated Scores, Test Format, Test Items, Computation
Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024
A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…
Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability
Agus Santoso; Heri Retnawati; Timbul Pardede; Ibnu Rafi; Munaya Nikma Rosyada; Gulzhaina K. Kassymova; Xu Wenxin – Practical Assessment, Research & Evaluation, 2024
The test blueprint is important in test development, where it guides the test item writer in creating test items according to the desired objectives and specifications or characteristics (so-called a priori item characteristics), such as the level of item difficulty in the category and the distribution of items based on their difficulty level.…
Descriptors: Foreign Countries, Undergraduate Students, Business English, Test Construction
Emma Walland – Research Matters, 2024
GCSE examinations (taken by students aged 16 years in England) are not intended to be speeded (i.e. to be partly a test of how quickly students can answer questions). However, there has been little research exploring this. The aim of this research was to explore the speededness of past GCSE written examinations, using only the data from scored…
Descriptors: Educational Change, Test Items, Item Analysis, Scoring