Publication Date
In 2025 | 2 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 11 |
Descriptor
Scores | 32 |
Test Items | 32 |
Test Use | 32 |
Test Construction | 11 |
Achievement Tests | 9 |
Test Validity | 9 |
Test Interpretation | 8 |
Elementary Secondary Education | 7 |
Test Results | 7 |
Testing Programs | 7 |
Standardized Tests | 6 |
More ▼ |
Source
Author
Thompson, Bruce | 2 |
Ackerman, Terry | 1 |
Armstrong, Anne-Marie | 1 |
Arnau, Randolph C. | 1 |
Arter, Judith A. | 1 |
B. Barbot | 1 |
B. Goecke | 1 |
Bassler, Otto C. | 1 |
Bauer, Scott C. | 1 |
Bowman, Harry L. | 1 |
Buser, Karen | 1 |
More ▼ |
Publication Type
Education Level
Secondary Education | 3 |
Higher Education | 2 |
Postsecondary Education | 2 |
Elementary Secondary Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Audience
Practitioners | 3 |
Community | 1 |
Parents | 1 |
Location
Alabama | 1 |
Australia | 1 |
Florida | 1 |
Hong Kong | 1 |
Indiana | 1 |
Kansas | 1 |
Massachusetts | 1 |
Michigan | 1 |
Minnesota | 1 |
New Jersey | 1 |
Ohio | 1 |
More ▼ |
Laws, Policies, & Programs
Comprehensive Education… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
B. Goecke; S. Weiss; B. Barbot – Journal of Creative Behavior, 2025
The present paper questions the content validity of the eight creativity-related self-report scales available in PISA 2022's context questionnaire and provides a set of considerations for researchers interested in using these indexes. Specifically, we point out some threats to the content validity of these scales (e.g., "creative thinking…
Descriptors: Creativity, Creativity Tests, Questionnaires, Content Validity
Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024
The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…
Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability
Selcuk Acar; Yuyang Shen – Journal of Creative Behavior, 2025
Creativity tests, like creativity itself, vary widely in their structure and use. These differences include instructions, test duration, environments, prompt and response modalities, and the structure of test items. A key factor is task structure, referring to the specificity of the number of responses requested for a given prompt. Classic…
Descriptors: Creativity, Creative Thinking, Creativity Tests, Task Analysis
Tatiana Chaiban; Zeinab Nahle; Ghaith Assi; Michelle Cherfane – Discover Education, 2024
Background: Since it was first launched, ChatGPT, a Large Language Model (LLM), has been widely used across different disciplines, particularly the medical field. Objective: The main aim of this review is to thoroughly assess the performance of the distinct version of ChatGPT in subspecialty written medical proficiency exams and the factors that…
Descriptors: Medical Education, Accuracy, Artificial Intelligence, Computer Software
Wenyue Ma – ProQuest LLC, 2023
Foreign language placement testing, an important component in university foreign language programs, has received considerable, but not copious, attention over the years in second language (L2) testing research (Norris, 2004), and it has been mostly concentrated on L2 English. In contrast to validation research on L2 English placement testing, the…
Descriptors: Second Language Learning, Chinese, Student Placement, Placement Tests
Isbell, Daniel R.; Son, Young-A – Studies in Second Language Acquisition, 2022
Elicited Imitation Tests (EITs) are commonly used in second language acquisition (SLA)/bilingualism research contexts to assess the general oral proficiency of study participants. While previous studies have provided valuable EIT construct-related validity evidence, some key gaps remain. This study uses an integrative data analysis to further…
Descriptors: Bilingualism, Imitation, Language Tests, Second Language Learning
Trace, Jonathan – Language Testing, 2020
Originally designed to measure reading and passage comprehension in L1 readers, cloze tests continue to be used for L2 assessment purposes. However, there remain disputes about whether or not cloze items can measure beyond local comprehension information, as well as whether or not they are purely a test of reading alone, or if performance can be…
Descriptors: Cloze Procedure, Second Language Learning, Reading Comprehension, Native Language
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content
Traynor, Anne – Educational Assessment, 2017
Variation in test performance among examinees from different regions or national jurisdictions is often partially attributed to differences in the degree of content correspondence between local school or training program curricula, and the test of interest. This posited relationship between test-curriculum correspondence, or "alignment,"…
Descriptors: Test Items, Test Construction, Alignment (Education), Curriculum
Jia, Yujie – ProQuest LLC, 2013
This study employed Bachman and Palmer's (2010) Assessment Use Argument framework to investigate to what extent the use of a second language oral test as an exit test in a Hong Kong university can be justified. It also aimed to help test developers of this oral test identify the most critical areas in the current test design that might need…
Descriptors: Test Use, Language Tests, Oral Language, Second Language Learning
Sireci, Stephen G. – Educational Researcher, 2007
Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…
Descriptors: Test Content, Test Validity, Guidelines, Test Items

DeConinck, James B.; And Others – Educational and Psychological Measurement, 1996
Using a multidimensional measure of pay satisfaction, the Pay Satisfaction Questionnaire (PSQ), this study assessed the discriminant validity between scores on a measure of distributive justice and the PSQ with 474 employees. Confirmatory factor analysis results indicate that items from both scales loaded on the hypothesized dimensions. (SLD)
Descriptors: Construct Validity, Employees, Salaries, Satisfaction

Reise, Steven P.; Flannery, Wm. Peter – Applied Measurement in Education, 1996
Statistical and theoretical issues that arise from assessing person-fit on measures of typical performance are discussed, including the frequent attenuation of detection of person-misfit, the need for methods of identifying sources of response aberrancy, and person-fit measures as moderators of trait-criterion relations. (SLD)
Descriptors: Item Response Theory, Measurement Techniques, Performance, Responses
Ackerman, Terry – 1994
The purpose of this paper is to demonstrate how graphical analyses can enhance the interpretation and understanding of multidimensional item-response theory (IRT) analyses. Conceptually many of the unidimensional IRT concepts such as item characteristic curves, information, etc., can be extended to multiple dimensions. However, as the…
Descriptors: Ability, Achievement Tests, Educational Assessment, Item Response Theory

Thompson, Bruce; And Others – 1997
This study was conducted to investigate the construct validity of scores on the Personal Preferences Self-Description Questionnaire (PPSDQ), a measure of Jungian types. Confirmatory factor analysis methods were used to investigate the structures underlying PPSDQ responses of 641 university students. The model fit statistics were generally…
Descriptors: College Students, Construct Validity, Goodness of Fit, Higher Education