ERIC - Search Results

Publication Date

In 2025	2
Since 2024	4
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	11

Descriptor

Scores	32
Test Items	32
Test Use	32
Test Construction	11
Achievement Tests	9
Test Validity	9
Test Interpretation	8
Elementary Secondary Education	7
Test Results	7
Testing Programs	7
Standardized Tests	6
Educational Assessment	5
Foreign Countries	5
Higher Education	5
Item Analysis	5
Item Response Theory	5
College Students	4
Evaluation Methods	4
Language Tests	4
Second Language Learning	4
State Programs	4
Test Content	4
Academic Standards	3
Educational Quality	3
Item Banks	3
More ▼

Source

Educational Measurement:…	3
Journal of Creative Behavior	2
ProQuest LLC	2
Applied Measurement in…	1
Center for Assessment and…	1
Discover Education	1
Education Policy Analysis…	1
Educational Assessment	1
Educational Researcher	1
Educational and Psychological…	1
Journal of Autism and…	1
Language Testing	1
New Meridian Corporation	1
Psychological Review	1
Studies in Second Language…	1
More ▼

Publication Type

Journal Articles	15
Reports - Research	11
Reports - Evaluative	7
Speeches/Meeting Papers	7
Guides - Non-Classroom	4
Information Analyses	3
Reports - Descriptive	3
Dissertations/Theses -…	2
Tests/Questionnaires	2
Dissertations/Theses -…	1
Guides - General	1
Numerical/Quantitative Data	1
Opinion Papers	1
More ▼

Education Level

Secondary Education	3
Higher Education	2
Postsecondary Education	2
Elementary Secondary Education	1
Junior High Schools	1
Middle Schools	1

Audience

Practitioners	3
Community	1
Parents	1

Location

Alabama	1
Australia	1
Florida	1
Hong Kong	1
Indiana	1
Kansas	1
Massachusetts	1
Michigan	1
Minnesota	1
New Jersey	1
Ohio	1
Oregon	1
Tennessee	1
Vermont	1
More ▼

Laws, Policies, & Programs

Comprehensive Education…

Assessments and Surveys

Program for International…	2
ACTFL Oral Proficiency…	1
Florida State Student…	1
National Assessment of…	1
National Teacher Examinations	1
North Carolina End of Course…	1
Pennsylvania Educational…	1
Raven Progressive Matrices	1
Remote Associates Test	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 32 results Save | Export

Content Validity of Creativity Self-Report Questionnaires from PISA 2022

Peer reviewed

Direct link

B. Goecke; S. Weiss; B. Barbot – Journal of Creative Behavior, 2025

The present paper questions the content validity of the eight creativity-related self-report scales available in PISA 2022's context questionnaire and provides a set of considerations for researchers interested in using these indexes. Specifically, we point out some threats to the content validity of these scales (e.g., "creative thinking…

Descriptors: Creativity, Creativity Tests, Questionnaires, Content Validity

The Social Shapes Test as a Self-Administered, Online Measure of Social Intelligence: Two Studies with Typically Developing Adults and Adults with Autism Spectrum Disorder

Peer reviewed

Direct link

Matt I. Brown; Patrick R. Heck; Christopher F. Chabris – Journal of Autism and Developmental Disorders, 2024

The Social Shapes Test (SST) is a measure of social intelligence which does not use human faces or rely on extensive verbal ability. The SST has shown promising validity among adults without autism spectrum disorder (ASD), but it is uncertain whether it is suitable for adults with ASD. We find measurement invariance between adults with (n = 229)…

Descriptors: Interpersonal Competence, Autism Spectrum Disorders, Emotional Intelligence, Verbal Ability

Along the Convergent-Divergent Continuum: The Role of Task Structure in the PISA Creative Thinking Assessment

Peer reviewed

Direct link

Selcuk Acar; Yuyang Shen – Journal of Creative Behavior, 2025

Creativity tests, like creativity itself, vary widely in their structure and use. These differences include instructions, test duration, environments, prompt and response modalities, and the structure of test items. A key factor is task structure, referring to the specificity of the number of responses requested for a given prompt. Classic…

Descriptors: Creativity, Creative Thinking, Creativity Tests, Task Analysis

The Intent of ChatGPT Usage and Its Robustness in Medical Proficiency Exams: A Systematic Review

Peer reviewed

Direct link

Tatiana Chaiban; Zeinab Nahle; Ghaith Assi; Michelle Cherfane – Discover Education, 2024

Background: Since it was first launched, ChatGPT, a Large Language Model (LLM), has been widely used across different disciplines, particularly the medical field. Objective: The main aim of this review is to thoroughly assess the performance of the distinct version of ChatGPT in subspecialty written medical proficiency exams and the factors that…

Descriptors: Medical Education, Accuracy, Artificial Intelligence, Computer Software

An Investigation into a Chinese Placement Test's Score Interpretations and Uses

Direct link

Wenyue Ma – ProQuest LLC, 2023

Foreign language placement testing, an important component in university foreign language programs, has received considerable, but not copious, attention over the years in second language (L2) testing research (Norris, 2004), and it has been mostly concentrated on L2 English. In contrast to validation research on L2 English placement testing, the…

Descriptors: Second Language Learning, Chinese, Student Placement, Placement Tests

Measurement Properties of a Standardized Elicited Imitation Test: An Integrative Data Analysis

Peer reviewed

Direct link

Isbell, Daniel R.; Son, Young-A – Studies in Second Language Acquisition, 2022

Elicited Imitation Tests (EITs) are commonly used in second language acquisition (SLA)/bilingualism research contexts to assess the general oral proficiency of study participants. While previous studies have provided valuable EIT construct-related validity evidence, some key gaps remain. This study uses an integrative data analysis to further…

Descriptors: Bilingualism, Imitation, Language Tests, Second Language Learning

Clozing the Gap: How Far Do Cloze Items Measure?

Peer reviewed

Direct link

Trace, Jonathan – Language Testing, 2020

Originally designed to measure reading and passage comprehension in L1 readers, cloze tests continue to be used for L2 assessment purposes. However, there remain disputes about whether or not cloze items can measure beyond local comprehension information, as well as whether or not they are purely a test of reading alone, or if performance can be…

Descriptors: Cloze Procedure, Second Language Learning, Reading Comprehension, Native Language

"Quality Testing Standards" -- A Starter Kit for States. Version 6.17.2020

Download full text

New Meridian Corporation, 2020

New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…

Descriptors: Testing, Standards, Comparative Analysis, Test Content

Does Test Item Performance Increase with Test-to-Standards Alignment?

Peer reviewed

Direct link

Traynor, Anne – Educational Assessment, 2017

Variation in test performance among examinees from different regions or national jurisdictions is often partially attributed to differences in the degree of content correspondence between local school or training program curricula, and the test of interest. This posited relationship between test-curriculum correspondence, or "alignment,"…

Descriptors: Test Items, Test Construction, Alignment (Education), Curriculum

Justifying the Use of a Second Language Oral Test as an Exit Test in Hong Kong: An Application of Assessment Use Argument Framework

Direct link

Jia, Yujie – ProQuest LLC, 2013

This study employed Bachman and Palmer's (2010) Assessment Use Argument framework to investigate to what extent the use of a second language oral test as an exit test in a Hong Kong university can be justified. It also aimed to help test developers of this oral test identify the most critical areas in the current test design that might need…

Descriptors: Test Use, Language Tests, Oral Language, Second Language Learning

On Validity Theory and Test Validation

Peer reviewed

Direct link

Sireci, Stephen G. – Educational Researcher, 2007

Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…

Descriptors: Test Content, Test Validity, Guidelines, Test Items

A Construct Validity Analysis of Scores on Measures of Distributive Justice and Pay Satisfaction.

Peer reviewed

DeConinck, James B.; And Others – Educational and Psychological Measurement, 1996

Using a multidimensional measure of pay satisfaction, the Pay Satisfaction Questionnaire (PSQ), this study assessed the discriminant validity between scores on a measure of distributive justice and the PSQ with 474 employees. Confirmatory factor analysis results indicate that items from both scales loaded on the hypothesized dimensions. (SLD)

Descriptors: Construct Validity, Employees, Salaries, Satisfaction

Assessing Person-Fit on Measures of Typical Performance.

Peer reviewed

Reise, Steven P.; Flannery, Wm. Peter – Applied Measurement in Education, 1996

Statistical and theoretical issues that arise from assessing person-fit on measures of typical performance are discussed, including the frequent attenuation of detection of person-misfit, the need for methods of identifying sources of response aberrancy, and person-fit measures as moderators of trait-criterion relations. (SLD)

Descriptors: Item Response Theory, Measurement Techniques, Performance, Responses

Graphical Representation of Multidimensional Item Response Theory Analyses.

Download full text

Ackerman, Terry – 1994

The purpose of this paper is to demonstrate how graphical analyses can enhance the interpretation and understanding of multidimensional item-response theory (IRT) analyses. Conceptually many of the unidimensional IRT concepts such as item characteristic curves, information, etc., can be extended to multiple dimensions. However, as the…

Descriptors: Ability, Achievement Tests, Educational Assessment, Item Response Theory

Structure of PPSDQ-93 Item "Parcels": Confirmatory and Other Analyses.

PDF pending restoration

Thompson, Bruce; And Others – 1997

This study was conducted to investigate the construct validity of scores on the Personal Preferences Self-Description Questionnaire (PPSDQ), a measure of Jungian types. Confirmatory factor analysis methods were used to investigate the structures underlying PPSDQ responses of 641 university students. The model fit statistics were generally…

Descriptors: College Students, Construct Validity, Goodness of Fit, Higher Education

Previous Page | Next Page »

Pages: 1 | 2 | 3

Thompson, Bruce	2
Ackerman, Terry	1
Armstrong, Anne-Marie	1
Arnau, Randolph C.	1
Arter, Judith A.	1
B. Barbot	1
B. Goecke	1
Bassler, Otto C.	1
Bauer, Scott C.	1
Bowman, Harry L.	1
Buser, Karen	1
Carpenter, Patricia A.	1
Caulkins, Thomas G.	1
Christopher F. Chabris	1
DeConinck, James B.	1
Dietel, Ron	1
Estes, Gary D.	1
Flannery, Wm. Peter	1
Ghaith Assi	1
Herndon, Enid B.	1
Hills, John R.	1
Hiscox, Michael D.	1
Huang, Zheng Sen	1
Isbell, Daniel R.	1
More ▼