ERIC - Search Results

Showing 1 to 15 of 18 results

A Rasch-Based Validation of the University of Tehran English Proficiency Test (UTEPT)

Peer reviewed

Shadi Noroozi; Hossein Karami – Language Testing in Asia, 2024

Recently, psychometricians and researchers have voiced their concern over the exploration of language test items in light of Messick's validation framework. Validity has been central to test development and use; however, it has not received due attention in language tests having grave consequences for test takers. The present study sought to…

Descriptors: Foreign Countries, Doctoral Students, Graduate Students, Language Proficiency

New Developments in Measurement Invariance Testing: An Overview and Comparison of EFA-Based Approaches

Peer reviewed

Philipp Sterner; Kim De Roover; David Goretzko – Structural Equation Modeling: A Multidisciplinary Journal, 2025

When comparing relations and means of latent variables, it is important to establish measurement invariance (MI). Most methods to assess MI are based on confirmatory factor analysis (CFA). Recently, new methods have been developed based on exploratory factor analysis (EFA); most notably, as extensions of multi-group EFA, researchers introduced…

Descriptors: Error of Measurement, Measurement Techniques, Factor Analysis, Structural Equation Models

Building a Validity Argument for the TOEFL Junior® Tests. TOEFL® Research Report. RR-102. ETS RR-24-05

Peer reviewed

Ching-Ni Hsieh – ETS Research Report Series, 2024

The TOEFL Junior® tests are designed to evaluate young language students' English reading, listening, speaking, and writing skills in an English-medium secondary instructional context. This paper articulates a validity argument constructed to support the use and interpretation of the TOEFL Junior test scores for the purpose of placement, progress…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

A Historic Review and Empirical Revitalization of the Stages of Concern Questionnaire

Peer reviewed

Kent Anderson Seidel – School Leadership Review, 2025

This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since developed in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…

Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention

Does Timed Testing Affect the Interpretation of Efficiency Scores?--A GLMM Analysis of Reading Components

Peer reviewed

Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024

The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…

Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation

A Note on the Use of Categorical Subscores

Peer reviewed

Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025

Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…

Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment

Raters' Scoring Process in Assessment of Interpreting: An Empirical Study Based on Eye Tracking and Retrospective Verbalisation

Peer reviewed

Chao Han; Binghan Zheng; Mingqing Xie; Shirong Chen – Interpreter and Translator Trainer, 2024

Human raters' assessment of interpreting is a complex process. Previous researchers have mainly relied on verbal reports to examine this process. To advance our understanding, we conducted an empirical study, collecting raters' eye-movement and retrospection data in a computerised interpreting assessment in which three groups of raters (n = 35)…

Descriptors: Foreign Countries, College Students, College Graduates, Interrater Reliability

The Broad Autism Phenotype--International Test (BAP-IT): A Two-Domain-Based Test for the Assessment of the Broad Autism Phenotype

Peer reviewed

Marta Godoy-Giménez; Ángel García-Pérez; Fernando Cañadas; Angeles F. Estévez; Pablo Sayans-Jiménez – Autism: The International Journal of Research and Practice, 2024

The broad autism phenotype is the phenotypic expression of the primary characteristics of autism. However, currently available tests do not agree with the two-domain operationalization of broad autism phenotype or autism, and their internal structure has shown instability across applications. This study presents the Broad Autism…

Descriptors: Autism Spectrum Disorders, Genetics, Diagnostic Tests, Foreign Countries

Development of the Quantitative Modelling Observation Protocol (QMOP) for Undergraduate Biology Courses: Validity Evidence for Score Interpretation and Uses

Peer reviewed

Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025

Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…

Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques

Separation of Traits and Extreme Response Style in IRTree Models: The Role of Mimicry Effects for the Meaningful Interpretation of Estimates

Peer reviewed

Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024

Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…

Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity

Content Validity of Creativity Self-Report Questionnaires from PISA 2022

Peer reviewed

B. Goecke; S. Weiss; B. Barbot – Journal of Creative Behavior, 2025

The present paper questions the content validity of the eight creativity-related self-report scales available in PISA 2022's context questionnaire and provides a set of considerations for researchers interested in using these indexes. Specifically, we point out some threats to the content validity of these scales (e.g., "creative thinking…

Descriptors: Creativity, Creativity Tests, Questionnaires, Content Validity

Re-Examining Measurement Invariance of School Climate Surveys across Race/Ethnicity

Peer reviewed

Stephen M. Leach; Jason C. Immekus; Jeffrey C. Valentine; Prathiba Batley; Dena Dossett; Tamara Lewis; Thomas Reece – Assessment for Effective Intervention, 2025

Educators commonly use school climate survey scores to inform and evaluate interventions for equitably improving learning and reducing educational disparities. Unfortunately, validity evidence to support these (and other) score uses often falls short. In response, Whitehouse et al. proposed a collaborative, two-part validity testing framework for…

Descriptors: School Surveys, Measurement, Hierarchical Linear Modeling, Educational Environment

Calibrating Items Using an Unfolding Model of Item Response Theory: The Case of the Trait Personality Questionnaire 5 (TPQue5)

Peer reviewed

Eirini M. Mitropoulou; Leonidas A. Zampetakis; Ioannis Tsaousis – Evaluation Review, 2024

Unfolding item response theory (IRT) models are important alternatives to dominance IRT models in describing the response processes on self-report tests. Their usage is common in personality measures, since they indicate potential differentiations in test score interpretation. This paper aims to gain a better insight into the structure of trait…

Descriptors: Foreign Countries, Adults, Item Response Theory, Personality Traits

NAEP Achievement Levels Validity Argument Report

Anne H. Davidson – National Assessment Governing Board, 2025

The purpose of this National Assessment of Educational Progress (NAEP) Achievement Levels Validity Argument Report is to synthesize evidence currently available to address the validity of the interpretations and uses of the NAEP Achievement Levels. Validity is the extent to which theory and evidence supports or refutes proposed and enacted test…

Descriptors: National Competency Tests, Academic Achievement, Test Validity, College Entrance Examinations

Developing a Whole Child School Screening Instrument: Evaluating Perceived Usability as an Initial Step in Planning for Consequential Validity

Peer reviewed

Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – School Mental Health, 2024

We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…

Descriptors: Screening Tests, Psychometrics, Validity, Child Development
