Kaja Haugen; Cecilie Hamnes Carlsen; Christine Möller-Omrani – Language Awareness, 2025
This article presents the process of constructing and validating a test of metalinguistic awareness (MLA) for young school children (age 8-10). The test was developed between 2021 and 2023 as part of the MetaLearn research project, financed by The Research Council of Norway. The research team defines MLA as using metalinguistic knowledge at a…
Descriptors: Language Tests, Test Construction, Elementary School Students, Metalinguistics
Katrin Schuessler; Vanessa Fischer; Maik Walpuski – Instructional Science: An International Journal of the Learning Sciences, 2025
Cognitive load studies are mostly centered on information on perceived cognitive load. Single-item subjective rating scales are the dominant measurement practice to investigate overall cognitive load. Usually, either invested mental effort or perceived task difficulty is used as an overall cognitive load measure. However, the extent to which the…
Descriptors: Cognitive Processes, Difficulty Level, Rating Scales, Construct Validity
Shadi Noroozi; Hossein Karami – Language Testing in Asia, 2024
Recently, psychometricians and researchers have voiced their concern over the exploration of language test items in light of Messick's validation framework. Validity has been central to test development and use; however, it has not received due attention in language tests having grave consequences for test takers. The present study sought to…
Descriptors: Foreign Countries, Doctoral Students, Graduate Students, Language Proficiency
Leo, Francisco M.; Fernández-Río, Javier; Pulido, Juan J.; Rodríguez-González, Pablo; López-Gajardo, Miguel A. – Social Psychology of Education: An International Journal, 2023
The aim of this study was to develop and validate a psychometrically sound instrument to assess students' perceptions about class cohesion. Two studies were conducted. In Study 1, four steps were established: (1) development of the Class Cohesion Questionnaire (CCQ); (2) item selection; (3) item compression; and (4) exploration of psychometric…
Descriptors: Classroom Environment, Group Unity, Elementary School Students, Secondary School Students
Kanj, Rama; El-Hassan, Karma – International Journal of Multilingualism, 2023
Vocabulary tests administered to multilingual populations should take into account the unique linguistic and cultural makeup of the population by adopting test development methods that allow responses in several languages. Our aims were to develop a picture-naming test for multilingual Lebanese school-age children (L1: Lebanese, L2: French and/or…
Descriptors: Vocabulary Development, Language Tests, Expressive Language, Multilingualism
Durak, Ismail; Karagoz, Yalcin – International Journal of Assessment Tools in Education, 2021
The aim of this study is to adapt the Statistics Anxiety Scale (SAS) developed by Vigil-Colet et al. (2008) to Turkish. This study is expected to fill an important gap in the literature since no valid and reliable specific statistics anxiety scale developed or adapted in Turkish for undergraduate students in the literature is available. The sample…
Descriptors: Foreign Countries, Affective Measures, Statistics, Mathematics Anxiety
Koch, Marco; Spinath, Frank M.; Greiff, Samuel; Becker, Nicolas – Journal of Intelligence, 2022
Figural matrices tasks are one of the most prominent item formats used in intelligence tests, and their relevance for the assessment of cognitive abilities is unquestionable. However, despite endeavors of the open science movement to make scientific research accessible on all levels, there is a lack of royalty-free figural matrices tests. The Open…
Descriptors: Intelligence, Intelligence Tests, Computer Assisted Testing, Test Items
Çetin, Münevver; Karaokur Akdag, Seyma – Journal of Education and Learning, 2022
The aim of this study was to develop a scaling instrument for measuring organizational development level in the Turkish higher education context depending on perceptions of the faculty. The sample consisted of academicians of higher education institutions in the 2020-2021 academic year. Data were gathered in two stages. Exploratory Factor Analysis…
Descriptors: Organizational Development, Likert Scales, Measures (Individuals), Test Construction
Jones, Tiffany M.; Fleming, Charles; Beaver, Jessica; Anderson, Eric – Child & Youth Care Forum, 2023
Background: Schools are increasingly measuring school climate and social emotional learning, yet few measures conduct invariance testing by race, gender, or language despite known differences in perceptions of these constructs based on these identities. Objective: This study reports on the validation process of a school climate and social…
Descriptors: Public Schools, Student Surveys, Educational Environment, Social Emotional Learning
Piller, Aimee; Fletcher, Tina; Pfeiffer, Beth; Dunlap, Karen; Pickens, Noralyn – Assessment for Effective Intervention, 2019
The "Participation and Sensory Environment Questionnaire--Teacher Version" (PSEQ-TV) is a teacher report questionnaire designed to examine the impact of the sensory environment on participation for preschool children with autism spectrum disorder (ASD). This study examines the construct validity of the assessment through principal…
Descriptors: Construct Validity, Questionnaires, Measures (Individuals), Sensory Experience
Malone, Kathy L.; Boone, William J.; Stammen, Andria; Schuchardt, Anita; Ding, Lin; Sabree, Zakee – EURASIA Journal of Mathematics, Science and Technology Education, 2021
Instruments for assessing secondary students' conceptual understanding of core concepts in biology are needed by educational practitioners and researchers alike. Most instruments available for secondary biology (years 9 to 12) focus only on highly specific biological concepts instead of multiple core concepts. This study describes the development…
Descriptors: Measures (Individuals), Test Construction, Construct Validity, Test Reliability
Raykov, Tenko; Marcoulides, George A.; Dimitrov, Dimiter M.; Li, Tatyana – Educational and Psychological Measurement, 2018
This article extends the procedure outlined in the article by Raykov, Marcoulides, and Tong for testing congruence of latent constructs to the setting of binary items and clustering effects. In this widely used setting in contemporary educational and psychological research, the method can be used to examine if two or more homogeneous…
Descriptors: Tests, Psychometrics, Test Items, Construct Validity
Jean-Yves Bégin; Luc Touchette; Caroline Couture; Cassandre Blais – International Journal of Nurture in Education, 2020
The Boxall Profile provides a framework for the structured observation of children in nurture groups. It is a detailed and rigorously trialled normative diagnostic instrument developed for teachers and teaching assistants to measure children's levels of emotional and behavioural functioning. Moreover, it highlights specific targets for…
Descriptors: Psychometrics, French, Observation, Children
Shujuan Wang – ProQuest LLC, 2021
Existing methods used to validate self-report questionnaires in foreign language teaching effectiveness have relied on Classical Test Theory (CTT). However, the use of CTT approaches limits the reliability and validity of self-report instruments. The Rasch Model, which is based on the principles of objective measurement, addresses some of the…
Descriptors: Second Language Programs, Second Language Learning, Second Language Instruction, Language Tests
Tsai, Liang-Ting; Chang, Cheng-Chieh – Environmental Education Research, 2019
This study established a Chinese scale for measuring high school students' ocean literacy. This included testing its reliability, validity, and differential item functioning (DIF) with the aim of compensating for the lack of DIF tests focusing on current scales. The construct validity and reliability were verified and tested by analyzing the…
Descriptors: Foreign Countries, Measures (Individuals), Oceanography, Knowledge Level