Publication Date
In 2025 | 223 |
Since 2024 | 723 |
Since 2021 (last 5 years) | 2312 |
Since 2016 (last 10 years) | 4613 |
Since 2006 (last 20 years) | 6918 |
Descriptor
Test Reliability | 14781 |
Test Validity | 9789 |
Test Construction | 4261 |
Foreign Countries | 3670 |
Psychometrics | 2367 |
Factor Analysis | 2255 |
Measures (Individuals) | 1721 |
Evaluation Methods | 1401 |
Higher Education | 1384 |
Correlation | 1234 |
Questionnaires | 1234 |
Audience
Researchers | 453 |
Practitioners | 319 |
Teachers | 128 |
Administrators | 73 |
Policymakers | 33 |
Counselors | 31 |
Students | 17 |
Parents | 10 |
Community | 6 |
Support Staff | 5 |
Location
Turkey | 801 |
Australia | 237 |
Canada | 205 |
China | 195 |
Indonesia | 142 |
Spain | 124 |
United States | 121 |
United Kingdom | 119 |
Germany | 106 |
Taiwan | 103 |
Netherlands | 99 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 2 |
Meets WWC Standards with or without Reservations | 2 |
Does not meet standards | 1 |
Thomas W. Frazier; Andrew J. O. Whitehouse; Susan R. Leekam; Sarah J. Carrington; Gail A. Alvares; David W. Evans; Antonio Y. Hardan; Mirko Uljarevic – Journal of Autism and Developmental Disorders, 2024
Purpose: The aim of the present study was to compare scale and conditional reliability derived from item response theory analyses among the most commonly used, as well as several newly developed, observation, interview, and parent-report autism instruments. Methods: When available, data sets were combined to facilitate large sample evaluation.…
Descriptors: Test Reliability, Item Response Theory, Autism Spectrum Disorders, Clinical Diagnosis
Matthias Winfried Kleespies; Viktoria Feucht; Til Jonas Tille; Alina Miriam Bambach; Eva Gricar; Maximilian Claus; Michael Matthias Günther Konertz; Laura Kokott; Valentin Rupp; Valentin Bergmann; Volker Wenzel; Paul Wilhelm Dierkes – Measurement: Interdisciplinary Research and Perspectives, 2024
Human pro-environmental behavior in the private sphere is an important factor which influences nature and the environment and thus can contribute to the management of environmental problems. Although there are a variety of self-reported measurement tools for pro-environmental behavior, an established and validated measurement instrument for…
Descriptors: Ecology, Conservation (Environment), Test Construction, Behavior
Eren Can Aybek; Serkan Arikan; Günes Ertas – International Journal of Assessment Tools in Education, 2024
When it is required to estimate item parameters of a large item bank, Multiple Matrix Sampling (MMS) design provides an efficient way while minimizing the test burden on students. The current study exemplifies how to calibrate a large item pool using MMS design for various purposes, such as developing a CAT administration. The purpose of the…
Descriptors: Elementary School Mathematics, Elementary School Students, Grade 4, Item Banks
Chak Li; Meghan M. Burke; Julie Lounds Taylor; Leann S. DaWalt; Zachary Rossetti – Intellectual and Developmental Disabilities, 2024
Advocacy has long been heralded as a way to create change for individuals with intellectual and developmental disabilities (IDD) and their families. However, without an established measure, it is difficult to accurately characterize advocacy activities. Drawing from extant research, the Advocacy Activities Scale was developed to assess three…
Descriptors: Advocacy, Students with Disabilities, Intellectual Disability, Developmental Disabilities
Chia-Ying Chu; Pei-Hua Chen; Yi-Shin Tsai; Chieh-An Chen; Yi-Chih Chan; Yan-Jhe Ciou – Journal of Deaf Studies and Deaf Education, 2024
This study investigated the impact of language sample length on mean length of utterance (MLU) and aimed to determine the minimum number of utterances required for a reliable MLU. Conversations were collected from Mandarin-speaking, hard-of-hearing and typical-hearing children aged 16-81 months. The MLUs were calculated using sample sizes ranging…
Descriptors: Foreign Countries, Mandarin Chinese, Young Children, Language Acquisition
Abdulrahman Alshammari – ProQuest LLC, 2024
A critical component of modern software development practices, particularly continuous integration (CI), is the halt of development activities in response to test failures which requires further investigation and debugging. As software changes, regression testing becomes vital to verify that new code does not affect existing functionality.…
Descriptors: Computer Software, Programming, Coding, Test Reliability
Biru Chang; Jiajian Wang; Jun Cai – Journal of Career Development, 2024
The present study aims to translate the questionnaires of Parents' Attitudes toward Early Childhood Career Development (PAECCD) and Parents' Attitudes toward Vocational Education Implementation in Preschool Curriculum (PAVEIPC) into Chinese versions (PAECCD-C and PAVEIPC-C) and examines their reliability and validity through two studies. In Study…
Descriptors: Foreign Countries, Parent Attitudes, Preschool Children, Vocational Education
Allie Spencer Patterson; Thomas Brotherhood – Higher Education Quarterly, 2024
The purpose of this study was to develop and test the internal and external reliability of a novel research instrument which measures language support for international faculty members and its effects on integration. While previous research has focused on the contributions of international faculty and efforts to attract them, growing concerns…
Descriptors: Test Construction, Test Validity, Test Reliability, Foreign Workers
Shangchao Min; Kyoungwon Bishop – Language Testing, 2024
This paper evaluates the multistage adaptive test (MST) design of a large-scale academic language assessment (ACCESS) for Grades 1-12, with an aim to simplify the current MST design, using both operational and simulated test data. Study 1 explored the operational population data (1,456,287 test-takers) of the listening and reading tests of MST…
Descriptors: Adaptive Testing, Test Construction, Language Tests, English Language Learners
Yanxuan Qu; Sandip Sinharay – ETS Research Report Series, 2024
The goal of this paper is to find better ways to estimate the internal consistency reliability of scores on tests with a specific type of design that are often encountered in practice: tests with constructed-response items clustered into sections that are not parallel or tau-equivalent, and one of the sections has only one item. To estimate the…
Descriptors: Test Reliability, Essay Tests, Construct Validity, Error of Measurement
Badenes-Ribera, Laura; Duro-García, Carmen; López-Ibáñez, Carmen; Martí-Vilar, Manuel; Sánchez-Meca, Julio – International Journal of Behavioral Development, 2023
The Adult Prosocialness Behavior Scale (APBS) is most often used to measure adult prosociality. We conducted a reliability generalization meta-analysis to compute the average APBS reliability and examine the heterogeneity among reliability estimations and the influence of moderator variables. An exhaustive search identified 74 articles that…
Descriptors: Adults, Prosocial Behavior, Behavior Rating Scales, Test Reliability
Kalkbrenner, Michael T.; Ryan, Aimee F.; Hunt, Adam J.; Rahman, Samiah R. – Measurement and Evaluation in Counseling and Development, 2023
We conducted a psychometric synthesis of the internal consistency reliability and internal structure of scores on the English versions of the Patient Health Questionnaire-9 (PHQ-9) and Generalized Anxiety Disorder-7 (GAD-7) in publications between 2012 and 2022. Results supported acceptable-to-strong reliability and validity evidence of scores…
Descriptors: Psychometrics, Test Validity, Test Reliability, Questionnaires
Mucuk, Makbule Duran; Sahin, Ekrem Sedat – African Educational Research Journal, 2023
The aim of this study was to develop a valid and reliable measurement instrument to find out adolescents' relative deprivation levels and to determine the statistical characteristics of the instrument. The Relative Deprivation Scale-Adolescent Form was prepared and applied to 586 adolescents within the scope of the study. Exploratory Factor…
Descriptors: Test Construction, Disadvantaged, Psychometrics, Test Validity
Tsang, Winnie; Oliver, David; Triantafyllopoulou, Paraskevi – Journal of Applied Research in Intellectual Disabilities, 2023
Background: Adults with intellectual disabilities are a group at risk of developing dementia. In the absence of a cure for dementia, the emphasis of treatment is the promotion of quality of life (QoL). The aim of this review is to identify and describe QoL tools for people with intellectual disabilities and dementia. Method: A systematic review was…
Descriptors: Intellectual Disability, Adults, Dementia, Quality of Life
Menold, Natalja – Field Methods, 2023
While numerical bipolar rating scales may evoke positivity bias, little is known about the corresponding bias in verbal bipolar rating scales. The choice of verbalization of the middle category may lead to response bias, particularly if it is not in line with the scale polarity. Unipolar and bipolar seven-category rating scales in which the…
Descriptors: Rating Scales, Test Bias, Verbal Tests, Responses