Bentley, Andrew P. K.; Petcovic, Heather L.; Cassidy, David P. – Environmental Education Research, 2019
Individuals are exposed to misleading or outright false anthropogenic climate change (ACC) information. The goals of this study are to identify ACC dissenter messages, and to develop an instrument that quantifies the extent to which individuals agree with these messages. The instrument was developed using a sequential mixed methods design. A…
Descriptors: Climate, Likert Scales, Test Validity, Test Reliability
Caretta, Martina Angela; Pérez, María Alejandra – Field Methods, 2019
Transactional validity, a common approach in participatory research, is attained when preliminary analyses of research results are discussed with research participants and their feedback is incorporated in the analysis. Member checking is one way of achieving transactional validity, which has been heralded as a stronger version of validity reached…
Descriptors: Participatory Research, Validity, Reliability, Conflict
Maxwell, Mary; Gleason, Jim – International Journal of Mathematical Education in Science and Technology, 2019
Many large universities, community colleges and some smaller four-year colleges are turning to hybrid or online instruction for remedial and entry level mathematics courses, often assessed using online exams in a proctored computer lab environment. Faculty face the task of choosing questions from a publisher's text bank with very little, if any,…
Descriptors: Item Response Theory, Test Reliability, Item Banks, Algebra
Hoekstra, R.; Vugteveen, J.; Warrens, M. J.; Kruyen, P. M. – International Journal of Social Research Methodology, 2019
Cronbach's alpha is the most frequently used measure to investigate the reliability of measurement instruments. Despite its frequent use, many warn against misinterpretations of alpha. These claims about regular misunderstandings, however, are not based on empirical data. To understand how common such beliefs are, we conducted a survey study to test…
Descriptors: Statistical Analysis, Researchers, Beliefs, Knowledge Level
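For readers unfamiliar with the coefficient named in the abstract above, the textbook formula for Cronbach's alpha is alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), where k is the number of items. The following is a minimal sketch of that standard formula, not code from the cited study; the function name `cronbach_alpha` and the input layout (one list of item scores per respondent) are assumptions made for illustration.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Textbook Cronbach's alpha (illustrative sketch, not from the cited study).

    scores: list of respondents, each a list of k numeric item scores.
    """
    if len(scores) < 2:
        raise ValueError("need at least two respondents")
    k = len(scores[0])
    if k < 2 or any(len(row) != k for row in scores):
        raise ValueError("every respondent needs the same k >= 2 item scores")

    # Sample variance of each item (column) across respondents.
    item_vars = [variance(col) for col in zip(*scores)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance; alpha is undefined")

    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical data: two items that rise together for three respondents.
print(cronbach_alpha([[1, 2], [2, 3], [3, 4]]))
```

In this made-up example the two items are perfectly correlated with equal variances, so the coefficient reaches its upper bound of 1.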
De Raadt, Alexandra; Warrens, Matthijs J.; Bosker, Roel J.; Kiers, Henk A. L. – Educational and Psychological Measurement, 2019
Cohen's kappa coefficient is commonly used for assessing agreement between classifications of two raters on a nominal scale. Three variants of Cohen's kappa that can handle missing data are presented. Data are considered missing if one or both ratings of a unit are missing. We study how well the variants estimate the kappa value for complete data…
Descriptors: Interrater Reliability, Data, Statistical Analysis, Statistical Bias
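As background for the abstract above, the standard Cohen's kappa for complete data is kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed proportion of agreement and p_e is the agreement expected by chance from each rater's marginal category frequencies. The sketch below covers only this complete-data case. It does not implement the three missing-data variants the article presents, and the function name `cohen_kappa` and the example ratings are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Standard Cohen's kappa for two raters on a nominal scale.

    Illustrative sketch for complete data only (no missing ratings).
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must rate the same non-empty set of units")
    n = len(rater_a)

    # Observed agreement: share of units given the same category by both raters.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement from the product of the raters' marginal proportions.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    p_e = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if p_e == 1:
        raise ValueError("kappa is undefined when chance agreement equals 1")

    return (p_o - p_e) / (1 - p_e)


# Hypothetical ratings of four units into categories "y" and "n".
print(cohen_kappa(["y", "y", "n", "n"], ["y", "n", "n", "n"]))
```

For these made-up ratings, p_o = 0.75 and p_e = 0.5, which gives kappa = 0.5.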
Bais, Frank; Schouten, Barry; Lugtig, Peter; Toepoel, Vera; Arends-Tòth, Judit; Douhou, Salima; Kieruj, Natalia; Morren, Mattijn; Vis, Corrie – Sociological Methods & Research, 2019
Item characteristics can have a significant effect on survey data quality and may be associated with measurement error. Literature on data quality and measurement error is often inconclusive. This could be because item characteristics used for detecting measurement error are not coded unambiguously. In our study, we use a systematic coding…
Descriptors: Foreign Countries, National Surveys, Error of Measurement, Test Items
Schack, Edna O.; Dueber, David; Thomas, Jonathan Norris; Fisher, Molly H.; Jong, Cindy – AERA Online Paper Repository, 2019
Scoring of teachers' noticing responses is typically burdened with rater bias and reliance upon interrater consensus. The authors sought to make the scoring process more objective, equitable, and generalizable. The development process began with a description of response characteristics for each professional noticing component disconnected from…
Descriptors: Models, Teacher Evaluation, Observation, Bias
Esposito, Giovanna; Marôco, João; Passeggia, Raffaella; Pepicelli, Giuliana; Freda, Maria Francesca – European Journal of Higher Education, 2022
Student Engagement (SE) refers to the extent to which a student participates in academic and non-academic activities, invests in and commits to learning, belonging and identification with the educational institution. Despite the relevance of SE for students' success, few valid and reliable instruments have been developed. This study presents the…
Descriptors: Learner Engagement, College Students, Foreign Countries, Psychology
Sovey, Saralah; Osman, Kamisah; Matore, Mohd Effendi Ewan Mohd – EURASIA Journal of Mathematics, Science and Technology Education, 2022
Computational thinking is a strategy of thinking to tackle complex problems. There is a paucity of conceptualization and instruments that cogitate on computational thinking disposition and attitudes. This study reacts to these constraints by establishing an instrument to test computational thinking related dispositions and attitudes. The…
Descriptors: Item Response Theory, Computation, Thinking Skills, Secondary School Students
Tarhan, Nevzat; Tutgun Unal, Aylin – Turkish Online Journal of Educational Technology - TOJET, 2022
In this research, it is aimed to develop a series of scales to determine the changing values and behaviors of different generations in society today, where new media environments are diversifying day by day. In the study, which took into account the generation classification made with the focus of technological tools, generation X was considered…
Descriptors: Test Construction, Generational Differences, Mass Media Effects, Test Validity
Burgueño, Rafael; Calderón, Antonio; Sinelnikov, Oleg; Medina-Casaubón, Jesús – Measurement in Physical Education and Exercise Science, 2022
This research developed and psychometrically tested the Sport Education Scale, a measure of students' perceptions of the structural features of a Sport Education season. In the first study (N = 277 students), a pool of 28 items was developed, and an exploratory factor analysis found a 7-factor solution. In the second study (N = 656 students), a…
Descriptors: Test Construction, Test Validity, Physical Education, Attitude Measures
McLeod, Bryce D.; Sutherland, Kevin S.; Broda, Michael; Granger, Kristen L.; Cecilione, Jennifer; Cook, Clayton R.; Conroy, Maureen A.; Snyder, Patricia A.; Southam-Gerow, Michael A. – School Mental Health, 2022
Teacher-reported measures of treatment integrity (the extent to which prescribed practices are delivered as intended by teachers) have the potential to support efforts to evaluate and implement evidence-based interventions in early childhood settings. However, self-report treatment integrity measures have shown poor correspondence with…
Descriptors: Fidelity, Early Childhood Teachers, Intervention, Self Evaluation (Individuals)
Qian, Lu; Shao, Huan; Fang, Hui; Xiao, Ting; Ding, Ning; Sun, Bei; Gao, HuiYun; Tang, Min; Ye, Mei; Ke, XiaoYan; O'Neill, Daniela K. – International Journal of Language & Communication Disorders, 2022
Background: Pragmatics has generally been defined as the ability to use language in social situations; it is commonly regarded as the third major component of language ability. To date, there is no tool for assessing early pragmatic development of Chinese-speaking children. Aims: To describe the translation of the Language Use Inventory (LUI) from…
Descriptors: Measures (Individuals), Language Usage, Pragmatics, Mandarin Chinese
Zhai, Tina; Bailey, Phoebe E.; Rogers, Kris D.; Kneebone, Ian I. – International Journal of Behavioral Development, 2022
This study investigated the psychometric properties of the Geriatric Anxiety Inventory (GAI) in younger adults. Participants were 212 younger adults age M = 22 (range = 17-53) years. They completed a demographic information questionnaire and self-report measures: the GAI, the Depression Anxiety Stress Scales (DASS), the Generalized Anxiety…
Descriptors: Anxiety, Psychometrics, Late Adolescents, Adults
van der Meer, Hedwig A.; Sheftel-Simanova, Irina; Kan, Cornelis C.; Trujillo, James P. – Journal of Autism and Developmental Disorders, 2022
The actions and feelings questionnaire (AFQ) provides a short, self-report measure of how well someone uses and understands visual communicative signals such as gestures. The objective of this study was to translate and cross-culturally adapt the AFQ into Dutch (AFQ-NL) and validate this new version in neurotypical and autistic populations.…
Descriptors: Questionnaires, Translation, Validity, Reliability

