Publication Date
In 2025 | 7 |
Since 2024 | 16 |
Since 2021 (last 5 years) | 38 |
Since 2016 (last 10 years) | 122 |
Since 2006 (last 20 years) | 310 |
Descriptor
Item Analysis | 581 |
Test Reliability | 581 |
Test Validity | 529 |
Test Construction | 269 |
Foreign Countries | 163 |
Factor Analysis | 158 |
Test Items | 151 |
Psychometrics | 134 |
Correlation | 78 |
Statistical Analysis | 70 |
Questionnaires | 68 |
More ▼ |
Source
Author
Erford, Bradley T. | 7 |
Dedrick, Robert F. | 4 |
Ferron, John | 4 |
Shaunessy-Dedrick, Elizabeth | 4 |
Suldo, Shannon M. | 4 |
Haladyna, Tom | 3 |
Whitney, Douglas R. | 3 |
Aaronson, May | 2 |
Abell, Neil | 2 |
Bakioglu, Fuad | 2 |
Bichi, Ado Abdu | 2 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 17 |
Practitioners | 11 |
Teachers | 6 |
Students | 2 |
Administrators | 1 |
Counselors | 1 |
Laws, Policies, & Programs
Individuals with Disabilities… | 4 |
No Child Left Behind Act 2001 | 4 |
Assessments and Surveys
What Works Clearinghouse Rating
Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023
Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attribute. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…
Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity
Güntay Tasçi – Science Insights Education Frontiers, 2024
The present study has aimed to develop and validate a protein concept inventory (PCI) consisting of 25 multiple-choice (MC) questions to assess students' understanding of protein, which is a fundamental concept across different biology disciplines. The development process of the PCI involved a literature review to identify protein-related content,…
Descriptors: Science Instruction, Science Tests, Multiple Choice Tests, Biology
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Pablo Robles-García; Stuart McLean; Jeffrey Stewart; Ji-young Shin; Claudia Helena Sánchez-Gutiérrez – Language Assessment Quarterly, 2024
Recent literature in the field of L2 vocabulary assessment has advocated for the development of written receptive vocabulary tests such as Vocabulary Levels Tests (VLTs) that use: (a) meaning-recall item formats, (b) a minimum of 40 item counts per 1,000-frequency band to improve level estimates, and (c) lemmas (not word-families) as the lexical…
Descriptors: Spanish, Test Validity, Test Construction, Vocabulary Development
R. Noah Padgett – Practical Assessment, Research & Evaluation, 2023
The consistency of psychometric properties across waves of data collection provides valuable evidence that scores can be interpreted consistently. Evidence supporting the consistency of psychometric properties can come from using a longitudinal extension of item factor analysis to account for the lack of independence of observation when evaluating…
Descriptors: Psychometrics, Factor Analysis, Item Analysis, Validity
Durak, Ismail; Karagoz, Yalcin – International Journal of Assessment Tools in Education, 2021
The aim of this study is to adapt the Statistics Anxiety Scale (SAS) developed by Vigil-Colet et al. (2008) to Turkish. This study is expected to fill an important gap in the literature since no valid and reliable specific statistics anxiety scale developed or adapted in Turkish for undergraduate students in the literature is available. The sample…
Descriptors: Foreign Countries, Affective Measures, Statistics, Mathematics Anxiety
Kent Anderson Seidel – School Leadership Review, 2025
This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since developed in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…
Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention
Mahdi Ghorbankhani; Keyvan Salehi – SAGE Open, 2025
Academic procrastination, the tendency to delay academic tasks without reasonable justification, has significant implications for students' academic performance and overall well-being. To measure this construct, numerous scales have been developed, among which the Academic Procrastination Scale (APS) has shown promise in assessing academic…
Descriptors: Psychometrics, Measures (Individuals), Time Management, Foreign Countries
Gilber Chura-Quispe; Cristina Beatriz Flores-Rosado; Alex Alfredo Valenzuela-Romero; Enlil Iván Herrera-Pérez; Avenilda Eufemia Herrera-Chura; Mercedes Alejandrina Collazos Alarcón – Contemporary Educational Technology, 2025
Information literacy is a fundamental component in the academic development of future professionals. The aim of the study was to evaluate the metric properties of the 'questionnaire of self-perceived information competences', analyzing the factorial structure, internal consistency, convergent validity, factorial invariance according to gender and…
Descriptors: Information Literacy, College Students, Student Attitudes, Foreign Countries
Nazli Uygun Emil – ProQuest LLC, 2020
Validity of a measurement refers to appropriate test score meanings, uses, and interpretations (Messick, 1989; Kane, 1992). There are different approaches to validity: an evidentiary aspect of validity is one requiring gathering statistical evidence to evaluate test score meaning. A common approach to validation is comparisons of test score equity…
Descriptors: Educational Quality, Mathematics Tests, Test Validity, Test Reliability
Achmad Rante Suparman; Eli Rohaeti; Sri Wening – Journal on Efficiency and Responsibility in Education and Science, 2024
This study focuses on developing a five-tier chemical diagnostic test based on a computer-based test with 11 assessment categories with an assessment score from 0 to 10. A total of 20 items produced were validated by education experts, material experts, measurement experts, and media experts, and an average index of the Aiken test > 0.70 was…
Descriptors: Chemistry, Diagnostic Tests, Computer Assisted Testing, Credits
Dogan, Fatma; Aydin, Hasan – International Journal of Educational Reform, 2019
Applicability of multilingual education, which is applied in many countries, has increasingly proficiency and learning been a question of debate in Turkey because of the inclusion of living languages and dialects lessons into educational institutions. The purpose of this study is to develop a valid and reliable Likert-type scale to determine the…
Descriptors: Foreign Countries, Bilingual Education, Multilingualism, Test Construction
Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…
Descriptors: Item Response Theory, Psychometrics, Verbs, Naming
Shivam Kumar; Shridhar Patil; Anil Paswan; Swaraj Kumar Dutta; R. K. Sohane – Journal of Agricultural Education and Extension, 2024
Purpose: The study was aimed at measuring farmers' helpline services quality in India using a standardized multi-factor scale (HELPQUAL) developed as part of this study. Design/methodology/approach: The present study is based on 360 farmers' and 45 experts' responses gathered using telephonic interviews and mailed questionnaires during the year…
Descriptors: Agricultural Occupations, Help Seeking, Counseling Services, Rural Extension
Yalalem Assefa; Bekalu Tadesse Moges; Shouket Ahmad Tilwani – Journal of Applied Research in Higher Education, 2024
Purpose: Lifelong learning has become one of the most interesting areas of research. Hence, the current study was aimed at developing and validating a tool that helps to study how well people working in higher education institutions are engaged in lifelong learning. Design/methodology/approach: A review of theories in the literature and experts'…
Descriptors: Lifelong Learning, Measures (Individuals), Likert Scales, Test Construction