Publication Date
In 2025 | 6 |
Since 2024 | 16 |
Since 2021 (last 5 years) | 44 |
Since 2016 (last 10 years) | 147 |
Since 2006 (last 20 years) | 367 |
Descriptor
Item Analysis | 886 |
Test Reliability | 886 |
Test Validity | 525 |
Test Construction | 385 |
Test Items | 243 |
Factor Analysis | 198 |
Foreign Countries | 192 |
Psychometrics | 165 |
Correlation | 118 |
Statistical Analysis | 108 |
Higher Education | 99 |
Author
Erford, Bradley T. | 7 |
Ebel, Robert L. | 5 |
Benson, Jeri | 4 |
Dedrick, Robert F. | 4 |
Ferron, John | 4 |
Shaunessy-Dedrick, Elizabeth | 4 |
Suldo, Shannon M. | 4 |
Aiken, Lewis R. | 3 |
Bashaw, W. L. | 3 |
Brennan, Robert L. | 3 |
Cliff, Norman | 3 |
Audience
Researchers | 25 |
Practitioners | 16 |
Teachers | 8 |
Students | 2 |
Administrators | 1 |
Counselors | 1 |
Location
Turkey | 57 |
Canada | 15 |
India | 10 |
China | 8 |
Australia | 7 |
Iran | 7 |
Florida | 6 |
United States | 6 |
New York | 5 |
Nigeria | 5 |
Taiwan | 5 |
Laws, Policies, & Programs
Individuals with Disabilities… | 4 |
No Child Left Behind Act 2001 | 4 |
Elementary and Secondary… | 2 |
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
R. Noah Padgett – Practical Assessment, Research & Evaluation, 2023
The consistency of psychometric properties across waves of data collection provides valuable evidence that scores can be interpreted consistently. Evidence supporting the consistency of psychometric properties can come from using a longitudinal extension of item factor analysis to account for the lack of independence of observation when evaluating…
Descriptors: Psychometrics, Factor Analysis, Item Analysis, Validity
Mahdi Ghorbankhani; Keyvan Salehi – SAGE Open, 2025
Academic procrastination, the tendency to delay academic tasks without reasonable justification, has significant implications for students' academic performance and overall well-being. To measure this construct, numerous scales have been developed, among which the Academic Procrastination Scale (APS) has shown promise in assessing academic…
Descriptors: Psychometrics, Measures (Individuals), Time Management, Foreign Countries
Gilber Chura-Quispe; Cristina Beatriz Flores-Rosado; Alex Alfredo Valenzuela-Romero; Enlil Iván Herrera-Pérez; Avenilda Eufemia Herrera-Chura; Mercedes Alejandrina Collazos Alarcón – Contemporary Educational Technology, 2025
Information literacy is a fundamental component in the academic development of future professionals. The aim of the study was to evaluate the metric properties of the 'questionnaire of self-perceived information competences', analyzing the factorial structure, internal consistency, convergent validity, factorial invariance according to gender and…
Descriptors: Information Literacy, College Students, Student Attitudes, Foreign Countries
Achmad Rante Suparman; Eli Rohaeti; Sri Wening – Journal on Efficiency and Responsibility in Education and Science, 2024
This study focuses on developing a five-tier chemical diagnostic test based on a computer-based test with 11 assessment categories with an assessment score from 0 to 10. A total of 20 items produced were validated by education experts, material experts, measurement experts, and media experts, and an average index of the Aiken test > 0.70 was…
Descriptors: Chemistry, Diagnostic Tests, Computer Assisted Testing, Credits
Thompson, Kathryn N. – ProQuest LLC, 2023
It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…
Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores
Mehmet Kanik – International Journal of Assessment Tools in Education, 2024
ChatGPT has generated a surge of interest, leading people to explore its use in different tasks. However, before allowing it to replace humans, its capabilities should be investigated. As ChatGPT has potential for use in testing and assessment, this study aims to investigate the questions generated by ChatGPT by comparing them to those written by a course…
Descriptors: Artificial Intelligence, Testing, Multiple Choice Tests, Test Construction
Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…
Descriptors: Item Response Theory, Psychometrics, Verbs, Naming
Shivam Kumar; Shridhar Patil; Anil Paswan; Swaraj Kumar Dutta; R. K. Sohane – Journal of Agricultural Education and Extension, 2024
Purpose: The study was aimed at measuring farmers' helpline services quality in India using a standardized multi-factor scale (HELPQUAL) developed as part of this study. Design/methodology/approach: The present study is based on 360 farmers' and 45 experts' responses gathered using telephonic interviews and mailed questionnaires during the year…
Descriptors: Agricultural Occupations, Help Seeking, Counseling Services, Rural Extension
Yalalem Assefa; Bekalu Tadesse Moges; Shouket Ahmad Tilwani – Journal of Applied Research in Higher Education, 2024
Purpose: Lifelong learning has become one of the most interesting areas of research. Hence, the current study was aimed at developing and validating a tool that helps to study how well people working in higher education institutions are engaged in lifelong learning. Design/methodology/approach: A review of theories in the literature and experts'…
Descriptors: Lifelong Learning, Measures (Individuals), Likert Scales, Test Construction
Yu-Sheng Su; Xiao Wang; Li Zhao – IEEE Transactions on Education, 2024
Research Purpose and Contribution: The study aimed to construct an evaluation framework for assessing pupils' computational thinking (CT) during classroom learning problem solving. As a self-report evaluation scale for pupils, this evaluation framework further enriched the CT assessment instruments for pupils and provided a specialized instrument…
Descriptors: Computation, Thinking Skills, Student Evaluation, Evaluation Methods
Testing Anatomy: Dissecting Spatial and Non-Spatial Knowledge in Multiple-Choice Question Assessment
Julie Dickson; Darren J. Shaw; Andrew Gardiner; Susan Rhind – Anatomical Sciences Education, 2024
Limited research has been conducted on the spatial ability of veterinary students and how this is evaluated within anatomy assessments. This study describes the creation and evaluation of a split design multiple-choice question (MCQ) assessment (totaling 30 questions divided into 15 non-spatial MCQs and 15 spatial MCQs). Two cohorts were tested,…
Descriptors: Anatomy, Spatial Ability, Multiple Choice Tests, Factor Analysis
Do-Hong Kim; Chuang Wang; Thi Nhu Ngoc Truong – Language Teaching Research, 2024
Researchers and practitioners in the field of second language acquisition have come to realize the importance of non-cognitive skills such as self-efficacy and self-regulation in students' learning of a second language. However, there has been limited systematic research on such measures in the second language context and the validity and…
Descriptors: Psychometrics, Test Content, Self Efficacy, English Language Learners
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Castillo-Diaz, Marcio Alexander; Gomes, Cristiano Mauro Assis; Jelihovschi, Enio Galinkin – International Journal of Educational Methodology, 2022
The field of studies in metacognition points to some limitations in the way the construct has traditionally been measured and shows a near absence of performance-based tests. The Meta-Text is a performance-based test recently created to assess components of cognition regulation: planning, monitoring, and judgment. This study presents the first…
Descriptors: Schemata (Cognition), Decision Making, Undergraduate Students, Foreign Countries