Publication Date
In 2025 | 8 |
Since 2024 | 17 |
Since 2021 (last 5 years) | 37 |
Since 2016 (last 10 years) | 58 |
Since 2006 (last 20 years) | 101 |
Descriptor
Psychometrics | 105 |
Test Items | 105 |
Test Construction | 49 |
Foreign Countries | 46 |
Test Reliability | 44 |
College Students | 34 |
Test Validity | 34 |
Item Analysis | 29 |
Item Response Theory | 24 |
Factor Analysis | 23 |
Scores | 23 |
More ▼ |
Source
Author
Gierl, Mark J. | 4 |
Liu, Ou Lydia | 4 |
Baghaei, Purya | 3 |
Lai, Hollis | 2 |
Wang, Changjiang | 2 |
Acikgul, Kubra | 1 |
Adamu, Gishua Garba | 1 |
Afsharrad, Mohammad | 1 |
Ahmed Al - Badri | 1 |
Akram Riaz, Muhammad | 1 |
Ali, Usama S. | 1 |
More ▼ |
Publication Type
Education Level
Postsecondary Education | 105 |
Higher Education | 104 |
Secondary Education | 12 |
High Schools | 10 |
Adult Education | 1 |
Early Childhood Education | 1 |
Elementary Secondary Education | 1 |
Grade 9 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Audience
Teachers | 1 |
Location
Turkey | 9 |
Canada | 6 |
Germany | 6 |
Iran | 4 |
Indonesia | 2 |
Alabama | 1 |
Australia | 1 |
Bosnia and Herzegovina… | 1 |
China | 1 |
Colombia | 1 |
Colorado | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Yunting Liu; Shreya Bhandari; Zachary A. Pardos – British Journal of Educational Technology, 2025
Effective educational measurement relies heavily on the curation of well-designed item pools. However, item calibration is time consuming and costly, requiring a sufficient number of respondents to estimate the psychometric properties of items. In this study, we explore the potential of six different large language models (LLMs; GPT-3.5, GPT-4,…
Descriptors: Artificial Intelligence, Test Items, Psychometrics, Educational Assessment
Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025
This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…
Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests
Fu Chen; Ying Cui; Alina Lutsyk-King; Yizhu Gao; Xiaoxiao Liu; Maria Cutumisu; Jacqueline P. Leighton – Education and Information Technologies, 2024
Post-secondary data literacy education is critical to students' academic and career success. However, the literature has not adequately addressed the conceptualization and assessment of data literacy for post-secondary students. In this study, we introduced a novel digital performance-based assessment for teaching and evaluating post-secondary…
Descriptors: Performance Based Assessment, College Students, Information Literacy, Evaluation Methods
Emily A. Holt; Jessica Duke; Ryan Dunk; Krystal Hinerman – Environmental Education Research, 2024
Student understanding of climate change is an active and growing area of research, but little research has documented undergraduate students' knowledge about the biotic impacts of climate change. Here, we address this literature gap by presenting the Inventory of Biotic Climate Literacy (IBCL), a concept inventory developed to assess undergraduate…
Descriptors: Climate, Undergraduate Students, Knowledge Level, Test Construction
Mahdi Ghorbankhani; Keyvan Salehi – SAGE Open, 2025
Academic procrastination, the tendency to delay academic tasks without reasonable justification, has significant implications for students' academic performance and overall well-being. To measure this construct, numerous scales have been developed, among which the Academic Procrastination Scale (APS) has shown promise in assessing academic…
Descriptors: Psychometrics, Measures (Individuals), Time Management, Foreign Countries
Kamau Oginga Siwatu; Kara Page; Narges Hadi – College Teaching, 2024
The purpose of this article is to document the development of a new measure of teaching self-efficacy -- "The College Teaching Self-Efficacy (CTSE) Scale." We designed the CTSE scale to examine individuals' beliefs in their abilities to perform specific teaching tasks in a college classroom successfully. We developed an instrument that…
Descriptors: Self Efficacy, Beliefs, Psychometrics, Measures (Individuals)
Grace C. Tetschner; Sachin Nedungadi – Chemistry Education Research and Practice, 2025
Many undergraduate chemistry students hold alternate conceptions related to resonance--an important and fundamental topic of organic chemistry. To help address these alternate conceptions, an organic chemistry instructor could administer the resonance concept inventory (RCI), which is a multiple-choice assessment that was designed to identify…
Descriptors: Scientific Concepts, Concept Formation, Item Response Theory, Scores
Apichat Khamboonruang – Language Testing in Asia, 2025
Chulalongkorn University Language Institute (CULI) test was developed as a local standardised test of English for professional and international communication. To ensure that the CULI test fulfils its intended purposes, this study employed Kane's argument-based validation and Rasch measurement approaches to construct the validity argument for the…
Descriptors: Universities, Second Language Learning, Second Language Instruction, Language Tests
Dahl, Laura S.; Staples, B. Ashley; Mayhew, Matthew J.; Rockenbach, Alyssa N. – Innovative Higher Education, 2023
Surveys with rating scales are often used in higher education research to measure student learning and development, yet testing and reporting on the longitudinal psychometric properties of these instruments is rare. Rasch techniques allow scholars to map item difficulty and individual aptitude on the same linear, continuous scale to compare…
Descriptors: Surveys, Rating Scales, Higher Education, Educational Research
van Rijn, Peter W.; Attali, Yigal; Ali, Usama S. – Journal of Experimental Education, 2023
We investigated whether and to what extent different scoring instructions, timing conditions, and direct feedback affect performance and speed. An experimental study manipulating these factors was designed to address these research questions. According to the factorial design, participants were randomly assigned to one of twelve study conditions.…
Descriptors: Scoring, Time, Feedback (Response), Performance
Sen, Sedat – Creativity Research Journal, 2022
The purpose of this study was to estimate the overall reliability values for the scores produced by Runco Ideational Behavior Scale (RIBS) and explore the variability of RIBS score reliability across studies. To achieve this, a reliability generalization meta-analysis was carried out using the 86 Cronbach's alpha estimates obtained from 77 studies…
Descriptors: Generalization, Creativity, Meta Analysis, Higher Education
Acikgul, Kubra; Sad, Suleyman Nihat; Altay, Bilal – International Journal of Assessment Tools in Education, 2023
This study aimed to develop a useful test to measure university students' spatial abilities validly and reliably. Following a sequential explanatory mixed methods research design, first, qualitative methods were used to develop the trial items for the test; next, the psychometric properties of the test were analyzed through quantitative methods…
Descriptors: Spatial Ability, Scores, Multiple Choice Tests, Test Validity
Azwani Masuwai; Hafizhah Zulkifli; Mohd Isa Hamzah – Cogent Education, 2024
This study aims to test the content and face validity of Secondary School Islamic Education Teacher Self-Assessment Instrument (SSIET-SAI) using Content Validity Ratio (CVR), Content Validity Index (CVI) and Cohen Kappa Index (CKI) analysis. They are nine professional experts from universities and schools and two other expert field. Validation…
Descriptors: Secondary School Teachers, Teacher Evaluation, Self Evaluation (Individuals), Test Validity
Dongmei Li; Shalini Kapoor; Ann Arthur; Chi-Yu Huang; YoungWoo Cho; Chen Qiu; Hongling Wang – ACT Education Corp., 2025
Starting in April 2025, ACT will introduce enhanced forms of the ACT® test for national online testing, with a full rollout to all paper and online test takers in national, state and district, and international test administrations by Spring 2026. ACT introduced major updates by changing the test lengths and testing times, providing more time per…
Descriptors: College Entrance Examinations, Testing, Change, Scoring
Do-Hong Kim; Chuang Wang; Thi Nhu Ngoc Truong – Language Teaching Research, 2024
Researchers and practitioners in the field of second language acquisition have come to realize the importance of non-cognitive skills such as self-efficacy and self-regulation in students' learning of a second language. However, there has been limited systematic research on such measures in the second language context and the validity and…
Descriptors: Psychometrics, Test Content, Self Efficacy, English Language Learners