Publication Date
In 2025 | 2 |
Since 2024 | 7 |
Since 2021 (last 5 years) | 31 |
Since 2016 (last 10 years) | 99 |
Since 2006 (last 20 years) | 170 |
Descriptor
Correlation | 235 |
Item Analysis | 235 |
Test Reliability | 118 |
Foreign Countries | 115 |
Factor Analysis | 112 |
Reliability | 100 |
Test Validity | 74 |
Measures (Individuals) | 60 |
Psychometrics | 55 |
Test Construction | 48 |
Statistical Analysis | 47 |
More ▼ |
Source
Author
Abrams, Lisa M. | 2 |
Conroy, Maureen A. | 2 |
Hung Tan Ha | 2 |
Isgör, Isa Yücel | 2 |
McLeod, Bryce D. | 2 |
Smith, Meghan M. | 2 |
Sutherland, Kevin S. | 2 |
Tim Stoeckel | 2 |
Veldman, Donald J. | 2 |
Yaman, Erkan | 2 |
Abd-Hamid, Nor Hashidah | 1 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 2 |
Location
Turkey | 42 |
Canada | 7 |
China | 6 |
Taiwan | 5 |
Hong Kong | 4 |
Netherlands | 4 |
South Korea | 4 |
Greece | 3 |
India | 3 |
Japan | 3 |
Saudi Arabia | 3 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023
A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…
Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation
Alkhanani, Badriah – International Journal of Language Education, 2022
The purpose of this study was to find the effect of English Language Teachers' Methodology (ELTM) on the Career Growth (CG) of the Saudi students. In order to provide a solid basis for this research study, a cross-sectional-descriptive research design was employed. For scale development and tool standardization, inter-class correlation…
Descriptors: Career Development, English (Second Language), Second Language Learning, Second Language Instruction
Kilic, Abdullah Faruk; Uysal, Ibrahim – International Journal of Assessment Tools in Education, 2022
Most researchers investigate the corrected item-total correlation of items when analyzing item discrimination in multi-dimensional structures under the Classical Test Theory, which might lead to underestimating item discrimination, thereby removing items from the test. Researchers might investigate the corrected item-total correlation with the…
Descriptors: Item Analysis, Correlation, Item Response Theory, Test Items
Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024
Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…
Descriptors: Semantics, Educational Assessment, Evaluators, Reliability
Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025
The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…
Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis
Tim Stoeckel; Liang Ye Tan; Hung Tan Ha; Nam Thi Phuong Ho; Tomoko Ishii; Young Ae Kim; Chunmei Huang; Stuart McLean – Vocabulary Learning and Instruction, 2024
Local item dependency (LID) occurs when test-takers' responses to one test item are affected by their responses to another. It can be problematic if it causes inflated reliability estimates or distorted person and item measures. The cued-recall reading comprehension test in Hu and Nation's (2000) well-known and influential coverage--comprehension…
Descriptors: Reading Comprehension, English (Second Language), Second Language Instruction, Second Language Learning
Ferrari-Bridgers, Franca – International Journal of Listening, 2023
While many tools exist to assess student content knowledge, there are few that assess whether students display the critical listening skills necessary to interpret the quality of a speaker's message at the college level. The following research provides preliminary evidence for the internal consistency and factor structure of a tool, the…
Descriptors: Factor Structure, Test Validity, Community College Students, Test Reliability
Maxwell, Bruce; Boon, Helen; Tanchuk, Nicolas; Rauwerda, Bryan – Journal of Moral Education, 2021
This article documents the adaptation, piloting and validation of a measure of teachers' ethical sensitivity. To create the test, we modified a measure from dentistry drawing on literature in teacher professional ethics and drew on the expertise of professional ethics scholars and practitioners. Based on the results of Rasch analysis combined with…
Descriptors: Ethics, Moral Values, Scores, Teacher Education Programs
Konrad Piotrowski; Aleksandra Nowicka; Kamil Janowicz; Martin M. Smith – Journal of Psychoeducational Assessment, 2024
The Big Three Perfectionism Scale (BTPS) was created to integrate different aspects of perfectionism, including the newly conceptualized concept of narcissistic perfectionism. The goal of our two studies (N = 1341) was to examine the psychometric properties of the Polish adaptation of the BTPS, supporting the validity and portability of the…
Descriptors: Personality Measures, Foreign Countries, Personality Problems, Parent Child Relationship
Ahmet Erol; Mustafa Erol – Journal of Turkish Science Education, 2024
Engineering education aims to equip children with the skills to solve and apply complex problems. Problem-solving processes in engineering require high-level thinking and mind habits. Habit is a term used to describe various aspects of intelligence. Engineering habits of mind are the values, attitudes, and thinking skills associated with…
Descriptors: Engineering Education, Factor Analysis, Cognitive Processes, Construct Validity
Huaxia Xiong; Mingfeng Xue; Guan Di; Yaqing Mao; Enhui Qiao – Journal of Psychoeducational Assessment, 2024
The impact of teachers' beliefs on the implementation and effectiveness of Social and Emotional Learning (SEL) programs underscores the essential need for reliable measures of these beliefs. This study aims to explore and validate the psychometric properties of the Teacher Social and Emotional Learning Beliefs Scale (TSELBS) within the Chinese…
Descriptors: Social Emotional Learning, Teacher Attitudes, Program Effectiveness, Correlation
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Slepkov, A. D.; Van Bussel, M. L.; Fitze, K. M.; Burr, W. S. – SAGE Open, 2021
There is a broad literature in multiple-choice test development, both in terms of item-writing guidelines, and psychometric functionality as a measurement tool. However, most of the published literature concerns multiple-choice testing in the context of expert-designed high-stakes standardized assessments, with little attention being paid to the…
Descriptors: Foreign Countries, Undergraduate Students, Student Evaluation, Multiple Choice Tests
Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023
Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attribute. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…
Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity