Publication Date
In 2025 | 3 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 20 |
Since 2016 (last 10 years) | 70 |
Since 2006 (last 20 years) | 144 |
Descriptor
Comparative Analysis | 174 |
Foreign Countries | 174 |
Test Reliability | 174 |
Test Validity | 111 |
Statistical Analysis | 36 |
Factor Analysis | 35 |
Correlation | 33 |
Psychometrics | 32 |
English (Second Language) | 31 |
Scores | 30 |
Measures (Individuals) | 27 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Administrators | 2 |
Teachers | 2 |
Practitioners | 1 |
Researchers | 1 |
Location
Turkey | 19 |
Australia | 15 |
United States | 14 |
United Kingdom (England) | 11 |
China | 10 |
Germany | 9 |
Hong Kong | 9 |
Iran | 9 |
Taiwan | 9 |
United Kingdom | 9 |
Belgium | 7 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Antonio P. Gutierrez de Blume; Diana Marcela Montoya Londoño; Virginia Jiménez Rodríguez; Olivia Morán Núñez; Ariel Cuadro; Lilián Daset; Mauricio Molina Delgado; Claudia García de la Cadena; María Beatríz Beltrán Navarro; Aníbal Puente Ferreras; Sebastián Urquijo; Walter Lizandro Arias – Metacognition and Learning, 2024
Metacognition is defined as a higher-order thinking skill that enables individuals to monitor, control, and regulate their thinking and behavior. In education, this skill is important, as learners need to self-regulate their learning behaviors for successful lifelong learning. Thus, it is essential for educators and learners alike to know their…
Descriptors: Metacognition, Measures (Individuals), Psychometrics, Standards
Maïano, Christophe; Morin, Alexandre J. S.; Tietjens, Maike; Bastos, Tânia; Luiggi, Maxime; Corredeira, Rui; Griffet, Jean; Sánchez-Oliva, David – Measurement in Physical Education and Exercise Science, 2023
The present study sought to examine the psychometric properties of new German, Portuguese, and Spanish versions of the Revised Short Form of the Physical Self-Inventory (PSI-S-"R"), and to contrast these properties against those from the original French version of this instrument. Participants (n = 1802) were 288 French youth, 177 German…
Descriptors: German, Portuguese, Spanish, Test Construction
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
Amssalu Wondmagegn Getu; Fikadu Edhetu Gashaw; Menberu Mengesha Woldemariam – Shanlax International Journal of Education, 2024
The study aimed to assess the effectiveness of the Predict-Explain-Enact-Observe-Reflect (PEEOR) instructional strategy on general science students' conceptual understanding and motivation in the topic of motion and force. The research employed a pre-test post-test quasi-experimental design. The sample consisted of 107 general science summer, year…
Descriptors: Physics, Science Instruction, Learning Motivation, Reflection
Benton, Tom – Research Matters, 2021
Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…
Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level
Purwanto; Hidayah, Niswatul; Wagistina, Satti – International Journal of Educational Methodology, 2023
Learning geography in Indonesia philosophically aims to develop spatial literacy. Students must improve spatial literacy to form reasoning skills and apply spatial concepts in real life. Applying Gersmehl's spatial learning can improve students' spatial literacy through syntax arranged based on spatial aspects. The use of google earth helps…
Descriptors: Spatial Ability, Natural Disasters, Geography Instruction, Teaching Methods
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions
Kasikarn Bansong; Somkiet Poopatwiboon; Apisak Sukying – Journal of Education and Learning, 2023
It is increasingly prevalent in digital learning and teaching strategies for discerning a global perspective on creating the student learning experience. Multimodality is an emergent phenomenon that may influence how digital learning is designed, especially during the COVID-19 pandemic in which immersive learning environments, such as a virtual…
Descriptors: Elementary School Students, English (Second Language), Second Language Learning, Second Language Instruction
Seraji, Farhad; Ansari, Saied; Chosarih, Muhammad Reza Yousefzadeh – Education and Information Technologies, 2023
Increasing children's access to media has attached greater significance to media literacy education, the content and methods of which have changed with media development. This study aimed to investigate the effects of the Community of Inquiry (CoI) method on media literacy competencies in elementary students. To this end, 95 female sixth-grade…
Descriptors: Media Literacy, Communities of Practice, Elementary School Students, Literacy Education
Maghfiroh, Anissa; Kuswanto, Heru – International Journal of Instruction, 2022
This research aims to reveal the effectiveness of the use of Kofie GeBoL media in improving (1) vector representation ability and (2) critical thinking ability in physics instruction. It is a descriptive quantitative study with the quasi-experiment design. It was conducted in two stages: empirical try out and implementation of Kofie GeboL to see…
Descriptors: Physics, Instructional Effectiveness, Critical Thinking, Thinking Skills
Sawczuk, Thomas; Jones, Ben; Scantlebury, Sean; Weakley, Jonathan; Read, Dale; Costello, Nessan; Darrall-Jones, Joshua David; Stokes, Keith; Till, Kevin – Measurement in Physical Education and Exercise Science, 2018
This study aimed to evaluate the between-day reliability and usefulness of a fitness testing battery in a group of youth sport athletes. Fifty-nine youth sport athletes (age = 17.3 ± 0.7 years) undertook a fitness testing battery including the isometric mid-thigh pull, counter-movement jump, 5-40 m sprint splits, and the 5-0-5 change of direction…
Descriptors: Test Reliability, Comparative Analysis, Athletes, Team Sports
Fu, Yuanshu; Wen, Zhonglin; Wang, Yang – Educational and Psychological Measurement, 2018
The maximal reliability of a congeneric measure is achieved by weighting item scores to form the optimal linear combination as the total score; it is never lower than the composite reliability of the measure when measurement errors are uncorrelated. The statistical method that renders maximal reliability would also lead to maximal criterion…
Descriptors: Test Reliability, Test Validity, Comparative Analysis, Attitude Measures