Publication Date
In 2025 | 4 |
Since 2024 | 7 |
Since 2021 (last 5 years) | 9 |
Since 2016 (last 10 years) | 32 |
Since 2006 (last 20 years) | 66 |
Descriptor
Comparative Analysis | 67 |
Test Reliability | 67 |
Test Validity | 44 |
Foreign Countries | 29 |
Correlation | 28 |
Undergraduate Students | 24 |
College Students | 18 |
Scores | 14 |
Factor Analysis | 13 |
Test Items | 13 |
Psychometrics | 12 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Location
Turkey | 6 |
Iran | 5 |
Australia | 3 |
Japan | 3 |
United Kingdom | 3 |
Greece | 2 |
Saudi Arabia | 2 |
United Kingdom (England) | 2 |
United States | 2 |
Asia | 1 |
Austria | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Antonio P. Gutierrez de Blume; Diana Marcela Montoya Londoño; Virginia Jiménez Rodríguez; Olivia Morán Núñez; Ariel Cuadro; Lilián Daset; Mauricio Molina Delgado; Claudia García de la Cadena; María Beatríz Beltrán Navarro; Aníbal Puente Ferreras; Sebastián Urquijo; Walter Lizandro Arias – Metacognition and Learning, 2024
Metacognition is defined as a higher-order thinking skill that enables individuals to monitor, control, and regulate their thinking and behavior. In education, this skill is important, as learners need to self-regulate their learning behaviors for successful lifelong learning. Thus, it is essential for educators and learners alike to know their…
Descriptors: Metacognition, Measures (Individuals), Psychometrics, Standards
Karel Kok; Sophia Chroszczinsky; Burkhard Priemer – Physical Review Physics Education Research, 2024
Data comparison problems are used in teaching and science education research that focuses on students' ability to compare datasets and their conceptual understanding of measurement uncertainties. However, the evaluation of students' decisions in these problems can pose a problem: e.g., students making a correct decision for the wrong reasons.…
Descriptors: Secondary School Students, Undergraduate Students, Comparative Analysis, Evaluation Methods
Jeff Allen; Ty Cruce – ACT Education Corp., 2025
This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…
Descriptors: College Entrance Examinations, Testing, Change, Scores
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
Amssalu Wondmagegn Getu; Fikadu Edhetu Gashaw; Menberu Mengesha Woldemariam – Shanlax International Journal of Education, 2024
The study aimed to assess the effectiveness of the Predict-Explain-Enact-Observe-Reflect (PEEOR) instructional strategy on general science students' conceptual understanding and motivation in the topic of motion and force. The research employed a pre-test post-test quasi-experimental design. The sample consisted of 107 general science summer, year…
Descriptors: Physics, Science Instruction, Learning Motivation, Reflection
David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023
We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…
Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format
Miguel-Revilla, Diego – Journal of Curriculum Studies, 2022
Secondary education students do not usually perceive history as a subject connected with their lives, backgrounds and interests. At the same time, prospective and in-service teachers do not always have a coherent vision of this discipline, which can reflect on their students' perceptions. This study makes use of a theoretical framework developed…
Descriptors: History Instruction, Relevance (Education), Student Attitudes, Secondary School Students
Ozdemir, Adem; Koc, Yasemin; Gundogdu, Kerim – International Journal of Psycho-Educational Sciences, 2018
The aim of this research is to develop a scale that prospective science teachers in the Education Faculties compare themselves to their peers according to the "Science field Teacher and Professional Skills" courses. For this reason, 25 items related to Physics, Chemistry, Biology, Science Experiments and Professional Skills courses were…
Descriptors: Preservice Teachers, Science Teachers, Likert Scales, Test Validity
Zaidi, Nikki L.; Swoboda, Christopher M.; Kelcey, Benjamin M.; Manuel, R. Stephen – Advances in Health Sciences Education, 2017
The extant literature has largely ignored a potentially significant source of variance in multiple mini-interview (MMI) scores by "hiding" the variance attributable to the sample of attributes used on an evaluation form. This potential source of hidden variance can be defined as rating items, which typically comprise an MMI evaluation…
Descriptors: Interviews, Scores, Generalizability Theory, Monte Carlo Methods
Horn, Aaron S.; Horner, Olena G.; Lee, Giljae – Studies in Higher Education, 2019
Researchers in higher education frequently evaluate institutional effectiveness as the difference between an actual and predicted graduation rate, but little is known about whether such a method is reliable or valid. This study examines the measurement properties of effectiveness scores derived from regression residuals for community colleges in…
Descriptors: Instructional Effectiveness, Two Year Colleges, Comparative Analysis, Raw Scores
Karami, Hossein; Kouhpaee Nejad, Mohammadhossein; Nourzadeh, Saeed; Ahmadi Shirazi, Masoumeh – International Journal of Bilingual Education and Bilingualism, 2020
This study was set to cross-validate a bilingual Persian-English version of the Vocabulary Size Test (VST) against the monolingual English version and compare Iranian EFL learners' performance on the two versions. Various bilingual versions of the VST have been developed based on the assumption that bilingual versions are not affected by the…
Descriptors: Bilingualism, Indo European Languages, English (Second Language), Second Language Learning
The Efficiency of Higher Education Institutions in England Revisited: Comparing Alternative Measures
Johnes, Geraint; Tone, Kaoru – Tertiary Education and Management, 2017
Data envelopment analysis (DEA) has often been used to evaluate efficiency in the context of higher education institutions. Yet there are numerous alternative non-parametric measures of efficiency available. This paper compares efficiency scores obtained for institutions of higher education in England, 2013-2014, using three different methods: the…
Descriptors: Foreign Countries, Efficiency, Higher Education, Alternative Assessment
Wilkin, John P. – College & Research Libraries, 2017
The 1961 Copyright Office study on renewals, authored by Barbara Ringer, has cast an outsized influence on discussions of the U.S. 1923-1963 public domain. As more concrete data emerge from initiatives such as the large-scale determination process in the Copyright Review Management System (CRMS) project, questions are raised about the reliability…
Descriptors: Comparative Analysis, Copyrights, Misconceptions, Test Reliability