Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 10 |
Since 2016 (last 10 years) | 14 |
Since 2006 (last 20 years) | 31 |
Descriptor
Comparative Analysis | 37 |
Gender Differences | 37 |
Test Items | 37 |
Foreign Countries | 16 |
Scores | 15 |
Item Analysis | 13 |
Test Bias | 9 |
Difficulty Level | 8 |
Statistical Analysis | 8 |
High School Students | 7 |
Mathematics Tests | 7 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 25 |
Journal Articles | 22 |
Reports - Evaluative | 8 |
Dissertations/Theses -… | 4 |
Numerical/Quantitative Data | 4 |
Speeches/Meeting Papers | 3 |
Tests/Questionnaires | 2 |
Collected Works - Proceedings | 1 |
Education Level
Secondary Education | 13 |
Elementary Education | 8 |
High Schools | 8 |
Higher Education | 8 |
Elementary Secondary Education | 7 |
Postsecondary Education | 6 |
Grade 8 | 3 |
Grade 4 | 2 |
Grade 6 | 2 |
Middle Schools | 2 |
Early Childhood Education | 1 |
More ▼ |
Audience
Location
Australia | 2 |
Asia | 1 |
Belgium | 1 |
Czech Republic | 1 |
Indonesia | 1 |
Israel | 1 |
Maryland | 1 |
Michigan | 1 |
New York | 1 |
North Carolina | 1 |
South Korea | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Bacon, Terrence E. – ProQuest LLC, 2023
The purpose of this study was to investigate developmental music aptitude with a broader sample in order to propose national norms. Research questions were: 1) To what extent are published Primary Measures of Music Aptitude (PMMA) norms different from those established using a current sample? 2) Are there comparative differences in PMMA item…
Descriptors: Psychometrics, Music, Aptitude Tests, Test Items
Liu, Xiaowen; Jane Rogers, H. – Educational and Psychological Measurement, 2022
Test fairness is critical to the validity of group comparisons involving gender, ethnicities, culture, or treatment conditions. Detection of differential item functioning (DIF) is one component of efforts to ensure test fairness. The current study compared four treatments for items that have been identified as showing DIF: deleting, ignoring,…
Descriptors: Item Analysis, Comparative Analysis, Culture Fair Tests, Test Validity
Alexander James Kwako – ProQuest LLC, 2023
Automated assessment using Natural Language Processing (NLP) has the potential to make English speaking assessments more reliable, authentic, and accessible. Yet without careful examination, NLP may exacerbate social prejudices based on gender or native language (L1). Current NLP-based assessments are prone to such biases, yet research and…
Descriptors: Gender Bias, Natural Language Processing, Native Language, Computational Linguistics
Shear, Benjamin R. – Journal of Educational Measurement, 2023
Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…
Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
Rujun Xu; James Soland – International Journal of Testing, 2024
International surveys are increasingly being used to understand nonacademic outcomes like math and science motivation, and to inform education policy changes within countries. Such instruments assume that the measure works consistently across countries, ethnicities, and languages--that is, they assume measurement invariance. While studies have…
Descriptors: Surveys, Statistical Bias, Achievement Tests, Foreign Countries
Russell, Michael; Szendey, Olivia; Li, Zhushan – Educational Assessment, 2022
Recent research provides evidence that an intersectional approach to defining reference and focal groups results in a higher percentage of comparisons flagged for potential DIF. The study presented here examined the generalizability of this pattern across methods for examining DIF. While the level of DIF detection differed among the four methods…
Descriptors: Comparative Analysis, Item Analysis, Test Items, Test Construction
Nawas, Abu; Darmawan, I Gusti Ngurah; Maadad, Nina – Language Testing in Asia, 2023
The greater emphasis on the significance and difference in English performance between the school types has mainly been investigated across Asian countries. However, not much is known about what language skills differentiate their overall language achievement. Using a quantitative study with comparative analysis, this study measured the reading…
Descriptors: Foreign Countries, Listening Comprehension Tests, Language Tests, English (Second Language)
Fairness and Comparability in Achievement Motivation Items: A Differential Item Functioning Analysis
Bialo, Jacquelyn A.; Li, Hongli – Journal of Psychoeducational Assessment, 2022
Achievement motivation is a well-documented predictor of a variety of positive student outcomes. However, given observed group differences in motivation and related outcomes, motivation instruments should be checked for comparable item and scale functioning. Therefore, the purpose of this study was to evaluate measurement scale comparability and…
Descriptors: Student Motivation, Academic Achievement, Item Analysis, Gender Differences
Akhavan Masoumi, Ghazal; Sadeghi, Karim – Language Testing in Asia, 2020
This study aimed to examine the effect of test format on test performance by comparing Multiple Choice (MC) and Constructed Response (CR) vocabulary tests in an EFL setting. Also, this paper investigated the function of gender in MC and CR vocabulary measures. To this end, five 20-item stem-equivalent vocabulary tests (CR, and 3-, 4-, 5-, and…
Descriptors: Language Tests, Test Items, English (Second Language), Second Language Learning
Hrouzková, Tereza; Richterek, Lukáš – International Baltic Symposium on Science and Technology Education, 2021
The Lawson classroom test of scientific reasoning is a quite popular and widely used tool that measures the level and development of the student's scientific reasoning skills. In this contribution, the results of this test for the N=446 students of the Faculty of Science Palacký University Olomouc from the years 2018-2020 at the beginning of their…
Descriptors: Science Tests, Thinking Skills, Undergraduate Students, Science Education
Luo, Wei; Smith, Thomas J.; Whalley, Kyle; Darling, Andrew; Ormand, Carol; Hung, Wei-Chen; Chiang, Jui-Ling; Pelletier, Jon; Duffin, Kirk – British Journal of Educational Technology, 2019
This paper presents results from a randomized experimental design replicated over four semesters that compared students' performance in understanding landform evolution processes as measured by the pretest to posttest score growth between two treatment methods: an online interactive simulation tool and a paper-based exercise. While both methods…
Descriptors: Earth Science, Models, Science Tests, Computer Simulation
Bourdeaud'Hui, Heleen; Aesaert, Koen; van Braak, Johan – Language Assessment Quarterly, 2021
Effective listening comprehension skills are an important prerequisite for the academic success of primary school students. However, the assessment of listening skills in the instructional language appears to have received only scant attention in the literature. Therefore, the goal of the present study was twofold. Firstly, a comprehensive…
Descriptors: Native Language, Indo European Languages, Second Language Learning, Test Items
Yang, Eunbae B.; Lee, Myung Ae; Park, Yoon Soo – Advances in Health Sciences Education, 2018
In 2012, the National Health Personnel Licensing Examination Board of Korea decided to publicly disclose all test items and answers to satisfy the test takers' right to know and enhance the transparency of tests administered by the government. This study investigated the effects of item disclosure on the medical licensing examination (MLE),…
Descriptors: Certification, Foreign Countries, Test Items, Disclosure
Acar, Tülin – International Journal of Evaluation and Research in Education, 2016
The aim of this research is to determine the attitudes of secondary level students regarding the skills in English as a Foreign Language and to compare the level of relationship between the academic success at English and the attitudes measured. Attitudes and success levels of the students of secondary education regarding their language skills…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Academic Achievement
Colombo-Dougovito, Andrew M. – Physical Educator, 2013
The purpose of this investigation was to analyze the possible differences of the physical fitness performance of elementary-aged students with and without attention deficit hyperactivity disorder (ADHD). Little research has been produced in the area of youth with ADHD and motor development; this research paper further investigates the effects of…
Descriptors: Physical Fitness, Attention Deficit Hyperactivity Disorder, Elementary School Students, Motor Development