Publication Date
In 2025 | 4 |
Since 2024 | 9 |
Since 2021 (last 5 years) | 47 |
Since 2016 (last 10 years) | 97 |
Since 2006 (last 20 years) | 127 |
Descriptor
Difficulty Level | 138 |
Foreign Countries | 138 |
Test Reliability | 138 |
Test Items | 97 |
Test Validity | 79 |
Test Construction | 54 |
Multiple Choice Tests | 35 |
Item Response Theory | 32 |
Psychometrics | 29 |
Item Analysis | 27 |
Science Tests | 24 |
Author
Al-Jarf, Reima | 2 |
Atalmis, Erkan Hasan | 2 |
Barniol, Pablo | 2 |
Gu, Jianjun | 2 |
Istiyono, Edi | 2 |
Jandaghi, Gholamreza | 2 |
Lubiano, Michael Leonard D. | 2 |
Magpantay, Marife S. | 2 |
Retnawati, Heri | 2 |
Xu, Meidan | 2 |
Zavala, Genaro | 2 |
Publication Type
Journal Articles | 124 |
Reports - Research | 121 |
Tests/Questionnaires | 12 |
Reports - Evaluative | 8 |
Dissertations/Theses -… | 4 |
Reports - Descriptive | 4 |
Speeches/Meeting Papers | 3 |
Collected Works - Proceedings | 1 |
Audience
Researchers | 1 |
Location
Indonesia | 20 |
Turkey | 20 |
Germany | 10 |
Nigeria | 8 |
United Kingdom | 7 |
Australia | 5 |
China | 5 |
Japan | 5 |
South Korea | 5 |
United States | 5 |
Canada | 4 |
Chia-Ying Chu; Pei-Hua Chen; Yi-Shin Tsai; Chieh-An Chen; Yi-Chih Chan; Yan-Jhe Ciou – Journal of Deaf Studies and Deaf Education, 2024
This study investigated the impact of language sample length on mean length of utterance (MLU) and aimed to determine the minimum number of utterances required for a reliable MLU. Conversations were collected from Mandarin-speaking, hard-of-hearing and typical-hearing children aged 16-81 months. The MLUs were calculated using sample sizes ranging…
Descriptors: Foreign Countries, Mandarin Chinese, Young Children, Language Acquisition
Suwita Suwita; Sulistyo Saputro; Sajidan Sajidan; Sutarno Sutarno – Journal of Baltic Science Education, 2024
The current study uses the Rasch Model to measure lower-secondary school students' critical thinking skills on photosynthesis topics. Critical thinking skills are considered essential in science education, but few valid and practical measurement instruments remain. The current study fills the gap by adapting the instrument from the Watson-Glaser…
Descriptors: Secondary School Students, Critical Thinking, Thinking Skills, Botany
Y. Yokhebed; Rexy Maulana Dwi Karmadi; Luvia Ranggi Nastiti – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025
Although self-assessment in critical thinking is thought to help students recognise their strengths and weaknesses, the reliability and validity of the assessment tool are still questionable, so a more objective evaluation is needed. The objective of this investigation is to assess the self-assessment tools in evaluating students' critical thinking…
Descriptors: Self Evaluation (Individuals), Critical Thinking, Science and Society, Test Validity
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Sophie Langhorne; Nora Uglik-Marucha; Charlotte Broadhurst; Elena Lieven; Amelia Pearson; Silia Vitoratou; Kathy Leadbitter – Journal of Autism and Developmental Disorders, 2025
Tools to measure autism knowledge are needed to assess levels of understanding within particular groups of people and to evaluate whether awareness-raising campaigns or interventions lead to improvements in understanding. Several such measures are in circulation, but, to our knowledge, there are no psychometrically-validated questionnaires that…
Descriptors: Foreign Countries, Autism Spectrum Disorders, Questionnaires, Psychometrics
Kanto, Laura; Syrjälä, Henna; Mann, Wolfgang – Journal of Deaf Studies and Deaf Education, 2021
This study investigates children's vocabulary knowledge in Finnish Sign Language (FinSL), specifically their understanding of different form-meaning mappings by using a multilayered assessment format originally developed for British Sign Language (BSL). The web-based BSL vocabulary test by Mann (2009) was adapted for FinSL following the steps…
Descriptors: Vocabulary Development, Sign Language, Foreign Countries, Deafness
Deniz, Kaan Zulfikar; Ilican, Emel – International Journal of Assessment Tools in Education, 2021
This study aims to compare the G and Phi coefficients as estimated by D studies for a measurement tool with the G and Phi coefficients obtained from real cases in which items of differing difficulty levels were added and also to determine the conditions under which the D studies estimated reliability coefficients closer to reality. The study group…
Descriptors: Generalizability Theory, Test Items, Difficulty Level, Test Reliability
Büsra Kilinç; Mehmet Diyaddin Yasar – Science Insights Education Frontiers, 2024
This study aimed to develop an achievement test that takes into account the subject acquisitions of the sound and properties unit in the sixth-grade science course. In the test development phase, a literature review was conducted first. Then, 30 multiple-choice questions in alignment with the subject acquisition in the 2018…
Descriptors: Science Tests, Test Construction, Grade 6, Science Instruction
Rafatbakhsh, Elaheh; Ahmadi, Alireza – Practical Assessment, Research & Evaluation, 2022
The purpose of this study was to investigate the validity of the vocabulary subsection of a high-stakes university entrance exam for Ph.D. programs using the argument-based approach. All three versions of the test administered over a period of five years and the responses of 12,500 test-takers were studied. The study focused on four…
Descriptors: Vocabulary, College Entrance Examinations, Doctoral Programs, Test Validity
Kim, Hun Ju; Lee, Sung Ja; Kam, Kyung-Yoon – International Journal of Disability, Development and Education, 2023
This study verified validity and reliability of the School Function Assessment (SFA) using Rasch analysis in South Korean school-based occupational therapy sites serving children with intellectual disabilities and others. Participants were 103 elementary school children (grades 1 through 6) with disabilities. Rasch analysis revealed several…
Descriptors: Foreign Countries, Test Validity, Test Reliability, Occupational Therapy
Benton, Tom – Research Matters, 2021
Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…
Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level
Dina Kamber Hamzic; Mirsad Trumic; Ismar Hadžalic – International Electronic Journal of Mathematics Education, 2025
Trigonometry is an important part of secondary school mathematics, but it is usually challenging for students to understand and learn. Since trigonometry is learned and used at a university level in many fields, like physics or geodesy, it is important to have an insight into students' trigonometry knowledge before the beginning of the university…
Descriptors: Trigonometry, Mathematics Instruction, Prior Learning, Outcomes of Education
Munawarah; Thalhah, Siti Zuhaerah; Angriani, Andi Dian; Nur, Fitriani; Kusumayanti, Andi – Mathematics Teaching Research Journal, 2021
The increase in the need for critical and analytical thinking among students to boost their confidence in dealing with complex and difficult problems has led to the development of computational skills. Therefore, this study aims to develop an instrument test for computational thinking (CT) skills in the mathematics-based RME (Realistic Mathematics…
Descriptors: Test Construction, Mathematics Tests, Computation, Thinking Skills
Isolda Margarita Castillo-Martínez; Davis Velarde-Camaqui; María Soledad Ramírez-Montoya; Jorge Sanabria-Z – Journal of Social Studies Education Research, 2024
Reasoning for complexity is a fundamental competency in these complex times for solutions to social problems and decision-making. The purpose of this paper is to demonstrate the validity and reliability of the eComplexity instrument by presenting its psychometric properties. The instrument consists of a Likert-type scale questionnaire designed to…
Descriptors: Psychometrics, Test Validity, Test Reliability, Difficulty Level
Saenna, Watcharaporn; Phusee-orn, Songsak – Higher Education Studies, 2022
The purposes of the research were to: (1) create a scientific creativity measure for high school students; (2) examine the quality of the scientific creativity scale of the created test; (3) establish a benchmark for scientific creativity scores for high school students; and (4) study the scientific creativity level of students in the senior high…
Descriptors: Foreign Countries, Test Construction, High School Students, Creativity