Publication Date
In 2025 | 6 |
Since 2024 | 12 |
Since 2021 (last 5 years) | 51 |
Since 2016 (last 10 years) | 93 |
Since 2006 (last 20 years) | 122 |
Descriptor
Difficulty Level | 136 |
Foreign Countries | 136 |
Test Validity | 136 |
Test Items | 88 |
Test Reliability | 81 |
Test Construction | 55 |
Multiple Choice Tests | 35 |
Item Response Theory | 32 |
Item Analysis | 29 |
Language Tests | 29 |
Psychometrics | 26 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 122 |
Journal Articles | 119 |
Tests/Questionnaires | 13 |
Reports - Evaluative | 7 |
Speeches/Meeting Papers | 7 |
Dissertations/Theses -… | 3 |
Reports - Descriptive | 3 |
Collected Works - Proceedings | 1 |
Opinion Papers | 1 |
Education Level
Audience
Researchers | 1 |
Location
Turkey | 15 |
Indonesia | 13 |
Canada | 7 |
Germany | 7 |
Japan | 7 |
Nigeria | 7 |
Iran | 6 |
United Kingdom | 4 |
Australia | 3 |
Chile | 3 |
China | 3 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Sherwin E. Balbuena – Online Submission, 2024
This study introduces a new chi-square test statistic for testing the equality of response frequencies among distracters in multiple-choice tests. The formula uses the information from the number of correct answers and wrong answers, which becomes the basis of calculating the expected values of response frequencies per distracter. The method was…
Descriptors: Multiple Choice Tests, Statistics, Test Validity, Testing
Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2022
The possible dependency of criterion validity on item formulation in a multicomponent measuring instrument is examined. The discussion is concerned with evaluation of the differences in criterion validity between two or more groups (populations/subpopulations) that have been administered instruments with items having differently formulated item…
Descriptors: Test Items, Measures (Individuals), Test Validity, Difficulty Level
Apichat Khamboonruang – Language Testing in Asia, 2025
Chulalongkorn University Language Institute (CULI) test was developed as a local standardised test of English for professional and international communication. To ensure that the CULI test fulfils its intended purposes, this study employed Kane's argument-based validation and Rasch measurement approaches to construct the validity argument for the…
Descriptors: Universities, Second Language Learning, Second Language Instruction, Language Tests
Suwita Suwita; Sulistyo Saputro; Sajidan Sajidan; Sutarno Sutarno – Journal of Baltic Science Education, 2024
The current study uses the Rasch Model to measure lower-secondary school students' critical thinking skills on photosynthesis topics. Critical thinking skills are considered essential in science education, but few valid and practical measurement instruments remain. The current study fills the gap by adapting the instrument from the Watson-Glaser…
Descriptors: Secondary School Students, Critical Thinking, Thinking Skills, Botany
Y. Yokhebed; Rexy Maulana Dwi Karmadi; Luvia Ranggi Nastiti – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025
Although self-assessment in critical thinking is thought to help students recognise their strengths and weaknesses, the reliability and validity of the assessment tool is still questionable, so a more objective evaluation is needed. Objective of this investigation is to assess the self-assessment tools in evaluating students' critical thinking…
Descriptors: Self Evaluation (Individuals), Critical Thinking, Science and Society, Test Validity
Busra Ozmen Yagiz; Ecenaz Alemdag – Education and Information Technologies, 2025
Resilience is a critical personality trait that allows one to deal with difficulties, learn from failures, and maintain a positive attitude during task performance. However, it has not been understudied in a complex and challenging educational domain. The current research intends to address this gap by analyzing the specific characteristics of…
Descriptors: Foreign Countries, Undergraduate Students, Resilience (Psychology), Programming
Wai Kei Chan; Li Zhang; Emily Pey-Tee Oon – International Journal of Assessment Tools in Education, 2023
We report the validity of a test instrument that assesses the arithmetic ability of primary students by (a) describing the theoretical model of arithmetic ability assessment using Wilson's (2004) four building blocks of constructing measures and (b) providing empirical evidence for the validation study. The instrument consists of 21…
Descriptors: Foreign Countries, Elementary School Students, Arithmetic, Grade 3
Sophie Langhorne; Nora Uglik-Marucha; Charlotte Broadhurst; Elena Lieven; Amelia Pearson; Silia Vitoratou; Kathy Leadbitter – Journal of Autism and Developmental Disorders, 2025
Tools to measure autism knowledge are needed to assess levels of understanding within particular groups of people and to evaluate whether awareness-raising campaigns or interventions lead to improvements in understanding. Several such measures are in circulation, but, to our knowledge, there are no psychometrically-validated questionnaires that…
Descriptors: Foreign Countries, Autism Spectrum Disorders, Questionnaires, Psychometrics
Kanto, Laura; Syrjälä, Henna; Mann, Wolfgang – Journal of Deaf Studies and Deaf Education, 2021
This study investigates children's vocabulary knowledge in Finnish Sign Language (FinSL), specifically their understanding of different form-meaning mappings by using a multilayered assessment format originally developed for British Sign Language (BSL). The web-based BSL vocabulary test by Mann (2009) was adapted for FinSL following the steps…
Descriptors: Vocabulary Development, Sign Language, Foreign Countries, Deafness
Büsra Kilinç; Mehmet Diyaddin Yasar – Science Insights Education Frontiers, 2024
In this study, it was aimed to develop an achievement test taking into account the subject acquisitions of the sound and properties unit in the sixth-grade science course. In the test development phase, firstly, literature review for the study was conducted. Then, 30 multiple choice questions in align with the subject acquisition in the 2018…
Descriptors: Science Tests, Test Construction, Grade 6, Science Instruction
Rafatbakhsh, Elaheh; Ahmadi, Alireza – Practical Assessment, Research & Evaluation, 2022
The purpose of this study was to investigate the validity of the vocabulary subsection of a high-stakes university entrance exam for Ph.D. programs using the argument-based approach. All the three different versions of the test administered in a period of five years and the responses of 12,500 test-takers were studied. The study focused on four…
Descriptors: Vocabulary, College Entrance Examinations, Doctoral Programs, Test Validity
Kim, Hun Ju; Lee, Sung Ja; Kam, Kyung-Yoon – International Journal of Disability, Development and Education, 2023
This study verified validity and reliability of the School Function Assessment (SFA) using Rasch analysis in South Korean school-based occupational therapy sites serving children with intellectual disabilities and others. Participants were 103 elementary school children (grades 1 through 6) with disabilities. Rasch analysis revealed several…
Descriptors: Foreign Countries, Test Validity, Test Reliability, Occupational Therapy
Dina Kamber Hamzic; Mirsad Trumic; Ismar Hadžalic – International Electronic Journal of Mathematics Education, 2025
Trigonometry is an important part of secondary school mathematics, but it is usually challenging for students to understand and learn. Since trigonometry is learned and used at a university level in many fields, like physics or geodesy, it is important to have an insight into students' trigonometry knowledge before the beginning of the university…
Descriptors: Trigonometry, Mathematics Instruction, Prior Learning, Outcomes of Education
Lee, Shinhye – ETS Research Report Series, 2022
In response to the calls for making key stakeholders' perspectives relevant in the test validation process, the study discussed in this report sought test-taker feedback as part of collecting validity evidence and supporting the ongoing field testing efforts of the new "TOEFL ITP"® Speaking section. Specifically, I aimed to investigate…
Descriptors: English (Second Language), Second Language Learning, Language Tests, Test Validity
Isolda Margarita Castillo-Martínez; Davis Velarde-Camaqui; María Soledad Ramírez-Montoya; Jorge Sanabria-Z – Journal of Social Studies Education Research, 2024
Reasoning for complexity is a fundamental competency in these complex times for solutions to social problems and decision-making. The purpose of this paper is to demonstrate the validity and reliability of the eComplexity instrument by presenting its psychometric properties. The instrument consists of a Likert-type scale questionnaire designed to…
Descriptors: Psychometrics, Test Validity, Test Reliability, Difficulty Level