Publication Date
In 2025 | 6 |
Since 2024 | 8 |
Since 2021 (last 5 years) | 21 |
Since 2016 (last 10 years) | 41 |
Since 2006 (last 20 years) | 66 |
Descriptor
Difficulty Level | 71 |
Foreign Countries | 71 |
Psychometrics | 71 |
Test Items | 47 |
Item Response Theory | 32 |
Test Reliability | 29 |
Test Validity | 26 |
Item Analysis | 18 |
Test Construction | 15 |
Multiple Choice Tests | 14 |
Science Tests | 13 |
Publication Type
Journal Articles | 64 |
Reports - Research | 62 |
Reports - Evaluative | 5 |
Collected Works - Proceedings | 2 |
Dissertations/Theses -… | 2 |
Numerical/Quantitative Data | 1 |
Tests/Questionnaires | 1 |
Audience
Researchers | 1 |
Location
Turkey | 7 |
Taiwan | 6 |
Germany | 5 |
Greece | 5 |
Nigeria | 5 |
Canada | 4 |
United States | 4 |
Australia | 3 |
Indonesia | 3 |
Malaysia | 3 |
South Africa | 3 |
Katrin Schuessler; Vanessa Fischer; Maik Walpuski – Instructional Science: An International Journal of the Learning Sciences, 2025
Cognitive load studies are mostly centered on information on perceived cognitive load. Single-item subjective rating scales are the dominant measurement practice to investigate overall cognitive load. Usually, either invested mental effort or perceived task difficulty is used as an overall cognitive load measure. However, the extent to which the…
Descriptors: Cognitive Processes, Difficulty Level, Rating Scales, Construct Validity
Apichat Khamboonruang – Language Testing in Asia, 2025
The Chulalongkorn University Language Institute (CULI) test was developed as a local standardised test of English for professional and international communication. To ensure that the CULI test fulfils its intended purposes, this study employed Kane's argument-based validation and Rasch measurement approaches to construct the validity argument for the…
Descriptors: Universities, Second Language Learning, Second Language Instruction, Language Tests
Sophie Langhorne; Nora Uglik-Marucha; Charlotte Broadhurst; Elena Lieven; Amelia Pearson; Silia Vitoratou; Kathy Leadbitter – Journal of Autism and Developmental Disorders, 2025
Tools to measure autism knowledge are needed to assess levels of understanding within particular groups of people and to evaluate whether awareness-raising campaigns or interventions lead to improvements in understanding. Several such measures are in circulation, but, to our knowledge, there are no psychometrically-validated questionnaires that…
Descriptors: Foreign Countries, Autism Spectrum Disorders, Questionnaires, Psychometrics
Cui-Yan Hoe; Chieh-Yu Chen; Ching-I Chen – Infants and Young Children, 2025
The Ages and Stages Questionnaires: Social-Emotional, Second Edition (ASQ:SE-2) has been translated into Traditional Chinese (ASQ:SE-2-TC) in Taiwan. This study investigated whether the ASQ:SE-2-TC is also suitable for use in Malaysian Chinese families, and whether any cultural differences are present in ASQ:SE-2-TC items. This study analyzed the…
Descriptors: Social Emotional Learning, Child Development, Screening Tests, Item Analysis
Rodrigo Moreta-Herrera; Xavier Oriol-Granado; Mònica González; Jose A. Rodas – Infant and Child Development, 2025
This study evaluates the Children's Worlds Psychological Well-Being Scale (CW-PSWBS) within a diverse international cohort of children aged 10 and 12, utilising Classical Test Theory (CTT) and Item Response Theory (IRT) methodologies. Through a detailed psychometric analysis, this research assesses the CW-PSWBS's structural integrity, focusing on…
Descriptors: Well Being, Rating Scales, Children, Item Response Theory
Kanto, Laura; Syrjälä, Henna; Mann, Wolfgang – Journal of Deaf Studies and Deaf Education, 2021
This study investigates children's vocabulary knowledge in Finnish Sign Language (FinSL), specifically their understanding of different form-meaning mappings by using a multilayered assessment format originally developed for British Sign Language (BSL). The web-based BSL vocabulary test by Mann (2009) was adapted for FinSL following the steps…
Descriptors: Vocabulary Development, Sign Language, Foreign Countries, Deafness
Mimi Ismail; Ahmed Al-Badri; Said Al-Senaidi – Journal of Education and e-Learning Research, 2025
This study aimed to reveal the differences in individuals' abilities, their standard errors, and the psychometric properties of the test according to the two methods of applying the test (electronic and paper). The descriptive approach was used to achieve the study's objectives. The study sample consisted of 74 male and female students at the…
Descriptors: Achievement Tests, Computer Assisted Testing, Psychometrics, Item Response Theory
Zenger, Tim; Bitzenbauer, Philipp – Science Education International, 2022
This article reports on the development and piloting of a German version of a concept test to assess students' conceptual knowledge of density. The concept test was administered in paper-pencil format to 222 German secondary school students as a post-test after instruction in all relevant concepts of density. We provide a psychometric…
Descriptors: Foreign Countries, Secondary School Students, Concept Formation, Psychometrics
Figueiredo, Sandra; Martins, Margarida Alves – Journal of Cognitive Education and Psychology, 2022
In order to assess the accuracy and validity of proficiency diagnostic tests in Second Language (L2), specifically regarding the linguistic (orthographic, semantic, syntactic, lexical) and cognitive (verbal reasoning, lexical decision) components for the immigrant population in Portugal, a study of cut-off points of 6 tests was conducted. This…
Descriptors: Foreign Countries, Second Language Learning, Language Proficiency, Portuguese
Musa Adekunle Ayanwale – Discover Education, 2023
Examination scores obtained by students from the West African Examinations Council (WAEC), and National Business and Technical Examinations Board (NABTEB) may not be directly comparable due to differences in examination administration, item characteristics of the subject in question, and student abilities. For more accurate comparisons, scores…
Descriptors: Equated Scores, Mathematics Tests, Test Items, Test Format
Roelofs, Erik C.; Emons, Wilco H. M.; Verschoor, Angela J. – International Journal of Testing, 2021
This study reports on an Evidence Centered Design (ECD) project in the Netherlands, involving the theory exam for prospective car drivers. In particular, we illustrate how cognitive load theory, task-analysis, response process models, and explanatory item-response theory can be used to systematically develop and refine task models. Based on a…
Descriptors: Foreign Countries, Psychometrics, Test Items, Evidence Based Practice
Isolda Margarita Castillo-Martínez; Davis Velarde-Camaqui; María Soledad Ramírez-Montoya; Jorge Sanabria-Z – Journal of Social Studies Education Research, 2024
Reasoning for complexity is a fundamental competency in these complex times for solutions to social problems and decision-making. The purpose of this paper is to demonstrate the validity and reliability of the eComplexity instrument by presenting its psychometric properties. The instrument consists of a Likert-type scale questionnaire designed to…
Descriptors: Psychometrics, Test Validity, Test Reliability, Difficulty Level
Fadillah, Sarah Meilani; Ha, Minsu; Nuraeni, Eni; Indriyanti, Nurma Yunita – Malaysian Journal of Learning and Instruction, 2023
Purpose: Researchers discovered that when students were given the opportunity to change their answers, a majority changed their responses from incorrect to correct, and this change often increased the overall test results. What prompts students to modify their answers? This study aims to examine the modification of scientific reasoning test, with…
Descriptors: Science Tests, Multiple Choice Tests, Test Items, Decision Making
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
Slepkov, A. D.; Van Bussel, M. L.; Fitze, K. M.; Burr, W. S. – SAGE Open, 2021
There is a broad literature in multiple-choice test development, both in terms of item-writing guidelines, and psychometric functionality as a measurement tool. However, most of the published literature concerns multiple-choice testing in the context of expert-designed high-stakes standardized assessments, with little attention being paid to the…
Descriptors: Foreign Countries, Undergraduate Students, Student Evaluation, Multiple Choice Tests