Publication Date
In 2025 | 2 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 16 |
Since 2016 (last 10 years) | 38 |
Since 2006 (last 20 years) | 54 |
Descriptor
Difficulty Level | 59 |
Item Response Theory | 59 |
Test Validity | 59 |
Test Items | 55 |
Test Reliability | 41 |
Foreign Countries | 32 |
Test Construction | 20 |
Psychometrics | 19 |
High School Students | 12 |
Item Analysis | 11 |
Statistical Analysis | 11 |
Author
Bejar, Isaac I. | 3 |
Paek, Insu | 3 |
Schoen, Robert C. | 3 |
Yang, Xiaotong | 3 |
Liu, Sicong | 2 |
Mike Stieff | 2 |
Retnawati, Heri | 2 |
Sideridis, Georgios D. | 2 |
Stephanie M. Werner | 2 |
Ying Chen | 2 |
Ad'hiya, Eka | 1 |
Publication Type
Reports - Research | 51 |
Journal Articles | 50 |
Reports - Evaluative | 5 |
Reports - Descriptive | 2 |
Speeches/Meeting Papers | 2 |
Tests/Questionnaires | 2 |
Dissertations/Theses -… | 1 |
Numerical/Quantitative Data | 1 |
Education Level
Secondary Education | 25 |
Elementary Education | 18 |
Higher Education | 14 |
High Schools | 13 |
Postsecondary Education | 11 |
Middle Schools | 9 |
Junior High Schools | 8 |
Grade 8 | 7 |
Early Childhood Education | 3 |
Grade 4 | 3 |
Grade 7 | 3 |
Location
Indonesia | 6 |
Turkey | 5 |
Germany | 4 |
California | 2 |
Greece | 2 |
Idaho | 2 |
Japan | 2 |
Jordan | 2 |
Nevada | 2 |
Nigeria | 2 |
South Africa | 2 |
Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024
This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…
Descriptors: Korean, Test Validity, Test Reliability, Imitation
Y. Yokhebed; Rexy Maulana Dwi Karmadi; Luvia Ranggi Nastiti – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025
Although self-assessment in critical thinking is thought to help students recognise their strengths and weaknesses, the reliability and validity of the assessment tool are still questionable, so a more objective evaluation is needed. The objective of this investigation is to assess the self-assessment tools in evaluating students' critical thinking…
Descriptors: Self Evaluation (Individuals), Critical Thinking, Science and Society, Test Validity
Yoo Jeong Jang – ProQuest LLC, 2022
Despite the increasing demand for diagnostic information, observed subscores have been often reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…
Descriptors: Classification, Accuracy, Item Response Theory, Correlation
Stephanie M. Werner; Ying Chen; Mike Stieff – Journal of Chemical Education, 2021
The Chemistry Self-Concept Inventory (CSCI) is a widely used instrument within chemistry education research. Yet, agreement on its overall reliability and validity is lacking, and psychometric analyses of the instrument remain outstanding. This study examined the psychometric properties of the subscale and item function of the CSCI on 1140 high…
Descriptors: Self Concept Measures, Chemistry, Psychometrics, Item Response Theory
Kirya, Kent Robert; Mashood, Kalarattu Kandiyi; Yadav, Lakhan Lal – Journal of Turkish Science Education, 2022
In this study, we administered and evaluated circular motion concept question items with a view to developing an inventory suitable for the Ugandan context. Before administering the circular concept items, six physics experts and ten undergraduate physics students carried out the face and content validation. One hundred eighteen undergraduate…
Descriptors: Motion, Scientific Concepts, Test Construction, Test Items
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
Stephanie M. Werner; Ying Chen; Mike Stieff – Grantee Submission, 2021
The Chemistry Self-Concept Inventory (CSCI) is a widely used instrument within chemistry education research. Yet, agreement on its overall reliability and validity is lacking, and psychometric analyses of the instrument remain outstanding. This study examined the psychometric properties of the subscale and item function of the CSCI on 1,140 high…
Descriptors: Self Concept Measures, Chemistry, Psychometrics, Item Response Theory
Ika Zenita Ratnaningsih; Unika Prihatsanti; Anggun Resdasari Prasetyo; Bambang Sumintono – Journal of Applied Research in Higher Education, 2025
Purpose: The present study aimed to validate the Indonesian-language version of the psychological capital questionnaire (PCQ), specifically within the context of higher education, by utilising Rasch analysis to evaluate reliability and validity aspects such as item-fit statistics, rating scale function, and differential item functioning of the…
Descriptors: Foreign Countries, Indonesian Languages, Test Validity, Psychological Characteristics
Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023
Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attributes. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…
Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity
Nicholas Andrew Soltis; Karen S. McNeal – Journal for STEM Education Research, 2022
System thinking is an important area of study across STEM and non-STEM disciplines. The Earth system approach that drives the geosciences and is essential to issues of sustainability makes system thinking a critical skill in geoscience education. A key area in understanding the development of system thinking skills in the geosciences relies on the…
Descriptors: Test Construction, Test Validity, Science Tests, Scientific Concepts
Bozdag, Hüseyin Cihan; Türkoguz, Suat – International Online Journal of Primary Education, 2021
The study determines the conceptual understanding levels of primary school students on the concept of light according to the Rasch Model with a Four-tier Light Conceptual Understanding Test (LCUT). The participants were 355 (164 girls and 191 boys) primary school students studying at a public school in Izmir city center. In the study, the Rasch…
Descriptors: Foreign Countries, Elementary School Students, Grade 5, Item Response Theory
Akase, Masaki – Language Testing in Asia, 2022
The purpose of this study is to equate and further validate three forms of the vocabulary size test (VST) created by Aizawa and Mochizuki (2010). These three forms, VST 1, 2, and 3, were administered to a cohort of 189 high school students ranging in age from 16 to 18 in April of their 1st, 2nd, and 3rd year of high school. Although these…
Descriptors: Vocabulary Development, Vocabulary Skills, Language Tests, Longitudinal Studies
Rafi, Ibnu; Retnawati, Heri; Apino, Ezi; Hadiana, Deni; Lydiati, Ida; Rosyada, Munaya Nikma – Pedagogical Research, 2023
This study describes the characteristics of the test and its items used in the national-standardized school examination by applying classical test theory and focusing on item difficulty, item discrimination, test reliability, and distractor analysis. We analyzed response data of 191 12th graders from one of the public senior high schools in…
Descriptors: Foreign Countries, National Competency Tests, Standardized Tests, Mathematics Tests
Alnasraween, Moen Salman; Almughrabi, Ayat Mohammad; Ammari, Raeda Mofid; Alkaramneh, Mohammad Saleh – Cypriot Journal of Educational Sciences, 2021
The purpose of this study is to construct a digital culture test in light of the Item Response Theory and to investigate its psychometric properties. The study sample consisted of six hundred fifty (650) male and female students in the eighth grade from the Directorate of Education and Teaching of Salt District. To obtain the results, the…
Descriptors: Foreign Countries, Technological Literacy, Tests, Psychometrics
Tarekegn, Getachew; Alemu, Mekbib; Taddesse, Mesfin; Kind, Per M. – African Journal of Research in Mathematics, Science and Technology Education, 2020
In physics education, assessments address various forms of scientific knowledge. Most of the existing test instruments emphasise the assessment of content knowledge. These tests fail to measure the epistemic aspects of science. Thus, assessing epistemic knowledge is a theme that demands investigation. This study applies Rasch analysis to help…
Descriptors: Science Teachers, Knowledge Level, Energy, Magnets