Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 28 |
Since 2006 (last 20 years) | 74 |
Descriptor
Comparative Analysis | 78 |
Item Response Theory | 78 |
Psychometrics | 78 |
Test Items | 25 |
Models | 23 |
Scores | 23 |
Foreign Countries | 20 |
Measurement | 16 |
Correlation | 15 |
Evaluation Methods | 15 |
Measures (Individuals) | 13 |
Audience
Researchers | 3 |
Practitioners | 1 |
Students | 1 |
Teachers | 1 |
Location
Germany | 4 |
Spain | 3 |
United States | 3 |
Australia | 2 |
California | 2 |
France | 2 |
Nigeria | 2 |
Taiwan | 2 |
Turkey | 2 |
Africa | 1 |
Chile | 1 |
Han, Yuting; Zhang, Jihong; Jiang, Zhehan; Shi, Dexin – Educational and Psychological Measurement, 2023
In the literature of modern psychometric modeling, mostly related to item response theory (IRT), the fit of a model is evaluated through known indices, such as X[superscript 2], M2, and root mean square error of approximation (RMSEA) for absolute assessments as well as Akaike information criterion (AIC), consistent AIC (CAIC), and Bayesian…
Descriptors: Goodness of Fit, Psychometrics, Error of Measurement, Item Response Theory
Xue Zhang; Chun Wang – Grantee Submission, 2022
Item-level fit analysis not only serves as a complementary check to global fit analysis, it is also essential in scale development because the fit results will guide item revision and/or deletion (Liu & Maydeu-Olivares, 2014). During data collection, missing response data may likely happen due to various reasons. Chi-square-based item fit…
Descriptors: Goodness of Fit, Item Response Theory, Scores, Test Length
Yoo Jeong Jang – ProQuest LLC, 2022
Despite the increasing demand for diagnostic information, observed subscores have been often reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…
Descriptors: Classification, Accuracy, Item Response Theory, Correlation
Musa Adekunle Ayanwale – Discover Education, 2023
Examination scores obtained by students from the West African Examinations Council (WAEC), and National Business and Technical Examinations Board (NABTEB) may not be directly comparable due to differences in examination administration, item characteristics of the subject in question, and student abilities. For more accurate comparisons, scores…
Descriptors: Equated Scores, Mathematics Tests, Test Items, Test Format
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
Aborisade, Olatunbosun James; Fajobi, Olutoyin Olufunke – Educational Research and Reviews, 2020
The West African Examinations Council (WAEC) and the National Examinations Council (NECO) are the two major examination bodies charged with awarding the Senior Secondary School Certificate in Nigeria. This study examined the comparability of the psychometric properties of the items constructed by the two examination bodies using Item…
Descriptors: Foreign Countries, Mathematics Tests, Psychometrics, Test Items
Lenhard, Wolfgang; Lenhard, Alexandra – Educational and Psychological Measurement, 2021
The interpretation of psychometric test results is usually based on norm scores. We compared semiparametric continuous norming (SPCN) with conventional norming methods by simulating results for test scales with different item numbers and difficulties via an item response theory approach. Subsequently, we modeled the norm scores based on random…
Descriptors: Test Norms, Scores, Regression (Statistics), Test Items
Madison, Matthew J. – Educational Measurement: Issues and Practice, 2019
Recent advances have enabled diagnostic classification models (DCMs) to accommodate longitudinal data. These longitudinal DCMs were developed to study how examinees change, or transition, between different attribute mastery statuses over time. This study examines using longitudinal DCMs as an approach to assessing growth and serves three purposes:…
Descriptors: Longitudinal Studies, Item Response Theory, Psychometrics, Criterion Referenced Tests
Stewart, John; Drury, Byron; Wells, James; Adair, Aaron; Henderson, Rachel; Ma, Yunfei; Perez-Lemonche, Ángel; Pritchard, David – Physical Review Physics Education Research, 2021
This study reports an analysis of the Force Concept Inventory (FCI) using item response curves (IRC)--the fraction of students selecting each response to an item as a function of their total score. Three large samples (N = 9606, 4360, and 1439) of calculus-based physics students were analyzed. These were drawn from three land-grant institutions…
Descriptors: Physics, Science Instruction, Scientific Concepts, Item Response Theory
von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019
International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs), facilitating the use of new item types and more effective data collection tools. This allows the implementation of more complex test designs and the collection of process and response time (RT) data. These new data types can be used to…
Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory
Kanonire, Tatjana; Federiakin, Denis A.; Uglanova, Irina L. – School Psychology, 2020
The study proposes a multicomponent model of subjective well-being (SWB) in elementary school. The model includes satisfaction with school, affect toward school, well-being related to communication with peers, and subjective physical well-being. The aim of this study is to verify whether well-being related to different aspects of school life can…
Descriptors: Guidelines, Well Being, Elementary School Students, Models
Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019
Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…
Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability
Ozdemir, Hasan Fehmi; Kutlu, Omer; Huang, Shaofu; Crick, Ruth – International Journal of Assessment Tools in Education, 2022
The aim of this study is to adapt the Crick Learning for Resilient Agency (CLARA) to Turkish culture, and to examine the psychometric features of the Inventory according to both Classical Test Theory (CTT) and Item Response Theory (IRT). In this respect, it is a descriptive level survey design research. Two different study groups were formed in…
Descriptors: Item Response Theory, Psychometrics, English (Second Language), English Literature
Türkoguz, Suat – Anatolian Journal of Education, 2020
This study aimed to investigate the item "Response Time Fidelity scores" ("RTFs"), "Kuder-Richardson Reliability" ("KR[subscript 20]") and "Cronbach's Alpha Reliability" ("alpha") coefficients, calculate "KR[subscript 20]" coefficients with "RTFs" for 30 threshold…
Descriptors: Comparative Analysis, Reaction Time, Multiple Choice Tests, Scores
Weber, Ann M.; Marchman, Virginia A.; Diop, Yatma; Fernald, Anne – Journal of Child Language, 2018
Valid indigenous language assessments are needed to further our understanding of how children learn language around the world. We assessed the psychometric properties and performance of two caregiver-report measures of Wolof language skill (language milestones achieved and vocabulary knowledge) for 500 children (ages 0;4 to 2;6) living in rural…
Descriptors: Validity, Caregivers, Child Language, Language Skills