Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 3 |
Descriptor
Multidimensional Scaling | 3 |
Multitrait Multimethod… | 3 |
Psychometrics | 3 |
Test Reliability | 3 |
Test Validity | 2 |
Undergraduate Students | 2 |
Accuracy | 1 |
Comparative Testing | 1 |
Construct Validity | 1 |
Evaluation Methods | 1 |
Factor Analysis | 1 |
More ▼ |
Author
Amery D. Wu | 1 |
Bhola, Dennison S. | 1 |
Dik, Bryan J. | 1 |
Duffy, Ryan D. | 1 |
Eldridge, Brandy M. | 1 |
Jake Stone | 1 |
Kong, Xiaojing J. | 1 |
Shun-Fu Hu | 1 |
Steger, Michael F. | 1 |
Wise, Steven L. | 1 |
Publication Type
Journal Articles | 3 |
Reports - Research | 2 |
Reports - Evaluative | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 3 |
Postsecondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Dik, Bryan J.; Eldridge, Brandy M.; Steger, Michael F.; Duffy, Ryan D. – Journal of Career Assessment, 2012
Research on work as a calling is limited by measurement concerns. In response, the authors introduce the multidimensional Calling and Vocation Questionnaire (CVQ) and the Brief Calling scale (BCS), instruments assessing presence of, and search for, a calling. Study 1 describes CVQ development using exploratory and confirmatory factor analysis…
Descriptors: Multitrait Multimethod Techniques, Construct Validity, Validity, Test Reliability
Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007
This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…
Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory