Showing all 5 results
Peer reviewed
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Peer reviewed
Park, Siwon – Journal of Pan-Pacific Association of Applied Linguistics, 2017
This paper examines how different test methods may tap different aspects of second language knowledge. To examine this factor, it employs multiple-choice (MC) and constructed-response (CR) items, which yield distinct or convergent information in computer-delivered testing of English. In order to examine the effects of test method, a…
Descriptors: Evaluation Methods, Second Language Learning, English (Second Language), Computer Assisted Testing
Peer reviewed
Salehi, Mohammad – Language Testing in Asia, 2012
Three approaches to validation inquiry were applied to the data obtained from a proficiency test administered to 3,398 PhD candidates as a partial requirement for entering a PhD program in Iran. The data obtained from the reading section were subjected to an exploratory factor analysis (EFA). The EFA yielded nine factors in the reading section.…
Descriptors: Construct Validity, Test Validity, College Entrance Examinations, Doctoral Programs
Peer reviewed
Beaujean, A. Alexander; Firmin, Michael W.; Michonski, Jared D.; Berry, Theodore; Johnson, Courtney – Assessment, 2010
This study assessed trait validity of the Reynolds Intellectual Assessment Scales' (RIAS) Verbal Index (VIX) and Nonverbal Index (NIX) scores in a group of college students. Using both observation of patterns and latent variable modeling of a multitrait-multimethod correlation/covariance matrix, the results indicate that the RIAS VIX scores…
Descriptors: Multitrait Multimethod Techniques, College Students, Intelligence Tests, Test Validity
Peer reviewed
Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007
This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…
Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory