ERIC Number: EJ1420817
Record Type: Journal
Publication Date: 2024
Pages: 18
Abstractor: As Provided
ISBN: N/A
ISSN: N/A
EISSN: EISSN-2469-9896
Available Date: N/A
Evaluating IBM's Watson Natural Language Processing Artificial Intelligence as a Short-Answer Categorization Tool for Physics Education Research
Physical Review Physics Education Research, v20 n1 Article 010116 2024
Recent advances in publicly available natural language processors (NLP) may enhance the efficiency of analyzing student short-answer responses in physics education research (PER). We train a state-of-the-art NLP, IBM's Watson, and test its agreement with human coders using two different studies that gathered text responses in which students explain their reasoning on physics-related questions. The first study analyzes 479 student responses to a lab data analysis question and categorizes them by main idea. The second study analyzes 732 student answers to identify the presence or absence of each of the two conceptual themes. When training Watson with approximately one-third to half of the samples, we find that samples labeled with high confidence scores have similar accuracy to human agreement; yet for lower confidence scores, humans outperform the NLP's labeling accuracy. In addition to studying Watson's overall accuracy, we use this analysis to better understand factors that impact how Watson categorizes. Using the data from the categorization study, we find that Watson's algorithm does not appear to be impacted by the disproportionate representation of categories in the training set, and we examine mislabeled statements to identify vocabulary and phrasing that may increase the rate of false positives. Based on this work, we find that, with careful consideration of the research study design and an awareness of the NLP's limitations, Watson may present a useful tool for large-scale PER studies or classroom analysis tools.
Descriptors: Artificial Intelligence, Physics, Natural Language Processing, Computer Uses in Education, Classification, Computer Software Evaluation, Students, Data Analysis, Educational Research, Accuracy
American Physical Society. One Physics Ellipse 4th Floor, College Park, MD 20740-3844. Tel: 301-209-3200; Fax: 301-209-0865; e-mail: assocpub@aps.org; Web site: https://journals.aps.org/prper/
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: National Science Foundation (NSF), Division of Undergraduate Education (DUE)
Authoring Institution: N/A
Grant or Contract Numbers: 2021099
Author Affiliations: N/A