loading page

Graph Embedding-based Matching Multilingual Out-of-Vocabulary Terms on Social Media
  • Jiakai Gu ,
  • Jason J. Jung
Jiakai Gu
Department of Computer Engineering

Corresponding Author:[email protected]

Author Profile
Jason J. Jung
Author Profile

Abstract

Our study aims to detect multilingual Out-of-Vocabulary (OOV) and matching among multilingual OOV. Based on the original OOV issue, many multilingual OOVs also emerged at the same time. In order to solve this problem, this paper proposes a graph embedding-based matching among multilingual OOV. The method is divided into two parts. The first part is to extract OOV from the network corpus and understand it. In the second part, the OOV in the first part is taken as the target node, and the understood part is taken as the feature node of the target node to construct the graph and embed the graph. Our study uses Chinese, Korean, and Japanese for the experiment. The result of the method that Our study proposed is that F1- score reached 93.94%. Our study also compares this method with other embedding algorithms, and the F1-score is higher than the average of other algorithms F1-score by 9.62%.