TechRxiv
SurveyonMultimodalTransformersforRobotsTechRxiv_.pdf (2.24 MB)
Download file

Survey on Multimodal Transformers for Robots

Download (2.24 MB)
preprint
posted on 2023-02-05, 17:37 authored by Kazuki MiyazawaKazuki Miyazawa, Takayuki NagaiTakayuki Nagai

In recent years, transformers have been attracting considerable attention in various natural language processing tasks. Recently, they have been used not only in natural language processes, but also for processing multimodal data such as images, video, and audio, and their effectiveness has been demonstrated. The processing of multimodal data is extremely important in robot intelligence. Therefore, the multimodal transformers have the potential to contribute to the development of robotics in various domains. In this paper, we review the application of transformers to robots and discuss the possibility of transformers solving the problems in current intelligent robotics.

Funding

JPMJCR15E3

JPMJMS2011

JP19J23364

History

Email Address of Submitting Author

nagai@sys.es.osaka-u.ac.jp

Submitting Author's Institution

Osaka University

Submitting Author's Country

  • Japan

Usage metrics

    Licence

    Exports