GPTEmb_IEEE.pdf (277.45 kB)
Download file

Enhancing Machine Learning Algorithms using GPT Embeddings for Binary Classification

Download (277.45 kB)
posted on 2023-03-27, 05:31 authored by MSVPJ SATHVIKMSVPJ SATHVIK

The Language Model Models (LLMs) have demonstrated their ability to process and understand natural language inputs accurately. This indicates that LLMs are capable of superior natural language processing capabilities. The GPT embeddings are words generated from the GPT backend of the ChatGPT developed by OpenAI, which can produce accurate outputs for user inputs. In this paper, we generate GPT embeddings and apply machine learning algorithms for fake news prediction and sentiment analysis. The proposed method produces remarkable results with an improvement of approximately 12.59% in accuracy compared to traditional embeddings. For fake news detection, GPT embeddings with SVM outperformed LSTM with Glove embeddings. We evaluated 10 machine learning models using 4 versions of GPT embeddings, namely Ada, Babbage, Curie, and Davinci. The sentiment analysis execution resulted in an impressive accuracy of 98.6%. We made our embeddings publicly available for both datasets. We believe that this is a valuable contribution since generating such embeddings requires access to GPT, which is not freely available to researchers.


Email Address of Submitting Author

ORCID of Submitting Author


Submitting Author's Institution

IIIT Dharwad

Submitting Author's Country

  • India

Usage metrics