Generative Adversarial Imitation Learning for Empathy-based AI
Pratyush Muthukumar, Karishma Muthukumar, Deepan Muthirayan, Pramod, Khargonekar

TL;DR
This paper introduces a novel GAIL-based model utilizing GPT-2 for empathetic, context-aware conversational AI that outperforms existing models in generating personalized empathetic responses.
Contribution
It applies generative adversarial imitation learning to fine-tune GPT-2 for empathy-driven dialogue generation, a novel approach in conversational AI.
Findings
Model achieves higher response scores than baselines.
Outperforms recent conversational AI models in multi-turn interactions.
Effective in generating concise, personalized empathetic responses.
Abstract
Generative adversarial imitation learning (GAIL) is a model-free algorithm that has been shown to provide strong results in imitating complex behaviors in high-dimensional environments. In this paper, we utilize the GAIL model for text generation to develop empathy-based context-aware conversational AI. Our model uses an expert trajectory of empathetic prompt-response dialogues which can accurately exhibit the correct empathetic emotion when generating a response. The Generator of the GAIL model uses the GPT-2 sequential pre-trained language model trained on 117 million parameters from 40 GB of internet data. We propose a novel application of an approach used in transfer learning to fine tune the GPT-2 model in order to generate concise, user-specific empathetic responses validated against the Discriminator. Our novel GAIL model utilizes a sentiment analysis history-based reinforcement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Reinforcement Learning in Robotics · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Residual Connection · Layer Normalization · Softmax · Attention Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Cosine Annealing
