Large Content And Behavior Models To Understand, Simulate, And Optimize   Content And Behavior

Ashmit Khandelwal; Aditya Agrawal; Aanisha Bhattacharyya; Yaman K; Singla; Somesh Singh; Uttaran Bhattacharya; Ishita Dasgupta; Stefano; Petrangeli; Rajiv Ratn Shah; Changyou Chen; Balaji Krishnamurthy

arXiv:2309.00359·cs.CL·March 19, 2024·1 cites

Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior

Ashmit Khandelwal, Aditya Agrawal, Aanisha Bhattacharyya, Yaman K, Singla, Somesh Singh, Uttaran Bhattacharya, Ishita Dasgupta, Stefano, Petrangeli, Rajiv Ratn Shah, Changyou Chen, Balaji Krishnamurthy

PDF

Open Access 1 Models 1 Datasets 1 Video

TL;DR

This paper introduces Large Content and Behavior Models (LCBMs) that incorporate receiver behavior tokens into training to optimize content for desired behaviors and predict receiver responses, extending beyond traditional content understanding.

Contribution

The paper presents the novel concept of LCBMs that integrate receiver behavior data into language models, enabling behavior prediction and content optimization.

Findings

01

Models perform similarly to LLMs on content understanding tasks.

02

Models generalize well for behavior simulation and domain adaptation.

03

The release of the Content Behavior Corpus (CBC) supports further research.

Abstract

Shannon and Weaver's seminal information theory divides communication into three levels: technical, semantic, and effectiveness. While the technical level deals with the accurate reconstruction of transmitted symbols, the semantic and effectiveness levels deal with the inferred meaning and its effect on the receiver. Large Language Models (LLMs), with their wide generalizability, make some progress towards the second level. However, LLMs and other communication models are not conventionally designed for predicting and optimizing communication for desired receiver behaviors and intents. As a result, the effectiveness level remains largely untouched by modern communication systems. In this paper, we introduce the receivers' "behavior tokens," such as shares, likes, clicks, purchases, and retweets, in the LLM's training corpora to optimize content for the receivers and predict their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
CommerAI/large-social-behavior-model
model

Datasets

behavior-in-the-wild/content-behavior-corpus
dataset· 93 dl
93 dl

Videos

Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior· slideslive

Taxonomy

TopicsTopic Modeling · Hate Speech and Cyberbullying Detection · Natural Language Processing Techniques