An FNet based Auto Encoder for Long Sequence News Story Generation

Paul K. Mandal; Rakeshkumar Mahto

arXiv:2211.08295·cs.CL·November 18, 2022·1 cites

An FNet based Auto Encoder for Long Sequence News Story Generation

Paul K. Mandal, Rakeshkumar Mahto

PDF

Open Access 1 Repo

TL;DR

This paper introduces an FNet-based autoencoder for generating long news stories, demonstrating efficiency advantages over BERT-based models and analyzing output variations with different seed texts.

Contribution

It presents a novel autoencoder architecture based on FNet for long sequence news generation, highlighting efficiency benefits and output analysis.

Findings

01

FNet autoencoder trains 80% faster on GPUs and 70% faster on TPUs than BERT.

02

The model effectively generates news stories from seed texts.

03

Output quality varies with different seed inputs.

Abstract

In this paper, we design an auto encoder based off of Google's FNet Architecture in order to generate text from a subset of news stories contained in Google's C4 dataset. We discuss previous attempts and methods to generate text from autoencoders and non LLM Models. FNET poses multiple advantages to BERT based encoders in the realm of efficiency which train 80% faster on GPUs and 70% faster on TPUs. We then compare outputs of how this autencoder perfroms on different epochs. Finally, we analyze what outputs the encoder produces with different seed text.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

paulkmandal/fnet_news_headlines
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Video Analysis and Summarization

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Adam · Weight Decay · Residual Connection · Dropout · WordPiece · Attention Dropout