Transformer Based Implementation for Automatic Book Summarization
Siddhant Porwal, Laxmi Bewoor, Vivek Deshpande

TL;DR
This paper explores the use of transformer-based models for automatic book summarization, focusing on abstractive techniques to generate concise and relevant summaries of lengthy documents.
Contribution
It introduces a transformer-based approach specifically tailored for abstractive book summarization, advancing the application of pre-trained language models in this domain.
Findings
Transformer models improve summary relevance and coherence.
Fine-tuning enhances the quality of generated abstracts.
The approach outperforms traditional methods in key metrics.
Abstract
Document Summarization is the procedure of generating a meaningful and concise summary of a given document with the inclusion of relevant and topic-important points. There are two approaches: one is picking up the most relevant statements from the document itself and adding it to the Summary known as Extractive and the other is generating sentences for the Summary known as Abstractive Summarization. Training a machine learning model to perform tasks that are time-consuming or very difficult for humans to evaluate is a major challenge. Book Abstract generation is one of such complex tasks. Traditional machine learning models are getting modified with pre-trained transformers. Transformer based language models trained in a self-supervised fashion are gaining a lot of attention; when fine-tuned for Natural Language Processing(NLP) downstream task like text summarization. This work is an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Dropout · Softmax · Adam · Byte Pair Encoding · Residual Connection · Label Smoothing
