DeepTitle -- Leveraging BERT to generate Search Engine Optimized Headlines
Cristian Anastasiu, Hanna Behnke, Sarah L\"uck, Viktor, Malesevic, Aamna Najmi, Javier Poveda-Panter

TL;DR
This paper presents a method for generating search engine optimized headlines for German news articles using a fine-tuned BERT model, incorporating keywords and human evaluation to improve quality.
Contribution
It introduces a novel approach combining BERT fine-tuning with keyword integration for SEO-friendly headlines in German news articles.
Findings
Achieved a ROUGE-L-gram F-score of 40.02
Incorporated keyword optimization for SEO
Used human evaluation to assess headline quality
Abstract
Automated headline generation for online news articles is not a trivial task - machine generated titles need to be grammatically correct, informative, capture attention and generate search traffic without being "click baits" or "fake news". In this paper we showcase how a pre-trained language model can be leveraged to create an abstractive news headline generator for German language. We incorporate state of the art fine-tuning techniques for abstractive text summarization, i.e. we use different optimizers for the encoder and decoder where the former is pre-trained and the latter is trained from scratch. We modify the headline generation to incorporate frequently sought keywords relevant for search engine optimization. We conduct experiments on a German news data set and achieve a ROUGE-L-gram F-score of 40.02. Furthermore, we address the limitations of ROUGE for measuring the quality of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Web Data Mining and Analysis
