CIKM AnalytiCup 2017 Lazada Product Title Quality Challenge An Ensemble   of Deep and Shallow Learning to predict the Quality of Product Titles

Karamjit Singh; Vishal Sunder

arXiv:1804.01000·cs.CL·April 4, 2018

CIKM AnalytiCup 2017 Lazada Product Title Quality Challenge An Ensemble of Deep and Shallow Learning to predict the Quality of Product Titles

Karamjit Singh, Vishal Sunder

PDF

Open Access

TL;DR

This paper combines deep learning and shallow learning models to predict product title quality, demonstrating that their ensemble outperforms individual models in accuracy.

Contribution

It introduces a novel ensemble approach integrating CNN, LSTM, and LightGBM models for product title quality prediction, leveraging their complementarity.

Findings

01

Ensemble improves prediction accuracy over individual models

02

Deep models capture semantic features effectively

03

Shallow models contribute valuable feature-based insights

Abstract

We present an approach where two different models (Deep and Shallow) are trained separately on the data and a weighted average of the outputs is taken as the final result. For the Deep approach, we use different combinations of models like Convolution Neural Network, pretrained word2vec embeddings and LSTMs to get representations which are then used to train a Deep Neural Network. For Clarity prediction, we also use an Attentive Pooling approach for the pooling operation so as to be aware of the Title-Category pair. For the shallow approach, we use boosting technique LightGBM on features generated using title and categories. We find that an ensemble of these approaches does a better job than using them alone suggesting that the results of the deep and shallow approach are highly complementary

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques

MethodsConvolution