Overview of the NLPCC 2017 Shared Task: Chinese News Headline   Categorization

Xipeng Qiu; Jingjing Gong; Xuanjing Huang

arXiv:1706.02883·cs.CL·June 12, 2017·1 cites

Overview of the NLPCC 2017 Shared Task: Chinese News Headline Categorization

Xipeng Qiu, Jingjing Gong, Xuanjing Huang

PDF

Open Access 1 Repo

TL;DR

This paper provides an overview of the NLPCC 2017 shared task focused on categorizing Chinese news headlines into 18 classes, including details about the dataset and resources for participants.

Contribution

It introduces a new dataset of Chinese news headlines with labels and shares resources for future research in headline classification.

Findings

01

Dataset contains 12,000 labeled headlines.

02

Accessible code and dataset for research use.

03

Framework established for headline categorization tasks.

Abstract

In this paper, we give an overview for the shared task at the CCF Conference on Natural Language Processing \& Chinese Computing (NLPCC 2017): Chinese News Headline Categorization. The dataset of this shared task consists 18 classes, 12,000 short texts along with corresponded labels for each class. The dataset and example code can be accessed at https://github.com/FudanNLP/nlpcc2017_news_headline_categorization.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

FudanNLP/nlpcc2017_news_headline_categorization
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies · Natural Language Processing Techniques