Natural Language Processing (almost) from Scratch

Ronan Collobert; Jason Weston; Leon Bottou; Michael Karlen; Koray; Kavukcuoglu; Pavel Kuksa

arXiv:1103.0398·cs.LG·March 3, 2011·5.2k cites

Natural Language Processing (almost) from Scratch

Ronan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray, Kavukcuoglu, Pavel Kuksa

PDF

Open Access 2 Repos

TL;DR

This paper introduces a versatile neural network architecture that learns internal representations from large unlabeled datasets, enabling it to perform multiple NLP tasks without task-specific engineering.

Contribution

The work presents a unified neural approach that minimizes task-specific feature engineering, relying instead on learned representations from unlabeled data for various NLP tasks.

Findings

01

Achieved good performance across multiple NLP tasks

02

Reduced need for task-specific feature engineering

03

Built a computationally efficient, freely available tagging system

Abstract

We propose a unified neural network architecture and learning algorithm that can be applied to various natural language processing tasks including: part-of-speech tagging, chunking, named entity recognition, and semantic role labeling. This versatility is achieved by trying to avoid task-specific engineering and therefore disregarding a lot of prior knowledge. Instead of exploiting man-made input features carefully optimized for each task, our system learns internal representations on the basis of vast amounts of mostly unlabeled training data. This work is then used as a basis for building a freely available tagging system with good performance and minimal computational requirements.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications