Parallel Knowledge Embedding with MapReduce on a Multi-core Processor
Miao Fan, Qiang Zhou, Thomas Fang Zheng, Ralph Grishman

TL;DR
This paper presents a parallelized approach to knowledge embedding using MapReduce on multi-core processors, significantly speeding up training while maintaining comparable accuracy to traditional single-thread methods.
Contribution
It introduces strategies for merging parallel embeddings in MapReduce, enabling scalable and efficient knowledge embedding training without sacrificing performance.
Findings
Achieves comparable results to single-thread TransE in entity and relation tasks.
Significantly increases training speed with multiple cores.
Demonstrates effective merging strategies for parallel embeddings.
Abstract
This article firstly attempts to explore parallel algorithms of learning distributed representations for both entities and relations in large-scale knowledge repositories with {\it MapReduce} programming model on a multi-core processor. We accelerate the training progress of a canonical knowledge embedding method, i.e. {\it translating embedding} ({\bf TransE}) model, by dividing a whole knowledge repository into several balanced subsets, and feeding each subset into an individual core where local embeddings can concurrently run updating during the {\it Map} phase. However, it usually suffers from inconsistent low-dimensional vector representations of the same key, which are collected from different {\it Map} workers, and further leads to conflicts when conducting {\it Reduce} to merge the various vectors associated with the same key. Therefore, we try several strategies to acquire the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Privacy-Preserving Technologies in Data
