# Column2Vec: Structural Understanding via Distributed Representations of   Database Schemas

**Authors:** Michael J. Mior, Alexander G. Ororbia II

arXiv: 1903.08621 · 2019-03-21

## TL;DR

Column2Vec introduces a method to generate meaningful representations of database columns from metadata, enabling applications like automatic column naming, validated on open-source schemas.

## Contribution

This paper presents Column2Vec, a novel approach for embedding database columns based on metadata, facilitating schema understanding and automation tasks.

## Key findings

- Effective column embeddings generated from metadata
- Successful automatic column naming demonstrated
- Viability shown on open-source schemas

## Abstract

We present Column2Vec, a distributed representation of database columns based on column metadata. Our distributed representation has several applications. Using known names for groups of columns (i.e., a table name), we train a model to generate an appropriate name for columns in an unnamed table. We demonstrate the viability of our approach using schema information collected from open source applications on GitHub.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.08621/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1903.08621/full.md

## References

21 references — full list in the complete paper: https://tomesphere.com/paper/1903.08621/full.md

---
Source: https://tomesphere.com/paper/1903.08621