Data Interpolation: An Efficient Sampling Alternative for Big Data   Aggregation

Hadassa Daltrophe; Shlomi Dolev; Zvi Lotker

arXiv:1210.3171·cs.NI·October 12, 2012·2 cites

Data Interpolation: An Efficient Sampling Alternative for Big Data Aggregation

Hadassa Daltrophe, Shlomi Dolev, Zvi Lotker

PDF

Open Access

TL;DR

This paper introduces efficient polynomial interpolation methods for large sensor datasets, capable of handling noise and adversarial data, to better summarize and analyze big data in sensor networks.

Contribution

It extends existing interpolation techniques to multidimensional, noisy, and Byzantine data, providing robust solutions for big data aggregation.

Findings

01

Effective interpolation methods for noisy and Byzantine data

02

Extension of Welch-Berlekamp and Arora-Khot techniques to multidimensional data

03

Robust data representation for sensor data analysis

Abstract

Given a large set of measurement sensor data, in order to identify a simple function that captures the essence of the data gathered by the sensors, we suggest representing the data by (spatial) functions, in particular by polynomials. Given a (sampled) set of values, we interpolate the datapoints to define a polynomial that would represent the data. The interpolation is challenging, since in practice the data can be noisy and even Byzantine, where the Byzantine data represents an adversarial value that is not limited to being close to the correct measured data. We present two solutions, one that extends the Welch-Berlekamp technique in the case of multidimensional data, and copes with discrete noise and Byzantine data, and the other based on Arora and Khot techniques, extending them in the case of multidimensional noisy and Byzantine data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Sensor Networks and Detection Algorithms · Target Tracking and Data Fusion in Sensor Networks · Error Correcting Code Techniques