# Time-domain global similarity method for automatic data cleaning for multi-channel measurement systems in magnetic confinement fusion devices

**Authors:** Ting Lan, Jian Liu, Hong Qin, Lin Li Xu

arXiv: 1705.04947 · 2017-09-05

## TL;DR

The paper introduces a machine-learning-based Time-Domain Global Similarity (TDGS) method for automatic data cleaning in magnetic confinement fusion devices. Instead of classifying raw diagnostic sequences directly, it classifies the physical similarity between sequences from different channels, which improves efficiency and objectivity.

## Contribution

The paper proposes the TDGS method, which recasts data sorting as a binary classification of the physical similarity between data sequences from different channels. Because this similarity is independent of discharge parameters, the approach avoids classification criteria that vary shot by shot.

## Key findings

- Reached an optimized performance of 0.9871 on the EAST POlarimeter-INterferomeTer (POINT) system
- Effectively distinguishes correct from incorrect diagnostic data
- Reduces dependence on discharge parameters for data sorting

## Abstract

To guarantee the availability and reliability of data sources in Magnetic Confinement Fusion (MCF) devices, incorrect diagnostic data, which cannot reflect the real physical properties of measured objects, should be sorted out before further analysis and study. Traditional data sorting cannot meet the growing demand of MCF research because of its low efficiency, time delay, and lack of objective criteria. In this paper, a Time-Domain Global Similarity (TDGS) method based on machine learning technologies is proposed for the automatic data cleaning of MCF devices. Traditional data sorting aims at classifying the original diagnostic data sequences, which differ in both length and evolution properties under various discharge parameters. Hence the classification criteria are affected by many discharge parameters and vary shot by shot. The TDGS method focuses instead on the physical similarity between data sequences from different channels, which is more essential and independent of discharge parameters. The complexity arising from real discharge parameters during data cleaning is avoided in the TDGS method by transforming the general data sorting problem into a binary classification problem about the physical similarity between data sequences. As a demonstration of its application to multi-channel measurement systems, the TDGS method is applied to the EAST POlarimeter-INterferomeTer (POINT) system. The optimized performance of the method has reached 0.9871.
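The reframing described above, from classifying individual sequences to classifying the similarity between pairs of channel sequences, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that Pearson correlation with a fixed threshold can stand in for the trained machine-learning classifier, and that a channel is flagged when it is similar to fewer than half of the other channels. The function names (`pearson`, `pair_labels`, `flag_incorrect`, `demo_channels`), the threshold, and the synthetic data are all hypothetical.

```python
import math
import random
from itertools import combinations


def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / math.sqrt(var_a * var_b)


def pair_labels(channels, threshold=0.9):
    """Label every channel pair as similar (True) or dissimilar (False).

    ASSUMPTION: a correlation threshold stands in for a trained binary classifier.
    """
    return {
        (i, j): pearson(channels[i], channels[j]) >= threshold
        for i, j in combinations(range(len(channels)), 2)
    }


def flag_incorrect(channels, threshold=0.9):
    """Flag channels that are similar to fewer than half of the other channels.

    ASSUMPTION: this majority rule is illustrative, not taken from the paper.
    """
    labels = pair_labels(channels, threshold)
    n = len(channels)
    flagged = []
    for k in range(n):
        similar = sum(is_sim for pair, is_sim in labels.items() if k in pair)
        if similar < (n - 1) / 2:
            flagged.append(k)
    return flagged


def demo_channels(n_points=200, seed=0):
    """Synthetic data: three channels share one time evolution, one is pure noise."""
    rng = random.Random(seed)
    base = [math.sin(math.pi * t / (n_points - 1)) for t in range(n_points)]
    good = [
        [amp * (v + rng.gauss(0.0, 0.02)) for v in base]
        for amp in (1.0, 2.0, 0.5)
    ]
    bad = [rng.gauss(0.0, 1.0) for _ in range(n_points)]
    return good + [bad]


channels = demo_channels()
print(flag_incorrect(channels))
```

The three good channels differ in amplitude but share the same time evolution, so the pairwise label does not depend on the absolute signal level. That mirrors, in a simplified way, the point that similarity between channels is less sensitive to discharge parameters than the raw sequences are.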

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.04947/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1705.04947/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/1705.04947/full.md

---
Source: https://tomesphere.com/paper/1705.04947