Artificial intelligence optical hardware empowers high-resolution hyperspectral video understanding at 1.2 Tb/s
Maksim Makarenko, Qizhou Wang, Arturo Burguete-Lopez, Silvio Giancola,, Bernard Ghanem, Luca Passone, Andrea Fratalocchi

TL;DR
This paper presents an innovative optoelectronic hardware platform that enables real-time hyperspectral video understanding at 1.2 Tb/s, significantly surpassing existing technologies in speed and spectral resolution.
Contribution
The work introduces a novel hardware-accelerated integrated platform combining optical AI processing with machine vision, achieving unprecedented data throughput for multidimensional video analysis.
Findings
Achieves 1.2 Tb/s processing speed for hyperspectral video
Surpasses similar technologies by three to four orders of magnitude
Validates performance in semantic segmentation and object understanding tasks
Abstract
Foundation models, exemplified by GPT technology, are discovering new horizons in artificial intelligence by executing tasks beyond their designers' expectations. While the present generation provides fundamental advances in understanding language and images, the next frontier is video comprehension. Progress in this area must overcome the 1 Tb/s data rate demanded to grasp real-time multidimensional video information. This speed limit lies well beyond the capabilities of the existing generation of hardware, imposing a roadblock to further advances. This work introduces a hardware-accelerated integrated optoelectronic platform for multidimensional video understanding in real-time. The technology platform combines artificial intelligence hardware, processing information optically, with state-of-the-art machine vision networks, resulting in a data processing speed of 1.2 Tb/s with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Infrared Target Detection Methodologies · Advanced Image and Video Retrieval Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Residual Connection · Weight Decay · Cosine Annealing · Refunds@Expedia|||How do I get a full refund from Expedia? · Discriminative Fine-Tuning · Softmax
