A Study of the Efficiency of Spatial Indexing Methods Applied to Large Astronomical Databases
G. B. Berriman, J. C. Good, B. Shiao, T. Donaldson

TL;DR
This study compares the performance of HTM and HEALPix spatial indexing methods on large astronomical databases across different servers, revealing hardware I/O throughput as a key performance factor over indexing scheme choice.
Contribution
It provides a comprehensive performance comparison of HTM and HEALPix indexing methods on real astronomical datasets, emphasizing hardware I/O over indexing scheme differences.
Findings
Performance is mainly limited by I/O throughput.
Index choice has minimal impact at comparable levels.
Higher index levels improve performance but are affected by query size and cell granularity.
Abstract
We report the results of a study to compare the performance of two common database indexing methods, HTM and HEALPix, on Solaris and Windows database servers installed with PostgreSQL, and a Windows Server installed with MS SQL Server. The indexing was applied to the 2MASS All-Sky Catalog and to the Hubble Source Catalog, which approximate the diversity of catalogs common in astronomy. On each server, the study compared indexing performance by submitting 1 million queries at each index level with random sky positions and random cone search radius, which was computed on a logarithmic scale between 1 arcsec and 1 degree, and measuring the time to complete the query and write the output. These simulated queries, intended to model realistic use patterns, were run in a uniform way on many combinations of indexing method and indexing depth. The query times in all simulations are strongly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Database Systems and Queries · Geographic Information Systems Studies
