Exploring Molecular Odor Taxonomies for Structure-based Odor Predictions using Machine Learning
Akshay Sajan, Stijn Sluis, Reza Haydarlou, Sanne Abeln, Pasquale Lisena, Raphael Troncy, Caro Verbeek, Inger Leemans, Halima Mouhib

TL;DR
This study demonstrates that incorporating expert and data-driven odor taxonomies enhances machine learning predictions of molecular odors, advancing understanding of the complex structure-odor relationships and providing valuable resources for future research.
Contribution
It introduces and compares expert and data-driven odor taxonomies, showing their effectiveness in improving structure-based odor prediction models and offering a comprehensive dataset for the community.
Findings
Both taxonomies outperform random groupings in prediction accuracy.
The data-driven taxonomy enables critical evaluation of expert classifications.
Enhanced understanding of molecular odor space through combined taxonomy approaches.
Abstract
One of the key challenges to predict odor from molecular structure is unarguably our limited understanding of the odor space and the complexity of the underlying structure-odor relationships. Here, we show that the predictive performance of machine learning models for structure-based odor predictions can be improved using both, an expert and a data-driven odor taxonomy. The expert taxonomy is based on semantic and perceptual similarities, while the data-driven taxonomy is based on clustering co-occurrence patterns of odor descriptors directly from the prepared dataset. Both taxonomies improve the predictions of different machine learning models and outperform random groupings of descriptors that do not reflect existing relations between odor descriptors. We assess the quality of both taxonomies through their predictive performance across different odor classes and perform an in-depth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
