# Enhancing chemical reaction search through contrastive representation learning and human-in-the-loop

**Authors:** Youngchun Kwon, Hyunjeong Jeon, Joonhyuk Choi, Youn-Suk Choi, Seokho Kang

PMC · DOI: 10.1186/s13321-025-00987-5 · Journal of Cheminformatics · 2025-04-10

## TL;DR

This paper introduces a chemical reaction search system that embeds database records as vectors with contrastive learning and refines the retrieved results using users' binary ratings of individual records.

## Contribution

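The contribution is a chemical reaction search system, inspired by recommender systems, that combines contrastive representation learning, dimensionality reduction, and human-in-the-loop feedback so that users can refine results without formulating explicit query rules.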

## Key findings

- The system uses contrastive learning to embed chemical reactions as vectors for efficient search.
- User feedback is integrated iteratively to refine and align search results with preferences.
- Experiments show that the method improves the alignment of search results with user preferences and requirements.
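
The retrieval step described in the findings above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the random `embeddings` array is a placeholder for the output of a trained representation model, PCA stands in for whichever dimensionality reduction method the paper uses, and the helper names `fit_pca`, `project`, and `top_k` are hypothetical.

```python
import numpy as np

def fit_pca(X, n_components):
    """Return (mean, components) for a PCA projection fitted on X."""
    mean = X.mean(axis=0)
    # Rows of vt are principal directions, ordered by explained variance.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]

def project(X, mean, components):
    """Project vectors, then L2-normalise so a dot product is a cosine similarity."""
    Z = (X - mean) @ components.T
    return Z / np.linalg.norm(Z, axis=1, keepdims=True)

def top_k(query_vec, db_vecs, k):
    """Indices of the k database vectors most similar to the query."""
    scores = db_vecs @ query_vec
    return np.argsort(-scores)[:k]

# Assumption: random vectors stand in for learned reaction embeddings.
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(1000, 256))

mean, comps = fit_pca(embeddings, n_components=32)
db = project(embeddings, mean, comps)   # compressed search index

query = db[42]                          # use a database record as the query
hits = top_k(query, db, k=5)
print(hits)
```

Compressing the vectors before indexing reduces the cost of each similarity computation, which is the efficiency motivation given in the abstract.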

## Abstract

In synthesis planning, identifying and optimizing chemical reactions are important for the successful design of synthetic pathways to target substances. Chemical reaction databases assist chemists in gaining insights into this process. Traditionally, searching for relevant records from a reaction database has relied on the manual formulation of queries by chemists based on their search purposes, which is challenging without explicit knowledge of what they are searching for. In this study, we propose an intelligent chemical reaction search system that simplifies the process of enhancing the search results. When a user submits a query, a list of relevant records is retrieved from the reaction database. Users can express their preferences and requirements by providing binary ratings for the individual retrieved records. The search results are refined based on the user feedback. To implement this system effectively, we incorporate and adapt contrastive representation learning, dimensionality reduction, and human-in-the-loop techniques. Contrastive learning is used to train a representation model that embeds records in the reaction database as numerical vectors suitable for chemical reaction searches. Dimensionality reduction is applied to compress these vectors, thereby enhancing the search efficiency. Human-in-the-loop is integrated to iteratively update the representation model by reflecting user feedback. Through experimental investigations, we demonstrate that the proposed method effectively improves the chemical reaction search towards better alignment with user preferences and requirements.

**Scientific contribution:** This study seeks to enhance the search functionality of chemical reaction databases by drawing inspiration from recommender systems. The proposed method simplifies the search process, offering an alternative to the complexity of formulating explicit query rules. We believe that the proposed method can assist users in efficiently discovering records relevant to target reactions, especially when they encounter difficulties in crafting detailed queries due to limited knowledge.
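
The feedback loop in the abstract (retrieve, collect binary ratings, refine) can be illustrated with a small sketch. Note the substitution: the paper updates the representation model itself from user feedback, whereas the code below uses a simpler Rocchio-style query update, which only moves the query vector. It is meant to show the shape of the interaction, not the authors' method. The function names, the weights `alpha`, `beta`, and `gamma`, and the simulated ratings are all assumptions.

```python
import numpy as np

def normalise(v):
    """Scale a vector to unit length."""
    return v / np.linalg.norm(v)

def search(query, db, k):
    """Indices of the k records with the highest cosine similarity to the query."""
    return np.argsort(-(db @ query))[:k]

def refine_query(query, db, liked, disliked, alpha=1.0, beta=0.75, gamma=0.25):
    """Rocchio-style update: move toward liked records and away from disliked ones."""
    new_q = alpha * query
    if liked:
        new_q = new_q + beta * db[liked].mean(axis=0)
    if disliked:
        new_q = new_q - gamma * db[disliked].mean(axis=0)
    return normalise(new_q)

# Assumption: random unit vectors stand in for embedded reaction records.
rng = np.random.default_rng(1)
db = rng.normal(size=(200, 16))
db /= np.linalg.norm(db, axis=1, keepdims=True)
query = normalise(rng.normal(size=16))

first = search(query, db, k=10)
liked = [int(i) for i in first[:3]]       # simulated positive ratings
disliked = [int(i) for i in first[-3:]]   # simulated negative ratings

refined = refine_query(query, db, liked, disliked)
second = search(refined, db, k=10)
print(first, second)
```

Repeating the rate-and-refine step over several rounds gives the iterative behaviour the abstract describes; updating the model rather than the query, as the paper does, would let the feedback reshape the whole embedding space instead of a single search.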

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11987336/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11987336/full.md

## References

2 references — full list in the complete paper: https://tomesphere.com/paper/PMC11987336/full.md

---
Source: https://tomesphere.com/paper/PMC11987336