# Infrared Spectral Descriptors for Reaction Yield Prediction: Toward Redefining Experimental Spaces

**Authors:** Yuya Endo, Hiromasa Kaneko

PMC · DOI: 10.1002/minf.70019 · Molecular Informatics · 2026-02-12

## TL;DR

This paper introduces new infrared-based descriptors that improve predictions of reaction yields in catalytic chemistry.

## Contribution

The study proposes novel wavenumber-based IR descriptors that outperform traditional methods in predicting catalytic reaction yields.

## Key findings

- Wavenumber-based IR descriptors outperformed conventional molecular descriptors and fingerprints in yield prediction.
- Descriptors limited to the fingerprint region (0–1700 cm−1) achieved high accuracy and improved interpretability.
- The approach shows strong generalization performance even with small datasets.

## Abstract

Yield prediction in catalytic reactions is essential for improving chemical process efficiency and product quality. Ligands significantly influence reactivity and selectivity, highlighting the need for descriptors that accurately capture their structural and electronic properties. In this study, we focus on infrared (IR) spectra, which reflects molecular vibrational modes, and propose novel descriptors based on wavenumber information. We evaluated the predictive performance of these descriptors using two datasets: direct Pd‐catalyzed arylation and Suzuki–Miyaura coupling reactions. The wavenumber‐based IR descriptors outperformed conventional molecular descriptors and structural fingerprints (one‐hot encoding, Mordred, MACCS, Morgan fingerprint, RDKit, and density functional theory). Notably, descriptors limited to the fingerprint region (0–1700 cm−1) effectively captured key molecular features, contributing to both high prediction accuracy and improved chemical interpretability. Our results indicate that IR‐based descriptors can achieve strong generalization performance even with small datasets. This approach offers a promising strategy for redefining reaction condition spaces and enhancing the interpretability of predictive models, thereby supporting more informed experimental design.

Wavenumber‐based IR descriptors improve yield prediction in catalytic reactions by capturing ligand structure and electronics. They outperform one‐hot encoding, RDKit, Mordred, MACCS, Morgan, and density functional theory descriptor, offering a robust tool for guiding ligand selection and optimizing experimental conditions.© 2026 WILEY‐VCH GmbH

## Full-text entities

- **Chemicals:** Pd (MESH:D010165)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12899324/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12899324/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/PMC12899324/full.md

---
Source: https://tomesphere.com/paper/PMC12899324