# Cross-Database Characterization of Flavonoids and Phenolic Acids: Integrating Drug-likeness Metrics, Molecular Interactions, and Dietary Sources

**Authors:** Christmas Maria Vidal de Barros Rêgo, Zafirah Muhammad Rahman, Anna Paula Aguiar, Tatiane Fabiane Ferreira dos Santos, Sergio Senar, Luciana Aparecida Campos, Ovidiu Constantin Baltatu

PMC · DOI: 10.3390/molecules31040728 · Molecules · 2026-02-20

## TL;DR

This study evaluates the drug potential and food sources of flavonoids and phenolic acids using a unified database approach.

## Contribution

A novel integrative framework combining drug-likeness metrics, molecular interactions, and dietary sources for flavonoids and phenolic acids.

## Key findings

- Isoflavones showed the best drug-likeness profiles with a mean QED score of 0.62.
- Flavonoids had higher binding affinities and targeted more proteins than phenolic acids.
- Herbs and spices were identified as the richest sources of these compounds.

## Abstract

Background: Flavonoids and phenolic acids are recognized for their diverse therapeutic potential, yet their translation into clinical applications remains limited by varying bioavailability and fragmented characterization across databases. A systematic integrative approach is needed to comprehensively evaluate these compounds’ drug-likeness properties based on computational metrics, molecular interactions, and dietary sources within a unified framework. Methods: We analyzed 954 compounds (715 flavonoids, 239 phenolic acids) by integrating data from PhytoHub, Phenol-Explorer, ChEMBL, and FoodDB databases. Drug-likeness was assessed using established metrics, including QED (Quantitative Estimate of Drug-likeness) and DataWarrior drug-likeness scores. Molecular interaction patterns were characterized through ChEMBL activity data, and food source distributions were systematically mapped across major food groups. Results: Drug-likeness assessment revealed complementary evaluation patterns between QED (mean = 0.48 ± 0.24) and DataWarrior scores (mean = −2.46 ± 4.38), with moderate inter-correlation (r = 0.41), indicating that each metric captures distinct aspects of molecular properties. Isoflavones demonstrated the most favorable drug-likeness profiles (mean QED: 0.62 ± 0.18). Molecular interaction analysis demonstrated significantly higher binding affinities for flavonoids (mean ChEMBL activity score: 7.26 ± 1.09) compared to phenolic acids (6.98 ± 0.94, p = 0.014), with flavonoids targeting a broader range of proteins (67 unique targets vs. 33 for phenolic acids). Food source mapping identified herbs and spices as the richest sources (up to 14,500 mg/kg), followed by fruits (40,490 mg/kg total) and teas (37,101 mg/kg total), with distinct compound distribution patterns across food groups. Conclusions: This integrative cross-database approach provides a comprehensive characterization framework for flavonoids and phenolic acids, combining established drug-likeness metrics, molecular interaction analysis, and dietary source mapping. The methodology establishes a systematic foundation for compound evaluation in drug development and nutritional research.

## Linked entities

- **Chemicals:** isoflavones (PubChem CID 72304)

## Full-text entities

- **Diseases:** cardiovascular diseases (MESH:D002318), atherosclerosis (MESH:D050197), cancer (MESH:D009369), chronic venous insufficiency (MESH:D014689), inflammation (MESH:D007249), injury to (MESH:D014947), neurodegenerative disorders (MESH:D019636)
- **Chemicals:** anthocyanins (MESH:D000872), Hydrogen (MESH:D006859), flavones (MESH:D047309), Sativanone (MESH:C000607319), Sanguiin H-6 (MESH:C081844), glycosides (MESH:D006027), Flavonoid (MESH:D005419), polyphenol (MESH:D059808), naringenin (MESH:C005273), lipid (MESH:D008055), Isoflavones (MESH:D007529), flavanones (MESH:D044950), tannins (MESH:D013634), 3'-O-Methyl-(-)-epicatechin 4'-O-sulfate (-), n-octanol (MESH:D020003), Hydroxybenzoic acids (MESH:D062385), aglycones (MESH:C458179), liquiritigenin (MESH:C083152), alkaloids (MESH:D000470), Octanol (MESH:D000442), carotenoids (MESH:D002338), Phenol (MESH:D019800), Water (MESH:D014867), 2-phenylethanol (MESH:D010626), ellagitannins (MESH:D047348), terpenoids (MESH:D013729), genistein (MESH:D019833), Phenolic Acid (MESH:C017616), Quercetin (MESH:D011794), nitrogen (MESH:D009584), hydroxycinnamic acids (MESH:D003373), quercetin 3-rutinoside (MESH:D012431), flavonols (MESH:D044948), Punicalagin (MESH:C115642), flavan-3-ols (MESH:C404987), chalcones (MESH:D047188), gallic acid (MESH:D005707), oxygen (MESH:D010100), Diosmin (MESH:D004145)
- **Species:** Petroselinum crispum (parsley, species) [taxon 4043], Fagopyrum esculentum (common buckwheat, species) [taxon 3617], Homo sapiens (human, species) [taxon 9606], Dracocephalum officinale (hyssop, species) [taxon 39324], Mangifera indica (mango, species) [taxon 29780], Citrus sinensis (apfelsine, species) [taxon 2711], Malus domestica (apple, species) [taxon 3750]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12943102/full.md

## Figures

27 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12943102/full.md

## References

59 references — full list in the complete paper: https://tomesphere.com/paper/PMC12943102/full.md

---
Source: https://tomesphere.com/paper/PMC12943102