Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Machine Learning for <i>De Novo</i> Molecular Generation: A Comprehensive Review
1
Zitationen
2
Autoren
2026
Jahr
Abstract
Deep generative models have emerged as powerful computational engines for <i>de novo</i> molecular design, enabling efficient exploration of a vast chemical space that remains inaccessible to traditional experimental approaches. This review provides a comprehensive survey of machine learning-driven molecular generation, systematically organizing the field across three foundational pillars: molecular representations, model architectures, and evaluation frameworks. We present a detailed taxonomy of state-of-the-art generative models, including Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), Recurrent Neural Networks (RNNs), Transformers, Diffusion Models, Normalizing Flows, and Hybrid Architectures, analyzing their underlying mechanisms, comparative strengths, and inherent limitations. Critically, we depart from purely descriptive surveys by systematically examining algorithmic failure modes and practical deployment challenges across model families. We discuss core applications spanning distribution learning and goal-directed generation. Special attention is given to challenging therapeutic domains such as Central Nervous System (CNS) drug discovery, where stringent constraints like blood-brain barrier (BBB) permeability and neurotoxicity mitigation demand multiparameter optimization. We critically evaluate the gap between computational benchmarks and practical medicinal chemistry, addressing synthetic feasibility and experimental validation. Subsequently, we highlight persistent theoretical, computational, and empirical challenges that currently limit widespread deployment, and outline promising future opportunities, including physics-informed architectures, large language models, and autonomous laboratories. This review aims to provide actionable insights for both machine learning researchers and medicinal chemists engaged in next-generation drug discovery.
Ähnliche Arbeiten
UCSF Chimera—A visualization system for exploratory research and analysis
2004 · 46.968 Zit.
AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading
2009 · 35.369 Zit.
Gaussian basis sets for use in correlated molecular calculations. I. The atoms boron through neon and hydrogen
1989 · 31.272 Zit.
The M06 suite of density functionals for main group thermochemistry, thermochemical kinetics, noncovalent interactions, excited states, and transition elements: two new functionals and systematic testing of four M06-class functionals and 12 other functionals
2007 · 29.289 Zit.
<i>VESTA 3</i> for three-dimensional visualization of crystal, volumetric and morphology data
2011 · 24.019 Zit.