Auditing structural bias in machine learning-guided antimicrobial peptide discovery — ASN Events

Auditing structural bias in machine learning-guided antimicrobial peptide discovery (#335)

Victor D Aldas-Bulos 1 , Fabien Plisson 1
  1. Centre for Research and Advanced of the National Polytechnic Institute (CINVESTAV-IPN), Irapuato, GUANAJUATO, Mexico

Antimicrobial peptides (AMPs) are rich and structurally diverse molecules that can kill pathogens by either disrupting their membranes or interacting with their intracellular targets. The lack of bacterial resistance has encouraged their therapeutic avenues against antibiotic-resistant infections.1-2 However, developing the next generation of AMP-based antibiotics includes tackling certain barriers (low metabolic stability, proteolytic degradation, poor oral bioavailability and high toxicity) without compromising their therapeutic promises – a multi-objective optimisation.

Machine learning algorithms intertwine predictive and generative models to design optimised AMP sequences rationally.4 Despite a vast structural diversity, most considered AMP candidates fold into alpha-helical structures. Simple observations or voluntary model bias, the limitations within these models lie in the diversity of peptide sequences and biological information. Here, we audited the structural landscape in GRAMPA, the largest and vastly uncharted repository of 5980 AMPs after benchmarking four protein structure prediction methods (Jpred4, PEP2D, PSIPRED, AlphaFold2).4  The dataset contained mostly loose helices (65.1%), random coils (17.8%), while β-stranded and mixed structures accounted for the rest. We assessed 13 existing machine learning predictors for antimicrobial activity, revealing their preferences for alpha-helical sequences.5

 

 

  1. (1)         Haney, E. F.; et al. Frontiers in Chemistry 2019, 7 (February), 1–22.
  2. (2)         Kozic, M. ; et al. Proteins 2018, 86 (5), 548-565.
  3. (3)         Cardoso, M. H.; et al. Frontiers in Microbiology 2020, 10 (3097), 1-15.
  4. (4)         Aldas-Bulos, V. D. & Plisson, F. Digital Discovery 2023, DOI: 10.1039/d3dd00045a.
  5. (5)         Aldas-Bulos, V. D. & Plisson, F. manuscript in preparation.
#AusPeptide2023