2025 ARCHIVES

Cambridge Healthtech Institute’s 7th Annual

AI/Machine Learning for Early Drug Discovery – Part 2

AI/ML for Exploring and Screening Complex Target Biology and Chemical Space

April 16 - 17, 2025 ALL TIMES PDT

Artificial Intelligence (AI)/Machine Learning (ML) for Early Drug Discovery is a two-part conference that brings together a diverse group of experts from chemistry, target discovery, pharmacology, and bioinformatics to talk about the increasing use of computational tools, models, algorithms, and data analytics for drug development. The talks will highlight the pros and cons of AI/ML-driven decision-making using relevant case studies from small molecule and peptide drug development. The first part of the conference will focus on how AI/ML can help improve drug design, hit identification, predict PK/PD and drug-like properties, and lead optimization. The second part will focus on emerging computational tools and models for target identification, deconvoluting cellular pathways, and to drive niche applications by exploring chemical space for various therapeutic areas.

Wednesday, April 16

12:00 pmRegistration Open

1:30 pmWelcome Remarks

1:35 pm

Chairperson's Remarks

Ewa Lis, PhD, Founder & CEO, Koliber Biosciences

1:40 pm

Peptide Hit Discovery and Optimization Using Machine Learning and Small Peptide Arrays

Ewa Lis, PhD, Founder & CEO, Koliber Biosciences

Peptide discovery methods like phage and mRNA display are standard tools but face challenges such as high false positive rate or costly licensing, limiting novel therapeutic discovery. We present a new platform combining Koliber’s machine learning technology with RobustDx's peptide array technology, demonstrating that large libraries are unnecessary and that hits can be efficiently optimized to nanomolar binding affinity. Additionally, we showcase visualization techniques for binding mode detection and offer insights into the future of machine learning-driven peptide optimization.

2:10 pm

AlphaBind, a Domain-Specific Model to Predict and Optimize Antibody-Antigen Binding Affinity

Ryan Emerson, PhD, Vice President, Data Science, A Alpha Bio Inc.

We present AlphaBind, a domain-specific model achieving state-of-the-art performance in optimizing antibody affinity using protein language model embeddings and extensive pre-training. We demonstrate affinity optimization for four antibodies with just one round of training data generation per antibody, and we demonstrate the use of a fine-tuned AlphaBind model to guide downstream engineering for biodevelopability and germline reversion for one antibody. AlphaBind weights and code are publicly available.

2:40 pm

Decoding ‘De Novo’ Methods: Applying Generative AI to Drug-Like and Synthesizable Molecule Design

Calvin Nyapete, Digital Solutions Specialist, Digital Chemistry & AI Drug Discovery, MilliporeSigma

Explore how generative AI is reshaping molecule design in drug discovery. This session introduces four key de novo methods—standard, scaffold-, reaction-, and structure-based—and shows how each can be applied to real-world challenges in hit generation, hit-to-lead, and lead optimization. Learn how to design novel, drug-like, and synthesizable compounds using tools built with medicinal chemists in mind.

2:55 pm

Cloud Databases & Screening for Virtual Drug Discovery

Janice Darlington, Scientist, Customer Engagement, Collaborative Drug Discovery Inc.

CDD Vault’s intuitive, web-based platform has helped scientists centralize and analyze their data over the last 20+ years. Biologists and chemists use it to register entities, track inventory, record experiments and even calculate structure activity relationships. Granting you the agency to standardize and structure your data, to not only eliminate bottlenecks and help you optimize resources, CDD Vault leverages the latest AI machine learning capabilities today. A recently released proprietary AI tool to help you discover lead compounds is amongst the latest offerings brought to you in one seamless intuitive platform. Covering how collaboration works in CDD Vault's ecosystem and how new ideas can be generated in your virtual drug discovery efforts will be the focus of the high level overview of the CDD Vault platform.

3:10 pmBreakout Discussions (In-Person Only)

Breakout Discussions are informal, moderated discussions, allowing participants to exchange ideas and experiences and develop future collaborations around a focused topic. Each breakout will be led by a facilitator/s who keeps the discussion on track and the group engaged. Please visit the Breakout Discussions page on the conference website for a complete listing of topics and descriptions. Breakout Discussions are offered in-person only.

In-Person Only BREAKOUT DISCUSSION 4:

AI-driven Drug Design, Screening and Optimization

Ruben Abagyan, PhD, Professor, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego

Peter Canning, PhD, Principal Scientist, Protein & Structural Sciences, CHARM Therapeutics

Alexander Taguchi, PhD, Director, Machine Learning & Antibody Discovery, iBio

In-Person Only BREAKOUT DISCUSSION 5:

How Successful are AI/ML Approaches in Drug Development Today?

Bryce Allen, PhD, Co-Founder & CEO, Differentiated Therapeutics

Ryan Emerson, PhD, Vice President, Data Science, A Alpha Bio Inc.

Leif Eriksson, PhD, Professor, Chemistry & Molecular Biology, University of Gothenburg

3:55 pmRefreshment Break in the Exhibit Hall with Poster Viewing

4:45 pm

Generative Protein Design for Overcoming Immune Tolerance in Antibody Discovery

Alexander Taguchi, PhD, Director, Machine Learning & Antibody Discovery, iBio

Immunizations for antibody discovery are often unproductive when the antigen exhibits high sequence homology with the host. We overcome this immune tolerance problem with a generative protein design platform, enhancing the immune response against the antigen of interest. Using this approach, we showcase how this technology can generate antibodies against various targets including Activin E, a 97% homologous human target that has remained intractable to traditional methods.

5:15 pm PANEL DISCUSSION:

Session Speakers Discuss the Future of AI/ML-Driven Peptide/Antibody Design and Optimization

PANEL MODERATOR:

Ewa Lis, PhD, Founder & CEO, Koliber Biosciences

5:45 pmClose of Day

5:45 pmDinner Short Course Registration

6:15 pmDinner Short Course*

SC7: AI Applications in Drug Development: Strategies for Innovation and Integration

*Premium Pricing or separate registration required. See Short Courses page for details.

Thursday, April 17

7:15 amRegistration Open

7:45 amIn-Person Only Breakfast Small Group Discussions: Navigating Career Challenges

Grab a plate and seat (continental breakfast provided by Drug Discovery Chemistry) to talk about career challenges with fellow scientists at your table. This session is being offered in-person only (not recorded).

8:30 am

Plenary Welcome Remarks from Lead Content Director

Anjani Shah, PhD, Senior Conference Director, Cambridge Healthtech Institute

8:35 am

Plenary Keynote Introduction

Jennifer D. Venable, PhD, Senior Director, Discovery Chemistry Site Head, Janssen La Jolla

8:40 am PLENARY KEYNOTE:

Simplifying Synthesis with Radicals

Phil Baran, PhD, Chair & Professor, Department of Chemistry, Scripps Research Institute

Our latest findings on how the use of radical cross-coupling can dramatically simplify the practice of medicinal chemistry will be presented through the invention of reactions that have wide-substrate scope, use ubiquitous starting materials, and are experimentally trivial to conduct.

9:25 amCoffee Break in the Exhibit Hall with Poster Viewing and Best of Show Awards Announced

10:15 am

Chairperson's Remarks

Ruben Abagyan, PhD, Professor, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego

10:20 am

Ultrafast Screening and Optimization in Allosteric Pockets with 3D/AI-CPU/GPU Pipeline: Flavivirus Proteases and More

Ruben Abagyan, PhD, Professor, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego

Finding the first potent and selective inhibitors against a transient, allosteric, or protein-protein interaction pocket is a challenge requiring multiple levels of data, tools, profile definitions, and ultra large screens combined with in silico compound optimization. We present a cloud-based CPU/GPU pipeline designed for that purpose and its application for identifying drug candidates among multibillion compounds. Examples with anti-cancer targets and inhibitors of anti-flaviviral proteases are presented.

10:50 am

Novel AI-Based Methods for Ultra-Large and Ultra-Fast Virtual Screening in Drug Discovery

Leif Eriksson, PhD, Professor, Chemistry & Molecular Biology, University of Gothenburg

The druglike chemical space of available molecular databases contains ~10¹⁰ molecules, and grows faster than traditional screening approaches can handle. We present benchmarked methods that circumvent conformational sampling, enabling ultra-large and ultra-fast screening, including a novel AI-based scoring function, generative AI, and scaffold optimization. We also report on a data-driven molecular descriptor model using Neural Machine Translation, for effectively predicting protonation states, performing similarity searches, and generating molecular derivatives.

11:20 am

Supercharge Computational Drug Discovery with AI-Powered Serverless High-Performance Computing (HPC)

Fengbo Ren, CEO, Computer Science & Engineering, Fovus Corp.

Fovus is an AI-powered, serverless high-performance computing (HPC) platform delivering intelligent, scalable, and cost-efficient supercomputing power at the computational scientists' fingertips. Fovus uses AI to optimize HPC strategies and orchestrates cloud logistics, making cloud HPC a no-brainer and ensuring sustained time-cost optimality for computational drug discovery amid quickly evolving cloud infrastructure. By accelerating time-to-insights and optimizing cloud costs, Fovus helps Biotech clients accelerate Design-Make-Test-Analyze (DMTA) cycles and discover more with less. Join this talk to learn how Fovus can supercharge your computational drug discovery with case studies and GROMACS/AlphaFold 3 benchmarking results.

11:35 am

AI-Driven Virtual Screening and Polypharmacology Analysis

Sita Sirisha Madugula, PhD, Postdoctoral Research Associate, Center for Nanophase Materials Sciences, Oak Ridge National Laboratory

Our research demonstrates the potential of AI and machine learning in drug repurposing, specifically for tuberculosis (TB). Through unsupervised learning and polypharmacology approaches, we identified FDA-approved drugs with potential for repurposing by analyzing molecular descriptors and multi-target interactions. These methods offer efficient pathways to explore chemical and biological spaces, providing new insights into drug efficacy and paving the way for therapeutic solutions in infectious and non-infectious diseases.

12:05 pm

PANEL DISCUSSION: Session Speakers Discuss Strategies for Exploring Chemical and Biological Spaces Using AI/ML Tools

PANEL MODERATOR:

Ruben Abagyan, PhD, Professor, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego

12:35 pmTransition to Lunch

12:40 pm

LUNCHEON PRESENTATION: Beyond Hit ID: Transforming DEL Data Complexity into Powerful Drug Discovery Solutions

Erin Davis, CTO, X-Chem, Inc.

DNA-encoded libraries (DELs) have transformed drug discovery, enabling novel modulator identification across diverse targets. However, DEL data complexity requires high-quality signal-to-noise for AI/ML to extract meaningful insights. This talk explores how X-Chem’s 15 years of DEL data and bespoke ML approaches enhance hit selection and structure-function mapping, unlocking new opportunities in drug discovery through real-world case studies.

1:10 pmDessert Break in the Exhibit Hall: Meet the VCs, Poster Prize Awarded and Book Raffle Winners Announced

1:35pm Poster Winner Announced & Prize Awarded

1:40 pm Book Raffle with Author Signings

(Book Raffle: during exhibit hall breaks until the raffle drawing, enter your name in raffle bins of associated drug discovery books for a chance to win a signed copy of the book. Winners must be present to win).

2:00 pm PLENARY PANEL DISCUSSION:

Venture Capitalist Insights into Trends in Drug Discovery

PANEL MODERATOR:

Daniel A. Erlanson, PhD, Chief Innovation Officer, Frontier Medicines Corporation

PANELISTS:

Seth Lieblich, PhD, Principal, 8VC

Chris Smith, PhD, CSO Partner Team, Curie.Bio

Rachit Neupane, PhD, Life Sciences Investor, General Catalyst

Wendy Young, PhD, Scientific Advisor & Board Director; Former Senior Vice President, Small Molecule Drug Discovery, Genentech

2:50 pm

Chairperson's Remarks

Arvind Rao, PhD, Associate Professor, Department of Computational Medicine and Bioinformatics, University of Michigan

2:55 pm

Harmonizing Diverse Data Types and Sources for Drug Discovery and Machine Learning

Peter Canning, PhD, Principal Scientist, Protein & Structural Sciences, CHARM Therapeutics

DragonFold is CHARM therapeutics' state-of-the-art co-folding platform for prediction of ligand-bound protein structures. Evidence has shown that the functional performance of many ML models improves with target-specific training data. We have established a platform to collect and organize various internal and external data sources to inform drug discovery projects and train ML models for improved output confidence.

3:25 pm

AI Methods to Integrate Multi-Modal Omics, Spatial, and Single-Cell Profiling to Identify Mechanisms and Potential Therapeutic Opportunities

Arvind Rao, PhD, Associate Professor, Department of Computational Medicine and Bioinformatics, University of Michigan

Spatial profiling technologies coupled with scRNAseq enable a multi-factorial, multi-modal characterization of the tissue microenvironment. Objective scoring methods inspired by recent advances in statistics and ML can aid the interpretation of these datasets, as well as their integration with companion data like bulk and single-cell genomics. I will discuss analysis paradigms from ML that can be used to integrate and prioritize gene regulatory programs (and therapeutic candidates) underlying oncogenesis.

3:55 pmNetworking Refreshment Break

4:10 pm

Machine Learning for 3D-Aware Molecular Representations in DEL

Angelina Heyler, Data Scientist, Encoded Libraries, GSK

DNA-encoded libraries (DELs) enable screening billions of ligands against protein targets of interest. To select hits for off-DNA evaluation, quantitative structure-activity relationship (QSAR) modeling is frequently used to find structural features that contribute to enrichment. However, current QSAR typically relies on 2D molecular representations. We leverage machine learning to learn 3D molecular representations for application in hit selection.

4:40 pm

Ligandability of WDR-Containing Proteins Using DEL Then ML

Peter J. Brown, PhD, Chemical Probes, University of North Carolina at Chapel Hill

Target class–focused drug discovery has a strong track record in pharmaceutical research, yet public domain data indicate that many protein families remain unliganded. Here we present a systematic approach to scale up the discovery and characterization of small molecule ligands for the WD40 repeat (WDR) protein family. A pilot hit-finding campaign using DNA-encoded chemical library selection followed by machine learning (DEL-ML) yielded first-in-class, drug-like ligands for 7 of the 16 WDR domains screened. This study establishes a template for evaluation of protein family ligandability and provides extensive WDR resources to discover ligands for this underexplored target class.

5:10 pm

DELs in Medicinal Chemist's Toolbox: Applications beyond Hit Discovery

Kirill Novikov, PhD, Principal Scientist, High Throughput Chemistry, insitro

DNA-encoded libraries (DELs) are traditionally used to find potential hit compounds for specific targets. At insitro, we are enhancing this technology to aid in later stages of drug discovery. We employ targeted second-generation DELs for efficient exploration of chemical space surrounding hit structures. By employing affinity-based electrophoretic separations, such as nDexer and capillary electrophoresis, we can rank DEL members, facilitating early structure-activity relationship (SAR) hypothesis formation and machine-learning model training to refine predictive accuracy in this chemical environment.

5:40 pmClose of Conference