From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering

Bibliographic Details
Title: From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering
Authors: Li, Wen-ran, Cadet, Xavier F., Medina-Ortiz, David, Davari, Mehdi D., Sowdhamini, Ramanathan, Damour, Cedric, Li, Yu, Miranville, Alain, Cadet, Frederic
Publication Year: 2025
Collection: Computer Science
Quantitative Biology
Subject Terms: Quantitative Biology - Quantitative Methods, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
More Details: Protein design with desirable properties has been a significant challenge for many decades. Generative artificial intelligence is a promising approach and has achieved great success in various protein generation tasks. Notably, diffusion models stand out for their robust mathematical foundations and impressive generative capabilities, offering unique advantages in certain applications such as protein design. In this review, we first give the definition and characteristics of diffusion models and then focus on two strategies: Denoising Diffusion Probabilistic Models and Score-based Generative Models, where DDPM is the discrete form of SGM. Furthermore, we discuss their applications in protein design, peptide generation, drug discovery, and protein-ligand interaction. Finally, we outline the future perspectives of diffusion models to advance autonomous protein design and engineering. The E(3) group consists of all rotations, reflections, and translations in three-dimensions. The equivariance on the E(3) group can keep the physical stability of the frame of each amino acid as much as possible, and we reflect on how to keep the diffusion model E(3) equivariant for protein generation.
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2501.02680
Accession Number: edsarx.2501.02680
Database: arXiv
FullText Text:
  Availability: 0
CustomLinks:
  – Url: http://arxiv.org/abs/2501.02680
    Name: EDS - Arxiv
    Category: fullText
    Text: View this record from Arxiv
    MouseOverText: View this record from Arxiv
  – Url: https://resolver.ebsco.com/c/xy5jbn/result?sid=EBSCO:edsarx&genre=article&issn=&ISBN=&volume=&issue=&date=20250105&spage=&pages=&title=From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering&atitle=From%20thermodynamics%20to%20protein%20design%3A%20Diffusion%20models%20for%20biomolecule%20generation%20towards%20autonomous%20protein%20engineering&aulast=Li%2C%20Wen-ran&id=DOI:
    Name: Full Text Finder (for New FTF UI) (s8985755)
    Category: fullText
    Text: Find It @ SCU Libraries
    MouseOverText: Find It @ SCU Libraries
Header DbId: edsarx
DbLabel: arXiv
An: edsarx.2501.02680
RelevancyScore: 1147
AccessLevel: 3
PubType: Report
PubTypeId: report
PreciseRelevancyScore: 1146.55029296875
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Li%2C+Wen-ran%22">Li, Wen-ran</searchLink><br /><searchLink fieldCode="AR" term="%22Cadet%2C+Xavier+F%2E%22">Cadet, Xavier F.</searchLink><br /><searchLink fieldCode="AR" term="%22Medina-Ortiz%2C+David%22">Medina-Ortiz, David</searchLink><br /><searchLink fieldCode="AR" term="%22Davari%2C+Mehdi+D%2E%22">Davari, Mehdi D.</searchLink><br /><searchLink fieldCode="AR" term="%22Sowdhamini%2C+Ramanathan%22">Sowdhamini, Ramanathan</searchLink><br /><searchLink fieldCode="AR" term="%22Damour%2C+Cedric%22">Damour, Cedric</searchLink><br /><searchLink fieldCode="AR" term="%22Li%2C+Yu%22">Li, Yu</searchLink><br /><searchLink fieldCode="AR" term="%22Miranville%2C+Alain%22">Miranville, Alain</searchLink><br /><searchLink fieldCode="AR" term="%22Cadet%2C+Frederic%22">Cadet, Frederic</searchLink>
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2025
– Name: Subset
  Label: Collection
  Group: HoldingsInfo
  Data: Computer Science<br />Quantitative Biology
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Quantitative+Biology+-+Quantitative+Methods%22">Quantitative Biology - Quantitative Methods</searchLink><br /><searchLink fieldCode="DE" term="%22Computer+Science+-+Artificial+Intelligence%22">Computer Science - Artificial Intelligence</searchLink><br /><searchLink fieldCode="DE" term="%22Computer+Science+-+Machine+Learning%22">Computer Science - Machine Learning</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: Protein design with desirable properties has been a significant challenge for many decades. Generative artificial intelligence is a promising approach and has achieved great success in various protein generation tasks. Notably, diffusion models stand out for their robust mathematical foundations and impressive generative capabilities, offering unique advantages in certain applications such as protein design. In this review, we first give the definition and characteristics of diffusion models and then focus on two strategies: Denoising Diffusion Probabilistic Models and Score-based Generative Models, where DDPM is the discrete form of SGM. Furthermore, we discuss their applications in protein design, peptide generation, drug discovery, and protein-ligand interaction. Finally, we outline the future perspectives of diffusion models to advance autonomous protein design and engineering. The E(3) group consists of all rotations, reflections, and translations in three-dimensions. The equivariance on the E(3) group can keep the physical stability of the frame of each amino acid as much as possible, and we reflect on how to keep the diffusion model E(3) equivariant for protein generation.
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: Working Paper
– Name: URL
  Label: Access URL
  Group: URL
  Data: <link linkTarget="URL" linkTerm="http://arxiv.org/abs/2501.02680" linkWindow="_blank">http://arxiv.org/abs/2501.02680</link>
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsarx.2501.02680
PLink https://login.libproxy.scu.edu/login?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2501.02680
RecordInfo BibRecord:
  BibEntity:
    Subjects:
      – SubjectFull: Quantitative Biology - Quantitative Methods
        Type: general
      – SubjectFull: Computer Science - Artificial Intelligence
        Type: general
      – SubjectFull: Computer Science - Machine Learning
        Type: general
    Titles:
      – TitleFull: From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Li, Wen-ran
      – PersonEntity:
          Name:
            NameFull: Cadet, Xavier F.
      – PersonEntity:
          Name:
            NameFull: Medina-Ortiz, David
      – PersonEntity:
          Name:
            NameFull: Davari, Mehdi D.
      – PersonEntity:
          Name:
            NameFull: Sowdhamini, Ramanathan
      – PersonEntity:
          Name:
            NameFull: Damour, Cedric
      – PersonEntity:
          Name:
            NameFull: Li, Yu
      – PersonEntity:
          Name:
            NameFull: Miranville, Alain
      – PersonEntity:
          Name:
            NameFull: Cadet, Frederic
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 05
              M: 01
              Type: published
              Y: 2025
ResultId 1