Snowflake: A deep learning-based human leukocyte antigen matching algorithm considering allele-specific surface accessibility.

Bibliographic Details
Title:	Snowflake: A deep learning-based human leukocyte antigen matching algorithm considering allele-specific surface accessibility.
Authors:	Niemann, Matthias, Matern, Benedict M., Spierings, Eric
Source:	Frontiers in Immunology; 7/29/2022, Vol. 13, p1-13, 13p
Subject Terms:	HLA histocompatibility antigens, AMINO acid sequence, RECURRENT neural networks, SNOWFLAKES, PEPTIDES
Abstract:	Histocompatibility in solid-organ transplantation has a strong impact on long-term graft survival. Although recent advances in matching of both B-cell epitopes and T-cell epitopes have improved understanding of allorecognition, the immunogenic determinants are still not fully understood. We hypothesized that HLA solvent accessibility is allele-specific, thus supporting refinement of HLA B-cell epitope prediction. We developed a computational pipeline named Snowflake to calculate solvent accessibility of HLA Class I proteins for deposited HLA crystal structures, supplemented by constructed HLA structures through the AlphaFold protein folding predictor and peptide binding predictions of the APE-Gen docking framework. This dataset trained a four-layer long short-term memory bidirectional recurrent neural network, which in turn inferred solvent accessibility of all known HLA Class I proteins. We extracted 676 HLA Class-I experimental structures from the Protein Data Bank and supplemented it by 37 Class-I alleles for which structures were predicted. For each of the predicted structures, 10 known binding peptides as reported by the Immune Epitope DataBase were rendered into the binding groove. Although HLA Class I proteins predominantly are folded similarly, we found higher variation in root mean square difference of solvent accessibility between experimental structures of different HLAs compared to structures with identical amino acid sequence, suggesting HLA’s solvent accessible surface is protein specific. Hence, residues may be surface-accessible on e.g. HLA-A02:01, but not on HLA-A01:01. Mapping these data to antibody-verified epitopes as defined by the HLA Epitope Registry reveals patterns of (1) consistently accessible residues, (2) only subsets of an epitope’s residues being consistently accessible and (3) varying surface accessibility of residues of epitopes. Our data suggest B-cell epitope definitions can be refined by considering allele-specific solvent-accessibility, rather than aggregating HLA protein surface maps by HLA class or locus. To support studies on epitope analyses in organ transplantation, the calculation of donor-allele-specific solvent-accessible amino acid mismatches was implemented as a cloud-based web service. [ABSTRACT FROM AUTHOR]
	Copyright of Frontiers in Immunology is the property of Frontiers Media S.A. and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database:	Complementary Index

More Details
ISSN:	16643224
DOI:	10.3389/fimmu.2022.937587
Published in:	Frontiers in Immunology
Language:	English