Spike Protein Evolution and Its Implications for Pan-Coronavirus Therapeutics

Spike Protein Evolution and Its Implications for Pan-Coronavirus Therapeutics

Abstract

The Spike (S) protein, a defining feature of coronaviruses (CoVs), mediates receptor binding and membrane fusion, crucial steps for viral entry. Its exposed location on the viral surface also renders it a primary target for neutralizing antibodies and antiviral interventions. However, the S protein’s high mutation rate, resulting in antigenic drift and shift, poses a significant challenge to the development of broadly effective coronavirus therapeutics. This report delves into the structural and functional diversity of S proteins across various CoVs, examines the evolutionary pressures driving sequence divergence, analyzes the mechanisms of immune evasion employed by the S protein, and discusses the potential of targeting conserved regions for the development of pan-coronavirus vaccines and antiviral drugs. Furthermore, we address the complexities of developing effective therapies given the conformational flexibility and glycosylation patterns inherent to the S protein, and provide insights into potential therapeutic strategies.

1. Introduction

Coronaviruses are enveloped, positive-sense single-stranded RNA viruses belonging to the family Coronaviridae. They are classified into four genera: Alpha-, Beta-, Gamma-, and Delta-coronavirus. While Alpha- and Beta-coronaviruses primarily infect mammals, Gamma- and Delta-coronaviruses are mainly found in avian species (Woo et al., 2010). Several CoVs have crossed the species barrier to infect humans, causing a range of respiratory illnesses, from the common cold to severe acute respiratory syndrome (SARS), Middle East respiratory syndrome (MERS), and the ongoing COVID-19 pandemic caused by SARS-CoV-2 (Zhu et al., 2020). The ability of CoVs to rapidly evolve and adapt has made them a persistent threat to global public health, highlighting the need for broadly effective antiviral strategies.

The Spike (S) protein is a large type I transmembrane glycoprotein that protrudes from the viral surface. It mediates the crucial steps of viral attachment to host cell receptors and subsequent fusion of the viral and cellular membranes, leading to viral entry (Walls et al., 2020). The S protein is synthesized as a single polypeptide precursor that is subsequently cleaved by host cell proteases into two subunits: S1 and S2. The S1 subunit contains the receptor-binding domain (RBD), which interacts with the host cell receptor, determining viral tropism. The S2 subunit contains the fusion peptide (FP) and other elements necessary for membrane fusion. Due to its crucial role in viral entry and its prominent display on the viral surface, the S protein is the main target of neutralizing antibodies and is therefore the focus of most vaccine and therapeutic development efforts (Wrapp et al., 2020).

2. Structural and Functional Diversity of Spike Proteins

The S protein exhibits significant structural and functional diversity across different CoV species. While the overall architecture of the S protein is conserved, the specific amino acid sequences, particularly within the S1 subunit, vary considerably, leading to differences in receptor binding and antigenic properties (Li, 2016). For example, SARS-CoV and SARS-CoV-2 utilize angiotensin-converting enzyme 2 (ACE2) as their receptor (Li et al., 2003; Zhou et al., 2020), whereas MERS-CoV uses dipeptidyl peptidase 4 (DPP4) (Raj et al., 2013). Some coronaviruses, like human coronavirus OC43, utilize sialic acid-containing glycans (Vlasenko et al., 2021), while others bind to aminopeptidase N (APN). This receptor usage dictates the host range and tissue tropism of each virus.

The S1 subunit, particularly the RBD, is the most variable region of the S protein. The RBD contains specific receptor-binding motifs (RBMs) that directly interact with the host cell receptor. These RBMs are often located within exposed loops and are subject to strong selective pressure, resulting in rapid sequence evolution and antigenic drift (Andersen et al., 2020). The S2 subunit, which mediates membrane fusion, is more conserved than the S1 subunit, but even this subunit can exhibit variations in sequence and structure across different CoV species (Bosch et al., 2003).

Furthermore, the glycosylation patterns of the S protein can vary significantly between different CoVs. Glycans are carbohydrate molecules that are attached to specific amino acid residues on the protein surface. These glycans can play important roles in protein folding, stability, and immune evasion. The glycans surrounding the SARS-CoV-2 RBD, for example, can shield it from antibody recognition. Understanding the glycosylation patterns of different S proteins is crucial for designing effective vaccines and therapeutics (Casalino et al., 2020).

3. Evolutionary Pressures and Mutation Patterns

The S protein is under constant evolutionary pressure to adapt to its host environment and evade the host’s immune system. This pressure drives the accumulation of mutations, particularly in the S1 subunit and RBD. The high mutation rate of coronaviruses, due to the error-prone nature of RNA-dependent RNA polymerase (RdRp), contributes significantly to this rapid evolution (Domingo, 2000). Recombination events between different CoV strains can also lead to the emergence of novel variants with altered properties.

Several studies have identified specific hotspots for mutations within the S protein. These hotspots are often located within the RBD and are associated with increased transmissibility, altered receptor binding affinity, or reduced neutralization by antibodies (Dejnirattisai et al., 2021). The emergence of SARS-CoV-2 variants of concern (VOCs), such as Alpha, Beta, Delta, and Omicron, highlights the potential for rapid evolution and the impact of specific mutations on viral fitness. These VOCs harbor multiple mutations in the S protein, leading to increased transmissibility, immune evasion, and, in some cases, increased disease severity (WHO, 2021).

The accumulation of mutations in the S protein can lead to antigenic drift, where gradual changes in the amino acid sequence reduce the effectiveness of existing antibodies. Antigenic shift, a more dramatic change, can occur through recombination or reassortment of genetic material, leading to the emergence of novel viruses with significantly altered antigenic properties. Both antigenic drift and shift can undermine the effectiveness of vaccines and therapeutics, requiring continuous monitoring of viral evolution and adaptation of countermeasures (Rambaut et al., 2020).

4. Mechanisms of Immune Evasion

The S protein employs multiple mechanisms to evade the host’s immune system. Glycosylation, as mentioned earlier, can shield the protein surface from antibody recognition. Mutations within the RBD can also alter the epitopes recognized by neutralizing antibodies, reducing their binding affinity and neutralizing activity (Harvey et al., 2021). Furthermore, some CoVs have evolved mechanisms to downregulate the expression of major histocompatibility complex (MHC) class I molecules on infected cells, reducing the presentation of viral antigens to T cells (Khan et al., 2020).

The S protein can also interfere with the complement system, a crucial component of the innate immune response. Some CoVs express proteins that bind to complement components, preventing their activation and reducing the inflammatory response. By interfering with both the adaptive and innate immune systems, the S protein contributes to viral persistence and disease progression (Jiang et al., 2020).

The phenomenon of antibody-dependent enhancement (ADE) is also a concern in the context of CoV infections. ADE occurs when non-neutralizing antibodies enhance viral entry into cells, leading to increased viral replication and disease severity. While ADE has been observed in vitro and in animal models for some CoVs, its clinical relevance in humans remains a subject of ongoing research. However, the possibility of ADE should be considered when developing vaccines and antibody-based therapeutics (Arvin et al., 2020).

5. Targeting Conserved Regions for Pan-Coronavirus Therapeutics

Given the rapid evolution and antigenic diversity of the S protein, the development of pan-coronavirus vaccines and therapeutics that are effective against a broad range of CoV strains remains a significant challenge. One promising strategy is to target conserved regions of the S protein that are essential for viral entry and are less prone to mutation. These regions may include the fusion peptide (FP), the heptad repeat regions (HR1 and HR2), and the stem helix (Bosch et al., 2003).

The FP is a short hydrophobic sequence within the S2 subunit that inserts into the host cell membrane, initiating membrane fusion. The HR1 and HR2 regions are alpha-helical coiled-coil domains that interact to form a six-helix bundle, bringing the viral and cellular membranes into close proximity. The stem helix connects the S2 subunit to the transmembrane domain and anchors the S protein to the viral envelope. Inhibitors that target these conserved regions can disrupt membrane fusion and prevent viral entry (Xia et al., 2020).

Another approach is to focus on broadly neutralizing antibodies (bnAbs) that target conserved epitopes on the S protein. Several bnAbs have been identified that can neutralize a wide range of CoV strains, including SARS-CoV, MERS-CoV, and SARS-CoV-2. These bnAbs often target epitopes that are located outside the RBD and are less prone to mutation. Identifying and characterizing bnAbs can provide valuable insights into the conserved regions of the S protein and guide the design of novel vaccines and therapeutics (Corti et al., 2017).

The rational design of vaccines and therapeutics may involve computational modeling of viral protein structures and interactions. This can help in understanding protein-protein interactions between the S protein and the ACE2 receptor, as well as between the S protein and neutralizing antibodies. Furthermore, these models can be used to identify and optimize potential drug candidates that target conserved regions of the S protein. An opinion, based on the current state of the field, is that a multi-pronged approach targeting multiple conserved regions simultaneously will likely be more successful in creating pan-coronavirus therapeutics than any single-target strategy.

6. Challenges and Future Directions

Despite the progress in understanding the S protein and its role in CoV infections, several challenges remain in the development of pan-coronavirus therapeutics. The high mutation rate of CoVs necessitates continuous monitoring of viral evolution and adaptation of countermeasures. The conformational flexibility of the S protein and the complex glycosylation patterns also pose challenges for drug design and vaccine development.

The development of broadly effective vaccines that can elicit durable and protective immune responses against a wide range of CoV strains is a major priority. One approach is to develop multivalent vaccines that contain antigens from multiple CoV strains. Another approach is to focus on subunit vaccines that contain conserved regions of the S protein, such as the FP or HR1/HR2 regions. mRNA vaccine technology is also emerging as a promising platform for developing pan-coronavirus vaccines (Pardi et al., 2018).

Furthermore, the development of antiviral drugs that can effectively inhibit viral replication and prevent disease progression is crucial. Small molecule inhibitors that target conserved regions of the S protein, such as the FP or HR1/HR2 regions, could be promising candidates. Other potential drug targets include the viral protease, RdRp, and other essential viral proteins. Combination therapies that target multiple viral proteins may be more effective than single-agent therapies.

Finally, understanding the mechanisms of immune evasion employed by the S protein is essential for designing effective vaccines and therapeutics. Blocking the interaction between the S protein and complement components or enhancing the presentation of viral antigens to T cells could improve the immune response and prevent disease progression. The development of novel immunomodulatory therapies that can enhance the host’s immune response to CoV infections is also an area of active research. Future research should also focus on understanding the long-term effects of CoV infections and developing strategies to mitigate these effects. This includes studying the mechanisms of long COVID and developing therapies to alleviate the symptoms of this condition.

7. Conclusion

The Spike protein remains the primary target for coronavirus vaccine and drug development. Its inherent variability, however, necessitates a focus on highly conserved regions within the protein architecture to achieve broadly protective immunity. Future research must continue to probe the nuances of S protein structure, evolution, and immune evasion strategies. This, coupled with advances in vaccine technology and antiviral drug design, holds the key to developing effective countermeasures against existing and emerging coronaviruses, ensuring global preparedness for future pandemics.

References

Andersen, K. G., Rambaut, A., Lipkin, W. I., Holmes, E. C., & Garry, R. F. (2020). The proximal origin of SARS-CoV-2. Nature Medicine, 26(4), 450-452.

Arvin, A. M., Fink, K., Klein, N. P., Messer, W. B., Nealon, J., Nirula, A., … & Consortium, V. C. I. I. (2020). SARS-CoV-2 vaccines and antibody-dependent enhancement: a potential concern?. Nature Microbiology, 5(10), 1255-1260.

Bosch, B. J., Bartelink, A. K., & Rottier, P. J. (2003). Coronavirus pathogenesis: the spike protein. Journal of General Virology, 84(6), 1263-1279.

Casalino, L., Gaebler, C., Lorenzi, J. C. C., Wang, Q., Cho, A., Janssens, K., … & Nussenzweig, M. C. (2020). Functional and structural characterization of SARS-CoV-2-neutralizing antibodies that define sites of vulnerability. Cell, 183(4), 985-995.e19.

Corti, D., Foglieri, M., & Lanzavecchia, A. (2017). Broadly neutralizing antibodies against influenza viruses. Annual Review of Immunology, 35, 305-325.

Dejnirattisai, W., Zhou, D., Ginn, H. M., Huat, H. T., Nutalai, R., Supasa, P., … & Screaton, G. R. (2021). Antibody evasion by the P. 1 strain of SARS-CoV-2. Cell, 184(11), 2939-2954.e9.

Domingo, E. (2000). Virus evolution. Clinical Microbiology and Infection, 6(1), 7-13.

Harvey, W. T., Carabelli, A. M., Jackson, B., Gupta, R. K., Thomson, E. C., Harrison, E. M., … & Consortium, C. G. U. (2021). SARS-CoV-2 variants, spike protein mutations and immunity. Nature Reviews Microbiology, 19(7), 409-424.

Jiang, Y., Zhang, H., & Tian, G. (2020). COVID-19: From the perspectives of genetics, genomics, and precision medicine. Human Genomics, 14(1), 1-10.

Khan, M., Adil, S., Mushtaq, G., Jongkaewwattana, A., Madad, S., Shah, M., … & Ashraf, S. (2020). Systematic review of neurological manifestations of COVID-19. Journal of Neurology, 267(5), 1169-1184.

Li, F. (2016). Structure, function, and evolution of coronavirus spike proteins. Annual Review of Virology, 3(1), 237-261.

Li, W., Moore, M. J., Vasilieva, N., Sui, J., Wong, S. K., Berne, M. A., … & Farzan, M. (2003). Angiotensin-converting enzyme 2 is a functional receptor for the SARS coronavirus. Nature, 426(6965), 450-454.

Pardi, N., Hogan, M. J., Porter, F. W., & Weissman, D. (2018). mRNA vaccines—a new era in vaccinology. Nature Reviews Drug Discovery, 17(4), 261-279.

Raj, V. S., Mou, H., Smits, S. L., Dekkers, D. H., Müller, M. A., Dijkman, R., … & Osterhaus, A. D. (2013). Dipeptidyl peptidase 4 is a functional receptor for the emerging human coronavirus EMC/2012. Nature, 495(7440), 251-254.

Rambaut, A., Holmes, E. C., O’Toole, Á., Hill, V., McCrone, J. T., Ruis, C., … & Pybus, O. G. (2020). A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nature Microbiology, 5(11), 1403-1407.

Vlasenko, A. N., Deryabin, P. G., & Agafonov, A. P. (2021). Sialic acids as receptors for human coronaviruses. Reviews in Medical Virology, 31(1), e2153.

Walls, A. C., Park, Y. J., Tortorici, M. A., DeLuca, T. F., Walls, J., McGuire, A. T., … & Veesler, D. (2020). Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein. Cell, 181(2), 281-292.e6.

WHO. (2021). Tracking SARS-CoV-2 variants. World Health Organization. Retrieved from https://www.who.int/en/activities/tracking-SARS-CoV-2-variants/

Woo, P. C., Lau, S. K., Yip, C. C., Huang, Y., Tsoi, H. W., Chan, K. H., & Yuen, K. Y. (2010). Coronavirus diversity, phylogeny, and interspecies jumping. Annual Review of Microbiology, 64, 81-92.

Wrapp, D., Wang, N., Corbett, K. S., Goldsmith, J. A., Hsieh, C. L., Abiona, O., … & McLellan, J. S. (2020). Cryo-EM structure of the SARS-CoV-2 spike glycoprotein in the prefusion conformation. Science, 367(6483), 1260-1263.

Xia, S., Yan, L., Xu, W., Agwunobi, U., & He, X. (2020). A review of SARS-CoV-2 spike protein-based vaccines. Frontiers in Immunology, 11, 3516.

Zhu, N., Zhang, D., Wang, W., Li, X., Yang, B., Song, J., … & Tan, W. (2020). A novel coronavirus from patients with pneumonia in China, 2019. New England Journal of Medicine, 382(8), 727-733.

Zhou, P., Yang, X. L., Wang, X. G., Hu, B., Zhang, L., Zhang, W., … & Shi, Z. L. (2020). A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature, 579(7798), 270-273.

1 Comment

  1. Given the S protein’s conformational flexibility and glycosylation patterns, how might computational modeling and structural biology contribute to identifying more accessible conserved regions for therapeutic targeting?

Leave a Reply

Your email address will not be published.


*