The structure and function of membrane protein in coronavirus infection and its applications in the development of vaccines and therapeutic drugs
Shengnan Jiang, Jianbo Yuan, Qingyang Li, Ziyan Song, Lijing Cao, Zhenhui Song, Xingcui Zhang

TL;DR
This paper reviews the structure and function of the M protein in coronaviruses and its role in vaccine and drug development.
Contribution
The paper systematically reviews the M protein's structure, function, and applications in combating coronavirus infections.
Findings
The M protein has three transmembrane domains and plays a role in viral assembly and budding.
It regulates host innate immune signaling pathways to evade the immune system.
The M protein has potential applications in vaccine and therapeutic drug development.
Abstract
Coronaviruses have long posed significant harm to human and animal health, causing a variety of diseases. The membrane (M) protein of coronaviruses is one of the four major structural proteins and a key component of the viral structure, playing an important role in viral assembly, budding, and immunomodulation. In this paper, we systematically reviewe the structural and functional characteristics of the M protein, including its three transmembrane domains, N-terminal glycosylation and C-terminal oligomerization domain. In terms of function, we focus on the mechanistic roles of the M protein in viral envelope formation and the nucleocapsid packaging, as well as the newly discovered immune evasion strategy of regulating host innate immune signaling pathways. In addition, we also summarize the applications of M protein in preventing and controlling coronavirus infection and mitigating its…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
FIGURE 1
FIGURE 2
FIGURE 3
FIGURE 4| Virus | Virus species | Number of amino acids | Glycosylation type | Glycosylation site | References |
|---|---|---|---|---|---|
| PEDV | Alphacoronavirus porci | 226 | N-linked | N3, N19 and N189 | ( |
| HCoV-NL63 | Alphacoronavirus amsterdamense | 226 | N-linked | N3, N19 and N188 | ( |
| HCoV-229E | Alphacoronavirus chicagoense | 225 | N-linked | N5, N190 and N207 | ( |
| HKU2-CoV | Alphacoronavirus rhinolophi | 229 | N-linked | N6, N22, N192 and N235 | ( |
| HKU4-2-CoV | Betacoronavirus tylonycteridis | 219 | N-linked | N4 and N20 | ( |
| SARS-CoV | Betacoronavirus pandemicum | 221 | N-linked | N4 | ( |
| SARS-CoV-2 | Betacoronavirus pandemicum | 222 | N-linked | N5 | ( |
| MERS-CoV | Betacoronavirus cameli | 219 | N-linked | N3 | ( |
| IBV | Gammacoronavirus galli | 225 | N-linked | N3 and N6 | ( |
| MHV | Betacoronavirus gravedinis | 228 | O-linked | T5 | ( |
| BCoV | Betacoronavirus gravedinis | 230 | O-linked | S2, S3, T5 and T6 | ( |
| HCoV-OC43 | Betacoronavirus gravedinis | 230 | O-linked | S2, S3, T5, T6, T14 and S28 | ( |
| Virus | Bonding mechanism | Residues | Characteristic |
|---|---|---|---|
| SARS-CoV-2 | Hydrophobic effect | W55, V66, V70, W75, I82, W92, F96, F100, F103, and F112 | Primarily mediated by hydrophobic and electrostatic interactions |
| Polarity effect | R107 | ||
| SARS-CoV | Hydrophobic effect | W19, W57, W91, and F95 | Cysteine-mediated covalent bonds are possible, but the hydrophobic effect remains the primary form of dimerization |
| Covalent bond (disulfide bond) | C63, C85, C158 |
| Virus | Regulation of IFN | Key structure | Mechanism | References |
|---|---|---|---|---|
| SARS-CoV | Upregulate | Val68 | Acts as an intracellular PAMP to activate IFN-β signaling pathway | ( |
| Inhibit | TM domains | Inhibits the formation of the TRAF3-TBK1 complex to prevent the activation of IRF3 | ( | |
| The entire M protein | Impedes the formation of the TRAF3-TANK-TBK1/IKKε complex | ( | ||
| SARS-CoV-2 | Inhibit | TM domains | Inhibits the aggregation formation of MAVS, thereby preventing the recruitment of downstream signaling molecules such as TRAF3, TBK1 and RIG-I, and further inhibiting the phosphorylation and nuclear translocation of IRF3 | ( |
| Trp31 and Leu34 | Induces mitochondrial autophagy to disrupt the MAVS-mediated type I IFN signaling pathway | ( | ||
| MERS-CoV | Inhibit | TM1 domain | Binds to TRAF3 through its N-terminal transmembrane domain, thereby disrupting the formation of the TRAF3-TBK1 complex, suppresses the phosphorylation and activation of IRF3 | ( |
| The entire M protein | Cooperates with ORF4a and ORF4b to inhibit the activation of interferon-stimulated response element (ISRE) | ( |
| Function | Functional particulars | Typical type of virus | References | |
|---|---|---|---|---|
| Constructing the basic structure of the virus | Determining the morphology of the viral envelope | All types of CoVs | ( | |
| Interacting with the proteins of M, E, N and S | ( | |||
| Guiding the assembly and release of virus particles | Co-expressing with other structural proteins to forms VLPs | All types of CoVs | ( | |
| Interacting with E protein | ( | |||
| Interacting with S protein | ( | |||
| Interacting with N protein | ( | |||
| Participating in the immune response | Altering the expression of IFN | Upregulate | SARS-CoV, MHV, IBV, TGEV | ( |
| Inhibit | SARS-CoV, SARS-CoV-2, MERS-CoV, PEDV | |||
| Interacting with HSP70, enhancing viral replication | PEDV, TGEV, MHV, SARS-CoV-2 | ( | ||
| Possessing B cell and T cell antigenic epitopes | SARS-CoV, SARS-CoV-2 | ( | ||
| Inducing apoptosis | Triggering mitochondrial apoptosis | SARS-CoV, SARS-CoV-2 | ( | |
| Interfering with the connection between PDK1 and PKB/Akt, stimulating Caspases-8 and Caspases-9 | ( | |||
| Leading to neurodegeneration and DNA damage | ( | |||
| Interacting with viral RNA | Interacting with viral RNA, packaging RNA into virus particles | All types of CoVs | ( | |
| Potential therapeutic drug | Molecular target | Mechanism of action | Possible effects | Virus | References |
|---|---|---|---|---|---|
| CIM-834 | Pro123 of M protein | Binds to and stabilizes the M protein in its short form, thereby preventing its conformational switch to the long form | Inhibit the viral assembly and reproduction. Exhibits antiviral activity and blocks viral spread | SARS-CoV, SARS-CoV-2 | ( |
| JNJ-9676 | Ser99 and Asn117 of M protein | Stabilizes the M protein dimer in the conformational state intermediate between the long and short forms | SARS-CoV-2 MERS-CoV, HCoV-OC43, MHV | ( | |
| Caffeic acid | Lys50 of M protein | Form hydrogen bond with the active site residue of the M protein | SARS-CoV-2 | ( | |
| Ferulic acid | Lys50 of M protein | ||||
| Colchicine | Met109 of M protein | ( | |||
| Remdesivir | Ala40 and Arg131 of M protein | ||||
| Bafilomycin A1 | Ala40, Asn41 and Asn43 of M protein | ||||
| Temozolomide | Asn41, Asn113 and Glu115 of M protein | ||||
| Brefeldin A (BFA) | ARF1 | Disrupt ARF1 and M localization at the ERGIC and abate virion assembly | SARS-CoV-2 | ( | |
| Golgicide A (GCA) | ARF1 | ||||
| PEP17 | Conformational epitopes of M protein | Blocks the binding of M protein to ARF1 through competitive binding |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSARS-CoV-2 and COVID-19 Research · Animal Virus Infections Studies · Immune responses and vaccinations
Introduction
1
Coronavirus (CoV) infection has become one of the most important infectious diseases affecting both humans and animals. Various CoVs have been isolated or detected in animals such as swine, bats, camels, deer, mink, canines and felines. With a broad host range encompassing avian and mammalian species, CoVs cause a variety of serious diseases that pose significant threats to the agricultural industry (Lin et al., 2021; Malik, 2020; Miranda et al., 2021). Coronaviruses belong to the order Nidovirales, family Coronaviridae, subfamily Orthocoronavirinae. Based on genomic sequence, serotype and other characteristics, members of this subfamily are divided into four genera, namely Alpha-coronavirus (α-CoV), Beta-coronavirus (β-CoV), Gamma-coronavirus (γ-CoV) and Delta-coronavirus (δ-CoV) (Fung and Liu, 2019; Malik, 2020). According to the current database of the International Committee on Taxonomy of Viruses (ICTV) taxonomy release, there are 26 species of α-CoVs, 15 species of β-CoVs., 5 species of γ-CoVs., and 7 species of δ-CoVs., Over the past 20 years, seven types of CoVs., that can infect humans and pose a threat to human health have been identified. Members of β-CoV genus are generally more harmful. Among them, Middle East Respiratory Syndrome Coronavirus (MERS-CoV), Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) and SARS-CoV-2 are highly pathogenic CoVs that can cause severe illness and even death; these viruses also possess a greater potential for widespread transmission.
The virus particle of CoV is spherical, with an envelope and spike proteins, and its nucleocapsid exhibits helical symmetry (Fung and Liu, 2019; Lin et al., 2020; Figure 1a). The virus genome is a single-stranded, linear, non-segmented positive-sense RNA, and the genome is formed from 5-capped and 3-polyadenylated, with a size of 25∼30 kb, making it the largest genome among known RNA viruses (Brian and Baric, 2005; Li et al., 2020; Schoeman and Fielding, 2019). The genome contains multiple open reading frames (ORFs) downstream of the 5’ untranslated region (UTR), encoding non-structural precursor polyproteins (Lai et al., 2006; Marra et al., 2003; Park et al., 2012). CoVs have four structural proteins: nucleocapsid protein (N), spike glycoprotein (S), membrane protein (M), and envelope protein (E), which together form the basic structure of the virus (Zandi and Soltani, 2022). Hemagglutinin esterase (HE), a dimer of a class I membrane protein with a molecular mass of 65 kDa, is also present in some closely related β-CoVs, such as Mouse Hepatitis Virus (MHV), Bovine Coronavirus (BCoV) and HCoV-OC43 (Wang Y. et al., 2020), which constitutes short spikes. The ORF3 accessory protein is also found in certain CoVs, including SARS-CoV-2, and has been confirmed to be an important virulence factor (Zhang J. et al., 2022). In all CoVs, the genes encoding the major structural proteins are arranged in the conserved 5’–3’ ordering as S, E, M, and N, within the 3′-proximal one-third of the genome (Brian and Baric, 2005). There are short untranslated regions at both the 5’ and 3’ ends, and the genes encoding putative accessory proteins are located between the M and N proteins (Woo et al., 2012).
The structure of CoVs and M protein. (a) The structure of CoV particle. The M protein is completely embedded in the viral lipid envelope. (b) Domain organization of CoV M protein. (c) The transmembrane topological structure of the M protein. The CoV M protein spans the viral membrane, with its N-terminal domain located on the exterior of the virion, its C-terminal domain on the interior of the viral membrane, and its transmembrane domain embedded in the lipid envelope. (d) M protein monomer. (e) The M protein dimers (M–M dimers).
The M protein is crucial in the life cycle of CoVs, participating in multiple processes such as viral assembly, envelope formation, and immune modulation. Moreover, it is an important target for therapeutic drug and vaccine development.
Structural features of M protein
2
The M protein is the most conserved and abundant structural glycoprotein in the CoV particle and serves as the backbone in viral assembly. Its full length ranges from 217 to 263 amino acids (aa), with a molecular mass of 25–30 kDa (Malik, 2020; Schoeman and Fielding, 2019). In the present study, the M protein of canine coronavirus (CCoV, an α-CoV) is a typical long-sequence representative, comprising 263 aa, whereas the M protein of porcine δ-CoV (PDCoV) is a compact representative, with a length of 217 aa. Although CoVs form their envelope and bud in the endoplasmic reticulum-Golgi intermediate compartment (ERGIC), their viral surface proteins can localize to the compartments downstream in the secretory pathway in viral isolation study (Schoeman and Fielding, 2019). The M protein is mainly located in Golgi apparatus, and it is transported in and out of the organelle via vesicles. It is also located in the endoplasmic reticulum (ER) and the ERGIC (Dong et al., 2016; Nal et al., 2005). This cycling is crucial for its role in recruiting other viral components and facilitating viral assembly.
The M protein is characterized by being embedded in the membrane via three transmembrane domains (Mahtarin et al., 2022; Narayanan et al., 2000; Figures 1b,c). It has been shown that the transmembrane domains of different M proteins contain distinct amino acid sequences (Mahtarin et al., 2022). M proteins from most CoVs exhibit a high degree of structural similarity (Figure 2a). The viruses included in the sequence alignment represent four genera of CoVs (Figure 2b), including the strains that are highly pathogenic to humans and animals. This enables comparative analysis of M protein sequence features across CoV species. Dynamic sequence comparison of the M proteins from nine CoVs reveals that the overall amino acid sequence homology reaches 38.77%. The sequence similarity of the M protein is an important basis for determining the evolutionary relationships of CoVs. The M protein of SARS-CoV-2 shares 90% homology with that of SARS-CoV, and this value is even higher in SARS-CoV strains isolated from animal hosts, such as bats and pangolins (Thomas, 2020). However, it is noteworthy that the similarity of M protein of MERS-CoV is only about 38%.
Comparison of the amino acid sequences of the M proteins from several CoVs. (a) The structures and sequences of the M protein of SARS-CoV (GenBank: APO40582.1), SARS-CoV-2 (GenBank: QII57163.1), MERS-CoV (GenBank: QGV13486.1), MHV (GenBank: BAJ04700.1), BCoV (GenBank: UZT75403.1), PEDV (GenBank: ANM27773.1), TGEV (GenBank: XEF57925.1), IBV (GenBank: ACS45218.1) and PDCoV (GenBank: QSL97110.1). These sequences were downloaded from the NCBI Protein Database. The analysis data regarding the transmembrane domains are from the UniProt database (https://www.uniprot.org/). (b) Multiple sequence alignment of the M proteins was generated using DNAMAN program. Residues with homology levels higher than 33% (yellow), 50% (blue), 75% (violet), and 100% (red) are highlighted.
N-terminal ectodomain
2.1
A short N-terminal glycosylated ectodomain—exposed either to the exterior of the virus or to the lumen of intracellular organelles—spans the first 1–19 amino acid residues at the N-terminus. In some α-CoVs, the M protein contains an additional hydrophobic segment that functions as a signal peptide. Research on Transmissible Gastroenteritis Virus (TGEV) has shown that this signal peptide fragment is located within the first 50 amino acids at the N-terminus of the M protein (Escors et al., 2001a). In PDCoV, hydrophobicity analysis reveals that the N-terminal region of the M protein is highly hydrophilic (Wu et al., 2023).
Glycosylation is the process by which sugar chains are covalently attached to specific amino acid residues of proteins via glycosyltransferases, and it is an important post-translational modification. The M protein is always glycosylated at its N-terminal ectodomain (Fung and Liu, 2018). According to the site of modification, it can be classified into N-glycosylation and O-glycosylation. Many experiments have shown that almost all CoV M proteins are exclusively N-linked glycoproteins, especially in α-CoVs and γ-CoVs. After exiting the ER, the M protein reaches the Golgi apparatus, where its N-glycans are further modified (Nal et al., 2005; Perrier et al., 2019). The N-glycosylation of the M protein is not mediated by the virus itself but relies on the host cell’s ER and Golgi apparatus glycosylation modification enzymatic systems operating through the secretory pathway. When M protein mRNA is translated on ribosomes attached to the ER, its N-terminal signal peptide directs the nascent polypeptide into the ER lumen. If the amino acid sequence contains the N-glycosylation consensus sequence (Asn-X-Ser/Thr, where X is any amino acid except proline), the oligosaccharide transferase complex catalyzes the transfer of the core oligosaccharide to the target asparagine residue, initiating glycosylation (Aebi, 2013; Shrimal et al., 2015). Subsequently, glucosidases I and II catalyze the initial processing and folding of the glycoprotein. The glycosylated M protein is then transported to the Golgi apparatus via vesicles, where a series of glycosyltransferases further modify the oligosaccharide chains, ultimately yielding mature, structurally diverse N-linked glycans.
Studies on the N-glycosylation of MERS-CoV and SARS-CoV-2 have revealed that their M proteins are modified with polylactosamine chains, and this modification relies on acidic residues located near the first transmembrane domain (Juckel et al., 2023). During the glycosylation of the M protein, certain acidic residues in its N-terminal region are essential for this modification, in addition to the intrinsic N-glycosylation site. For instance, researchers have confirmed that residues E11, E12, and E18 in the SARS-CoV-2 M protein, a residues E9 and D18 in the MERS-CoV M protein are crucial for the proper glycosylation of these proteins (Juckel et al., 2023). In contrast, the M proteins of certain lineage A β-CoVs such as MHV, BCoV, and HCoV-OC43, are modified by O-linked glycosylation (Liang et al., 2019; Yamada et al., 2000). Studies on MHV have shown that its M protein undergoes O-glycosylation in the Golgi apparatus, with the glycosylation sites located on Ser/Thr residues in the extracellular N-terminal domain (de Haan et al., 2000). Since its identification, O-linked glycosylation has been served as a valuable indicator for investigating the maturation, membrane insertion, and intracellular trafficking of the M protein in these three β-CoVs (Fung and Liu, 2018).
Glycosylation plays a crucial role in maintaining the function of the M protein during viral assembly, release, and interaction membrane interaction. Through deletion mutations of glycosylation sites, it has been found that the N-glycosylation of the SARS-CoV M protein is essential for viral assembly and infectivity. Deletion of this site impairs progeny viral particle assembly (Voss et al., 2006). Deletion mutations of the N3 and N6 glycosylation sites in IBV significantly affect viral virulence (Liang et al., 2019). In addition, the M protein influences viral particle assembly. Substituting O-glycosylation sites in MHV with N-glycosylation sites reduces the efficiency of viral particle recombination (de Haan et al., 2003). Currently, one to four potential N-glycosylation sites have been identified in the M proteins of various CoVs. Generally, most CoV M proteins contain a single N-glycosylation site, and some CoVs have been confirmed to possess multiple glycosylation sites on their M protein (Table 1).
Transmembrane domain
2.2
The transmembrane domain consists of three transmembrane (TM) helices that anchor the protein in the viral envelope and are labeled TM1, TM2, and TM3 from the N-terminal (Ujike and Taguchi, 2015). The three transmembrane domains of the CoV M proteins are critically important for virus to target the plasma membrane. Moreover, the 50 amino acid residues at the amino terminus containing the TM1 domain, are crucial for achieving effective M–M protein interaction (Tseng et al., 2010). The plasma membrane trafficking signal of the TM3 domain of the SARS-CoV M protein contains the highly conserved residues phenylalanine 95 (F95) and serine 110 (S110), which are essential for viral assembly.
C-terminal intracellular domain
2.3
There is a larger C-terminal intracellular structural domain in the M protein, featuring a prominent cytoplasmic tail approximately 6–8 nm in length, which has been identified as the major functional region for interactions between host cell components and other viral proteins (Hsieh et al., 2008). The C-terminal domain, which follows the TM3 region, is organized into a tightly membrane-associated amphipathic structural domain and a distal short hydrophobic structural domain (Ujike and Taguchi, 2015), resulting in a topology comprising both the N-terminal ectodomain and C-terminal cytoplasmic domain (Voss et al., 2009). When investigating the intracellular trafficking of the MERS-CoV M protein, researchers have identified a region in the cytoplasmic tail of its C-terminal domain as the determinant for localization to the trans-Golgi network (TGN). This localization process is mediated by a specific set of four residues (Lys199, Gly201, Tyr203, and Arg204) within the MERS-CoV M protein (Perrier et al., 2019). Furthermore, the presence of the C-terminal domain of MERS-CoV M protein is important for inducing its specific localization.
The M protein in the virion constitutes the majority of the viral envelope, and the overall scaffold of the envelope is formed by M–M interactions (Arndt et al., 2010; Neuman et al., 2011; Rodríguez-Enríquez et al., 2022; Figures 1d,e). Research indicates that the M protein, which exists as a dimer, adopts two distinct conformations. This structural flexibility enables it to facilitate membrane bending and bind to the nucleocapsid (Rodríguez-Enríquez et al., 2022). The two conformations of the M protein are termed the long and compact (or short) forms (Figure 3), which influence the curvature of the viral envelope. The long form (M-long) confers rigidity, uniformity, and a narrow curvature to the viral envelope, whereas the short form (M-short) plays a contrasting role by providing greater flexibility and reducing spike density (Neuman et al., 2011). Both the long and short forms of the M protein are critical for viral assembly. The M protein forms dimers through interactions with corresponding regions of its structural domains, and in this binding process, specific sites in the three-transmembrane domain play a key role. These dimerizations and subsequent polymerizations are conserved processes. The dimerization of the M protein is a cooperative process involving its transmembrane helices and a C-terminal β-sandwich domain (BD) oriented toward the interior. The dimerization gives rise to a dome-like structure that covers the inner surface of the transmembrane region via the internal sheets of the BD (Zhang Z. et al., 2022). The M protein is structurally similar to the ion channel protein ORF3a, but it does not function as an ion channel by itself. In the two conformations of the M protein dimer (long form and short form), the internal conformation of each monomer remains essentially unchanged, whereas the relative arrangement between the two BDs undergoes significant changes. During the conversion from the long form to the short form, the two monomers approach each other on the extracellular side and separate on the cytoplasmic side. Concurrently, the dimerization interface between the two BDs also undergoes rearrangement, resulting in an increase in the opening angle of the dome-like BD structure from approximately 100–140 (Zhang Z. et al., 2022). The interconversion between the two conformations is regulated by the hinge region, which may be closely linked to modulation of membrane environment, lipid binding, and membrane curvature during viral assembly.
The two conformations of M protein. The M-short dimer is approximately 7.2 nm high and 5.7 nm wide, with a larger upper cavity, a deeper hinge-region insertion, and a BD opening angle of about 140°. The M-long dimer is about 8.6 nm high and 5.0 nm wide, with a smaller upper cavity, a shallower hinge-region insertion, and a narrower BD opening angle of around 100.
Biological functions of M protein
3
The M protein forms the basic structure of the virus
3.1
The lattice-like, densely packed matrix structure of the viral envelope results in the lateral interactions of homotypic M–M dimers, which constitutes the fundamental structural components of the viral structure formed by the M protein (de Haan et al., 2000). In SARS-CoV-2, the dimer interface composes 38 residues: 17 are from one M protein molecule (W55, P59, L62, V66, A69, V70, W75, I82, A85, W92, L93, F96, F100, F103, R107, M109, and F112) and 21 are from the other M protein molecule (P59, L62, V66, A69, V70, Y71, I82, A85, W92, L93, F96, I97, F100, F103, A104, R107, S108, M109, S111, and F112) (Marques-Pereira et al., 2022). Experiments on SARS-CoV show that residues W19, W57, P58, W91, Y94, F95, and C158 are essential for M protein homodimer interaction (Tseng et al., 2013). In this context, some host membrane proteins are excluded from the viral envelope through the interaction network formed by M–M dimers. Certain residues of the M protein (K14, Y39, R42, N43, R44, F45, Y71, R72, W75, S94, R101, R107, W110, S173, and R174) participate in the formation of polar, contacting with membrane lipids (Marques-Pereira et al., 2022). SARS-CoV may rely more on cysteine-mediated disulfide bonds to stabilize the dimer, whereas SARS-CoV-2 depends more on electrostatic interactions (Table 2). Hydrophobic interactions underpin M protein membrane intercalation and dimerization across all CoVs. A more stable M protein dimer may enhance the virus’s environment tolerance, improve assembly efficiency, and promote immune escape, thereby increasing transmissibility and pathogenicity. However, the specific mechanism requires further validation through in vivo experiments.
The reticular matrix serves as a scaffold for the assembly of either the E or the N protein. When interacting with the N protein, the M protein packages ribonucleoproteins (RNPs, composed of the viral RNA genome and the N protein) into virions, and this function primarily relies on the intracellular C-terminal domain of the M protein. Moreover, M proteins interact pairwise to form dimers, constituting the viral envelope scaffold; such specific interactions can also incorporate the S or HE protein into lattice vacancies, positioning these proteins regularly within the lattice (Ujike and Taguchi, 2015). Research indicates that it is the collective interactions among M protein dimers that primarily induce membrane curvature, whereas E protein pentamers affect membrane deformation, prevent curvature, and maintain membrane flatness (Collins et al., 2021; Monje-Galvan and Voth, 2021).
The M protein promotes the assembly and release of virus particles
3.2
The CoV genomes replicate in the cytoplasm and assemble into viral particles by budding into the smooth membrane of the ERGIC, then traffic to the ER, and are released via exocytosis to infect surrounding cells (Liang et al., 2019; Steiner et al., 2024; Figure 4). The M protein of CoVs is a central player in virion assembly, particularly during replication and transcription, serving as the key organizer that transforms host cell membranes into sites for viral particle production (Bracquemond and Muriaux, 2021; Kuo et al., 2016a; Steiner et al., 2024). The M protein dynamically cycles between the Golgi complex and the ER, and returns to the Golgi from the plasma membrane via the endocytic recycling pathway. The synthesis of the M protein increases during CoV infection (Wang R. et al., 2020). Immunofluorescence and Western blot analyses reveal that M protein expression can be detected as early as post-infection and persists throughout the course of infection until cell death (Wu et al., 2023). CoV assembly is critically dependent on the proper and accurate intracellular transport of the M protein (Desmarets et al., 2023). Through M–M interactions, a scaffold for viral assembly is formed, and host cell membrane proteins are excluded, thereby conferring specificity to the viral envelope (Liang et al., 2019). Heterotypic interactions facilitate the recruitment of other structural proteins and the viral genomic RNA to the assembly site. The M protein not only self-associates but also assists in virion assembly and interacts with other structural proteins (N, E, and S), as well as with accessory proteins (including ORF3), host proteins, and non-structural proteins (Mahtarin et al., 2022; Rodríguez-Enríquez et al., 2022). By recruiting other structural proteins to the budding site, the M protein organizes a specific membrane structural domain for viral assembly (Neuman et al., 2011). Through these interactions, the M protein ensures the efficient assembly and release of new viral particles (Castaño et al., 2021).
M protein promotes the assembly and release of virus particles. (a) On the ERGIC membrane, the number of M-short proteins exceeds that of M-long proteins. Recruitment of the S, E proteins, the RNP, as well as conversion of M-short to M-long. (b) The ERGIC associates with the RNP and then traffics to the Golgi complex. (c) Viral maturation, budding, and assembly. (d) Release. The virus is released outside the cell via exocytosis.
M protein interacts with E protein to form virus-like particles and promote virion release
3.2.1
The co-localization and interaction of the M and E proteins represent one of the most mature and well-defined protein-protein interactions among the structural proteins of CoVs, which are crucial for the formation of CoV protein-containing vesicles (Lopez et al., 2008; Swann et al., 2020; Xu et al., 2020). The coexpression of M and E proteins can induce the formation of virus-like particles (VLPs) and their release. Mediated by the C-terminal domains of the two partner proteins, this interaction occurs at the cytoplasmic side of the ERGIC. Studies have shown that deletions of C-terminal in both the M and E proteins of SARS-CoV-2 impair their ability to interact (Mukherjee et al., 2020), whereas ubiquitination of the N-terminal domain of M protein stabilizes the M-E interaction, indicating a functional connection between the C-terminal and N-terminal domains of the M protein. Recently, the interaction between the M and E proteins of SARS-CoV-2 has been clarified: it reveals that the M-E protein interaction mediates vesicle release by enhancing M protein self-interaction (Satarker and Nampoothiri, 2020; Zhou et al., 2023). This observation may relate to the role of the M protein in viral assembly and/or the role of the E protein in membrane scission (Bracquemond and Muriaux, 2021). Current findings propose that the E protein does not directly induce membrane curvature but rather enhances M protein self-interaction (Yuan et al., 2021). This enhanced interaction indirectly facilitates virion budding by increasing membrane fluidity. Due to the presence of M protein, viruses and VLPs have a thicker envelope region than vesicles or exosomes. However, the formation of VLPs strictly requires the coexpression of the M and E proteins, as the expression of M protein alone is insufficient. Functional cooperation between the M and E proteins leads to the formation of pseudo particles, which exhibits interfering activity similar to that of intact virions (Song and Park, 2012). For ensuring formation of uniformed size of viral particles, the M-E complex is essential for viral maturation and ultimately enables viral release (Yuan et al., 2021).
The M protein interacts with S protein and stabilizes the S protein in assembling virus particle
3.2.2
The M protein dimer is responsible, in part, for recruitment of the S protein. Functional experiments in MHV and BCoV have demonstrated that, in the VLP assembly system, coexpression of the M and S proteins significantly enhances S protein incorporation efficiency and improves its structural orderliness (Ujike and Taguchi, 2015); whereas disruption of M protein dimerization leads to defective S protein recruitment. When coexpressed with the E protein, the M protein participates in regulating S protein localization, likely at the ERGIC or cis-Golgi assembly sites. This intracellular retention of S protein inhibits cell–cell fusion, thereby preventing the formation of syncytia (Boson et al., 2021). In fact, the M protein preferentially captures the S protein and incorporates it into virions or VLPs. The M protein mediates intracellular retention of the S protein by directly binding to its cytoplasmic tail. Moreover, the M protein interacts with the S protein and retains it in the Golgi/ERGIC compartment, where the majority of viral assembly takes place (Arndt et al., 2010; Fung and Liu, 2018). Heparan sulfate proteoglycans (HSPGs) are glycoproteins that constitute ubiquitous components of the mammalian cell surface. The M protein of HCoV-NL63, in synergy with the S protein, facilitates host cell invasion by promoting the binding to HSPGs containing heparan sulfate (Naskalska et al., 2019).
The M protein interacts with the N protein and facilitates viral infection
3.2.3
During virus assembly, the interaction between the M and N proteins plays an irreplaceable role. Various studies have shown that this interaction recruits the RNP complex synthesized in the cytoplasm to the viral assembly site (primarily the ERGIC membrane). In most CoVs, the primary region mediating the M–N protein interaction is localized in the cytoplasmic domain of the C-terminal region of both the M and N proteins. A study of SARS-CoV has shown that the leucine motif (218LL219) at the C-terminus of the internal domain of M protein is essential for efficient packaging of the N protein into VLPs (Tseng et al., 2013). In β-CoVs, research provides direct evidence that the M and N proteins interact in MHV-infected cells, as demonstrated by reciprocal co-immunoprecipitation using an anti-M monoclonal antibody. The main determinant of the MHV M–N protein interaction is located in domain 3 of C-terminus of the N protein (Hurst et al., 2009). In SARS-CoV-2, the mature nucleocapsid (a single-genome condensate) directly interacts with the M protein (Cubuk et al., 2021). Research on SARS-CoV-2 indicates that the interaction between the N and M proteins during the budding process is critically dependent on the C-terminal domain of the N protein (Han et al., 2024). This interaction represents a critical step in virion assembly, bridging the packaged genome to the viral envelope (Bai et al., 2021). In α-CoVs, research on TGEV shows that the C-terminal region of the M protein interacts with the N protein (Escors et al., 2001b). The TGEV M–N protein interaction is temperature-sensitive, and the binding is more stable at lower temperatures. Moreover, assembly of the M and N proteins also requires the assistance of the E protein to enhance interaction stability. Disruption of the M–N interaction severely impairs or completely blocks VLP formation and infectious virion production, findings consistently confirmed across all CoV studies.
The above facts demonstrate that coexpression of the M protein with other structural proteins leads to robust VLP production. Further evidence shows that the SARS-CoV-2 M protein promotes viral replication by disrupting STX18 to induce ATG14-mediated lipid phagocytosis and by combining with its ability to degrade RSAD2 (Yuan and Ding, 2024; Yuan et al., 2024).
The M protein participates in the immune response
3.3
Altering the expression of IFN and regulate the innate immune response
3.3.1
The M protein can alter IFN expression and regulate the innate immune response (Table 3). Glycosylation of N-terminal extracellular domain of the M protein is not indispensable for CoV replication, but it does play a role in virus–host interaction as well as in viral pathogenesis (Liang et al., 2019). Studies have found that, in infectious bronchitis virus (IBV), N-linked glycosylation of the M protein may promote apoptosis and proinflammatory responses during CoV infection, potentially by modulating ER stress and subsequently triggering proinflammatory cytokines production. A study using recombinant MHV to change the glycosylation of its M protein from O-linked to N-linked found that the type I interferon levels were higher than those of wild-type MHV after the modification (Fung and Liu, 2018). This result demonstrates that the glycosylation status of the M protein can modulate the host innate immune response. Additionally, in MHV, researchers have demonstrated that the glycosylation state of the M protein critically influences the type I interferon (IFN) induction in vitro (de Haan et al., 2003), whereas N-linked glycosylation of the M protein’s oligosaccharide side chains has a stronger interferon-inducing capacity than O-linked glycosylation.
Certain CoVs can upregulate IFN expression during host infection. Studies on SARS-CoV have shown that the SARS-CoV M protein can upregulate the transcription of IFN-β (Wang and Liu, 2016). The M protein directly acts as a pathogen-associated molecular pattern (PAMP) in the cytoplasm to activate the Toll-like receptor signaling pathway and the TBK1–IRF3 signaling cascade, thereby stimulating IFN-β expression. In addition, high levels of IFN-α can be detected in the host during early TGEV infection, and the M protein has been confirmed as a potential IFN inducer that primarily promotes the activation of IRF3 (Zhou et al., 2017).
In some CoV infections, the M protein has been shown to play a role in inhibiting IFN expression. The SARS-CoV-2 M protein is a negative regulator of the innate immune response (Fu et al., 2021). SARS-CoV-2 suppresses the type I IFN response by promoting autophagy. This is achieved by inducing mitophagy through the M protein, which specifically inhibits RIG-I-MAVS-triggered IFN-β signaling (Hui et al., 2021). Additionally, the SARS-CoV-2 M protein inhibits the innate antiviral immune response triggered by RNA viruses by impairing recruitment of the mitochondrial antiviral signaling (MAVS) complex by TANK-binding kinase 1 (TBK1), TNF receptor-associated factor 3 (TRAF3), and interferon regulatory factor 3 (IRF3). The M protein targets MAVS to inhibit the innate antiviral response, and its TM1 and TM2 domains are critical for this inhibitory function. Moreover, the SARS-CoV-2 M protein can antagonize both type I and III IFNs, as it can affect the formation and assembly of a multiprotein complex (the RIG-I/MDA-5-MAVS-TRAF3-TBK1 signalosome) (Zhang et al., 2021; Zheng et al., 2020). MERS-CoV utilizes its M protein to suppress the expression of type I IFN by inhibiting TBK1-dependent phosphorylation and activation of IRF3, thus evading the host innate antiviral response (Kindler et al., 2016; Lui et al., 2016). Additionally, SARS-CoV also suppresses type I IFN production by disrupting the TRAF3-TBK1 association via interaction with TRAF3 (Siu et al., 2009). A study has found that the PEDV M protein inhibits TBK1/IKKε-induced phosphorylation and dimerization of IRF7, thereby suppressing IFN-I production and enhancing viral replication (Li et al., 2021).
Different human coronaviruses (HCoVs) cause infections in hosts with markedly varying severity. Pathogenic HCoVs. (SARS-CoV, SARS-CoV-2, and MERS-CoV) can evade innate immune responses by modulating the type I IFN pathway, frequently resulting in severe lower respiratory tract diseases such as acute respiratory distress syndrome (ARDS) (Table 2). During these highly pathogenic HCoV infections, upregulation of inflammatory cytokine genes often leads to dysregulation of the inflammatory response. In contrast, non-pathogenic HCoVs (HCoV-229E, HCoV-OC43) exhibit a comparatively weaker IFN-inhibitory capacity and typically cause only mild upper respiratory tract infections, commonly known as the common cold. Highly pathogenic HCoVs can upregulate IFN-stimulated genes, but in these viruses, the M protein has IFN-antagonistic activity and inhibits IFN expression. A reasonable explanation for this apparent contradiction is that the M protein itself can stimulate the production of IFN. However, when the intracellular IFN I pathway is activated by other pathways, the M protein exerts a negative regulatory effect on activated IFN I pathway. Furthermore, the differences in interferon regulation may also be related to other viral proteins, such as accessory ORFs, NSP1, and NSP2 (Liu et al., 2023b). Interferon regulation by CoV M proteins varies across strains, indicating that IFN regulation by M proteins is not evolutionarily conserved among HCoVs (Zheng et al., 2020).
M protein interacts with heat shock protein and enhances viral infection
3.3.2
Studies on porcine epidemic diarrhea virus (PEDV) have found that the M protein plays a key role in the collaborative replication of PEDV (Park et al., 2021). The infection is accompanied by strong overexpression of heat shock protein 70 (HSP70), which increases the replication speed of PEDV and leads to production of more virions. Through its direct interaction with HSP70, the M protein enhances the expression of this host factor, thereby facilitating PEDV replication. Another study also indicates that the PEDV M protein, which is distributed throughout the cell, affects the growth of intestinal epithelial cells (IECs) (Xu et al., 2015). It is also worth mentioning that previous studies have confirmed that the PEDV M protein, in addition to playing an important role in the viral assembly process, can induce complement-dependent antibodies that neutralize the virus (Song and Park, 2012). Studies on transmissible gastroenteritis virus (TGEV) have elucidated a previously unrecognized function of the M protein, demonstrating its participation in the initial stages of viral replication (Ji et al., 2023). The interaction between heat shock cognate protein 70 (HSC70) and the M protein intracellular domain, mediated by the substrate-binding domain (SBD) of HSC70, plays a role in viral entry. The M protein orchestrates internalization of TGEV by interacting with the host factor HSC70, thereby directing the virion into the cells via the clathrin-mediated endocytosis (CME) pathway. In human and mouse CoV-induced fulminant viral hepatitis, the expression of HSP70 is distinctly upregulated. These findings suggest that HSP70 enhances the replication of CoV in hepatocytes and further facilitates the release of pro-inflammatory cytokines such as IL-1β, TNF-α, and IL-6 during infection (Wang Q. et al., 2024). Differential gene expression analysis has revealed that the SARS-CoV-2 M protein significantly upregulates HSPs and co-chaperone proteins, and this regulatory effect is stronger than that of the N protein (Albalawi et al., 2025). Notably, HSPA6, HSPA1B, HSPBP1, and HSPH1 were identified as HSPs with increased expression (Albalawi et al., 2025; Sun et al., 2021). Severe SARS-CoV-2 infection can trigger an excessive immune response, resulting in a “cytokine storm,” in which macrophages play a central role, and HSP70 further promotes the infiltration of mononuclear phagocytes and neutrophils into the liver after CoV infection (Wang Q. et al., 2024).
M protein mediates the humoral and cellular immunity
3.3.3
It has been proven that the SARS-CoV M protein can induce strong neutralizing antibodies (Pang et al., 2004). Two immunodominant epitopes of the SARS-CoV M protein have been identified in the extreme N-terminal and C-terminal regions (He et al., 2005). The T cell epitope cluster, which is contained in the transmembrane domain of the M protein, plays a major role in driving M protein specific cellular immunity (Liu et al., 2010), inducing CD8 + T-cell responses and eliciting dominant cellular immunity during SARS-CoV infection (Ng et al., 2016; Tang et al., 2011). An immunodominant B cell epitope is located at the S4 position of the active center of the SARS-CoV-2 M protein, which potentially triggers neutralizing antibodies that inhibit the protease function of the M protein (Lu et al., 2021). In a study of PEDV, seven linear B-cell epitopes (designated M1 through M7) have been identified on the M protein, with lengths ranging from 12 to 22 aa (Polyiam et al., 2022). All seven peptide segments react positively with PEDV-seropositive porcine serum. Notably, epitopes M1, M2, and M6 are able to inhibit the neutralizing activity of all tested sera and are regarded as the most promising neutralizing targets in the M protein.
M protein affects the function of mitochondria and induces apoptosis
3.4
The M protein triggers caspase-dependent apoptosis via the mitochondrion-mediated pathway. Specifically, in SARS-CoV, the M protein promotes apoptosis by modulating the release of mitochondrial cytochrome c and affecting the cellular Akt pro-survival signaling pathway (Chan et al., 2007). Akt, also known as protein kinase B (PKB), is a central player in the cellular kinase cascade, regulating diverse cellular functions and influencing the viral life cycle. The PKB/Akt pathway is involved in multiple viral infections, including but not limited to SARS-CoV (Mizutani et al., 2006a; Mizutani et al., 2006b). Caspase-9 is a target of PKB/Akt-mediated phosphorylation, and the activity of caspase-8 is also indirectly regulated by PKB/Akt. Overexpression of the M protein suppresses PKB/Akt phosphorylation, leading to downregulation of pro-survival signaling and ultimately triggering apoptotic cell death. The PKB/Akt signaling pathway plays a pivotal role, as it is critical for maximal viral production and apoptosis regulation. Through the PKB/Akt signaling cascade, the expression of M protein activates caspase-8 and caspase-9 (Tsoi et al., 2014).
Depending on caspase-9, transient overexpression of the SARS-CoV-2 M protein initiates mitochondrial apoptosis by engaging B-cell lymphoma 2 (BCL-2) ovarian killer (BOK). According to the studies on SARS-CoV-2, the M protein has been identified as an inducer of mitochondrial apoptosis in pulmonary epithelial cells (Yang et al., 2022). The M protein triggers caspase-associated cell death, leading to the increased pulmonary barrier permeability (Wang F. et al., 2024; Yang et al., 2022). Studies investigating the interaction between SARS-CoV-2 M protein and human proliferating cell nuclear antigen (PCNA) have shown that the M protein induces cytoplasmic translocation of PCNA from the nucleus. This finding suggests that the M protein may promote DNA damage by involving PCNA (Zambalde et al., 2022).
A study has demonstrated that the M protein of SARS-CoV-2 induces neurodegeneration by disrupting the interaction between the Golgi apparatus and mitochondria (Wang F. et al., 2024). This study also confirms that the M protein interacts with two regulatory factors of the Arf signaling pathway, ArfGEF1 and ArfGAP1, in the Golgi apparatus. Since the PI4KIIIβ/Arf1 pathway is involved in the interaction between the Golgi complex and mitochondria, expression of the M protein may disrupt Golgi structure, thereby leading to structural abnormalities and functional impairment of mitochondria. The M protein promotes retrograde transport from the Golgi apparatus to the ER, inducing mitochondrial fragmentation and functional impairment, which results in decreased ATP production, excess reactive oxygen species (ROS), and ultimately cell death. Additionally, the autophagy cargo receptor (ACR) SQSTM1 plays a dual role in host-pathogen interactions and is often context-dependent. By phosphorylating SQSTM1 at a special site, the M protein promotes antiviral activity (Li et al., 2024). Another study shows that the M protein can also mediate and promote mitophagy. M protein contains a β_3–5_ domain, which plays a significant role in promoting mitochondrial autophagy through PDPK1-mediated phosphorylation of SQSTM1 site. The β_3–5_ domain can counteract the mitophagy promoted by the M protein. The M interferes with the function and distribution of the Golgi apparatus, leading to mitochondrial abnormalities and dysfunction, thereby contributing to neurodegeneration.
M protein interacts with viral RNA and packages the RNA into the viral particle
3.5
Genome packaging is a critical process arising from the parasitic status of CoV (Li et al., 2024). During the replication process of the CoV, large amounts of RNA are produced, including positive-strand genomic RNA (gRNA), negative-strand genomic RNA and positive-strand subgenomic RNA (Narayanan et al., 2000). Both the M and N proteins are known viral molecular partners that mediate packaging signal (PS) recognition. To ensure selective packaging of gRNA into viral particles, the CoV distinguishes its full-length gRNA from other cellular RNAs through genome-specific PS and specifically recruits M and N proteins (Bai et al., 2021). There are three types of models describing how the PS, N, and M proteins interact with each other to help CoVs achieve packaging selectivity. The N protein of the CoV contains two highly conserved, independently folded domains (the N-terminal domain and the C-terminal domain), and a third, predominantly acidic C-terminal region termed the N3 domain (Zhao et al., 2024). The genetic information model integrates the known functions of the N protein domain. In this model, the CTD serves as an RNA-binding module, whereas the N3 domain is hypothesized to be the exclusive site for interaction with the M protein (Chang et al., 2014; Kuo and Masters, 2013; Kuo et al., 2014; Kuo et al., 2016b). The second model explains how the M protein primarily recognizes PS. Evidence indicates that the M protein is the primary and perhaps the exclusive PS recognition element (Masters, 2019). The internal domain of the M protein is oligomerized at the assembly site of the cell membrane, enabling it to mediate specific PS recognition. This implies that each assembled virion contains an individual and specific contact site between the M protein and the gRNA PS (Neuman et al., 2011). Therefore, the M protein drives their interaction and condensation by initiating the nucleation of M-N and N-N protein complexes. The presence of C-tail is vital for the effective encapsidation of gRNA by the N protein, and this process is regulated by M protein (Han et al., 2024).
Studies on MHV have demonstrated that the M protein modulates PS recognition. Evidence indicates that the specific interaction between the M protein and the intracellular defective interfering ribonucleoprotein (DI RNP) complex involves the PS (Narayanan and Makino, 2001). The M protein mediates efficient packaging of DI RNP complexes into MHV particles by specifically recognizing and binding to those complexes that carry packaging signals. Researchers have concluded that the PS is responsible for the specific interaction between the M protein and particular intracellular RNP complexes, thereby drives the efficient and selective packaging of MHV RNAs containing these signals into viral particles.
The M protein plays a significant role in the viral life cycle and also modulates host cells activity. Its functions are closely related to its structural features. The major functions of M protein are elaborated in detail in Table 4.
Application
4
In terms of the application of M protein, although it is not yet widely used in clinical practice as a therapeutic agent, findings from various studies suggest that the M protein holds promising potential for broad clinical applications.
The important application value in drug research of M protein of CoVs
4.1
Due to its crucial role in the viral life cycle, the M protein of CoVs has significant potential for drug development. The M protein plays a key role in the assembly and budding of the virus, and its structure and function are essential for viral maturation and release. Therefore, the M protein can serve as a potential target for the research and development of antiviral drugs, facilitating the creation of specific therapeutic drugs against CoVs and providing robust strategies for epidemic prevention and control. For example, as mentioned above, the PDPK1-targeted peptide effectively eliminates CoV infection by redirecting SQSTM1 from the viral M protein to mitochondria, thereby restoring viral phagocytosis and simultaneously inhibiting mitochondrial autophagy (Li et al., 2024). HSC70 has been identified as a host factor required for the internalization of TGEV, dependent on its interaction with the viral M protein. Consequently, disrupting the M-HSC70 interaction presents a viable strategy for designing antiviral therapies against TGEV (Ji et al., 2023). Through the study of the M protein, researchers have identified a novel drug target within the CoVs replication cycle, along with a potent small-molecule inhibitor. It is essential to consider the membrane environment in drug design when targeting the M protein as a viral drug target. Research has shown that the 199_KxGxYR_204 motif in the C-terminal tail of the MERS-CoV M protein may represent such a CoV-specific target (Desmarets et al., 2023). Recently, a study has found that a structural expansion below the transmembrane domain and above the β-sheet intercalation domain within the M protein dimer is observed in all M proteins. This site is located in the β-sheet sandwich domain near the C-terminus and may serve as a potential drug-binding site (Yegnaswamy et al., 2025). Apart from the M protein, certain host cell components can also serve as targets for antiviral drug development. Research has found that PEDV, SARS-CoV-2, IBV, and PDCoV can all interact with Aurora A (AurA) and histone deacetylase 6 (HDAC6) in host cells via their M proteins, thereby inducing ciliary disassembly during the early infection (Zhuang et al., 2025). This finding suggests that AurA and HDAC6 may serve as broad-spectrum therapeutic targets for multiple CoV infections.
At present, numerous studies have identified a variety of compounds with potential activity against HCoVs (Table 5). CIM-834, an M-targeting molecule, exerts its late-stage mechanism of action by impeding the conformations interconversion from the M-short form to the M-long form, thereby inhibiting viral particle assembly (Laporte et al., 2025). Meanwhile, JNJ-9676, a small-molecule inhibitor targeting the M protein, exerts its antiviral effect by inducing the formation of a binding pocket and stabilizing a novel conformational state of the M protein (Van Damme et al., 2025). It exhibits potent antiviral activity in vitro against zoonotic strains of SARS-CoV and SARS-CoV-2 originating from bats and pangolins. Caffeic acid and ferulic acid are considered to have the potential to inhibit the SARS-CoV-2 M protein, which provides new insights for the development of anti-COVID-19 drugs (Bhowmik et al., 2020). Both compounds exert their effects by binding to Lys50 of the SARS-CoV-2 M protein. Similarly, colchicine, remdesivir, bafilomycin A1, and temozolomide can also bind to the M protein, thereby exerting an inhibitory effect on it (Peele et al., 2021). Additionally, ARF1 small-molecule inhibitors brefeldin A (BFA) and golgicide A (GCA), as well as the synthesized mimic peptide PEP17, can disrupt the localization of ARF1 and M protein on the ERGIC, thereby weakening virion assembly and inhibiting SARS-CoV-2 replication (Zhang et al., 2025). However, the effectiveness of these drugs still needs to be further verified through research.
M protein can serve as an immunological target for developing assays to detect the CoVs
4.2
As one of the viral structural proteins, the biochemical and immunological properties of the M protein indicate that it is highly suitable as a specific antigen for detecting CoV infection (Wu et al., 2023). The M protein can be used to develop immunoassay methods, such as enzyme-linked immunosorbent assay (ELISA) and immunochromatographic assays. In CCoV detection, ELISA based on recombinant M protein is regarded as an effective method for detecting CCoV-specific antibodies in canine serum (Elia et al., 2003). Similarly, an ELISA based on the M protein is highly significant for detecting PEDV and distinguishing its infection from other related porcine viral infections (Ren et al., 2011). In the research of TGEV, the M protein has been expressed in Escherichia coli as a GST fusion protein. Using the phage clone phTGEV-M7 as the antigen, a phage-based immunosorbent assay (PHAGE-ELISA) was established, which not only distinguishes TGEV from other CoVs., but also exhibits higher sensitivity than the conventional antibody-based ELISA (Zou et al., 2013). The M gene sequences of different PDCoV strains share 99% nucleotide identity, and are therefore commonly used as target genes for establishing nucleic acid-based diagnostic assays (Zhang et al., 2016). Monoclonal antibodies (mAbs) are laboratory-produced molecules that target specific viral antigens on the surface of pathogen. They are valuable and indispensable tools for investigating the antigenic properties of the M protein (Dong et al., 2016). Moreover, the establishment of a monoclonal antibody-producing hybridoma cell line specific for the CoV. M protein has laid a solid foundation for CoV identification and the development of related diagnostic methods. When developing molecular diagnostic methods (such as RT-PCR) for CoVs, researchers typically design specific primers and probes targeting the hydrophilic N-terminal coding region of the M gene (Xin et al., 2024). Compared with the high variability of the S protein, using the highly conserved M protein as a diagnostic antigen can theoretically improve the stability and accuracy of detection. However, this approach remains in the early stage of research and is far from clinical application.
M protein is a promising candidate for vaccine design
4.3
The M protein is not only the most abundant viral structural protein but also exhibits high conservation of its amino acid sequence. Specific antibodies recognize its extracellular domain, and this high degree of conservative makes it a promising candidate for a broad-spectrum vaccine. Vaccines based on the M protein therefore hold significant advantages. First, its high degree of conservation may enable cross-protective immunity against diverse CoVs, potentially even across species barriers. Second, in CoV, membrane fusion is mediated by the S protein, whereas the M protein does not participate in this process (Wang Q. et al., 2020). Vaccines developed based on the S protein carry certain risks (Amanat and Krammer, 2020), whereas the distinction between the S and M proteins suggests that vaccines targeting the M protein may be associated with a lower risk of vaccine-associated antibody-dependent enhancement (ADE). Additionally, the M protein can simultaneously stimulate antibody production and a robust T-cell immune response in the host, which may contribute to a more balanced and potentially long-lasting immunity. The antibody response to the extracellular domain of M protein likely confers broad protective benefits against diverse SARS-CoV-2 variants in the population, exerting this effect through cross-protection both in vitro and in vivo (Tang et al., 2024). This finding highlights the value of the M protein as a promising and important antigenic target for designing effective CoV vaccines. The majority of recent vaccines are based on the S protein as the immunogen to stimulate protective immunity against SARS-CoV-2 in humans (Corbett et al., 2020; Liu et al., 2021). In individuals who have recovered from COVID-19, the proportion of CD8 + T cells specific for the M protein exceeds that specific for the S protein (Peng et al., 2020). This finding highlights the need to include M protein epitopes in future vaccine designs (Mattoo and Myoung, 2022). Recently, researchers have identified a cluster of polyfunctional, CD4-restricted T-cell epitopes in a highly conserved region of the SARS-CoV-2 M protein. The epitopes that elicit polyfunctional T-cell responses are promising targets for developing effective vaccines and T-cell-based therapies, as they are designed to induce broad, protective immunity (Grifoni et al., 2020; Keller et al., 2020; Khadri et al., 2024). Currently, in clinical practice, there is no vaccine that uses the M protein as the primary immunogen. However, in models such as that of PDCoV, multi-antigen vaccines based on VLPs (containing S, M, and E proteins) have demonstrated superior efficacy (Liu et al., 2023a). The VLPs induce high levels of serum-specific IgG and virus-neutralizing (VN) antibodies in mice. Furthermore, incorporating both the N and M proteins into the vaccine can elicit a stronger cellular immune response, promoting CD8 + T-cell expansion and IFN-γ production (Yu et al., 2024). Compared with vaccines that express only the S protein, the multi-antigen vaccine can elicit a stronger and broader immune response in the host.
M protein is the key proteins for studying the mechanisms of CoVs infection
4.4
The M protein is indispensable for the process of virus assembly and release. Studying the structure, function, and interactions of the M protein with other viral or host cell proteins provides insights into the fundamental biological processes of CoVs, including infection pathogenesis, the viral replication cycle, and the molecular mechanisms underlying essential steps in virus assembly. Furthermore, the high-level expression of the M protein in infected cells makes it an ideal model system for studying glycosylation and ER-to-Golgi vesicular trafficking (Fung and Liu, 2018).
Discussion
5
The M protein is the most abundant structural protein in viral particles and plays a crucial role in viral assembly, protein-protein interactions, morphogenesis, host interactions, and immune regulation. In recent years, with the global spread of SARS-CoV-2, a series of important advances have been made in research on the M protein. The CoV M protein has a three-transmembrane-domain structure, and its dimerization is essential for viral envelope formation. By interacting with other viral structural proteins and ensuring their capture and coordinated incorporation at the budding site, the M protein orchestrates virion assembly (Tusnády et al., 2015). The M protein acts in synergy with the E protein to facilitate the formation of the viral envelope. It interacts with the N protein to promote the packaging of the RNP into viral particles. The M protein can inhibit the type-I interferon signaling pathway, evade immune recognition via glycosylation, and interact with the host vesicular transport system to promote viral release. The M protein also regulates apoptosis.
In structure studies of the M protein, although numerous studies have been conducted, many questions remain need to be addressed. The structural study has confirmed that the TM domain of the M protein is highly hydrophobic and lacks a detectable ion conduction pathway. However, its potential role in ion conduction cannot be excluded. It is unclear whether its oligomers can function as ion channels, and further research is needed to confirm this. Moreover, the M protein is highly conserved among CoVs, but its glycosylation patterns and specific interactions with other viral proteins exhibit variability. Do these discrepancies influence its function? How do distinct glycan moieties attached to the M protein affect its functional properties? Viral replication is highest in strains expressing an N-glycosylated M protein; however, the precise role of M protein N-glycosylation in CoVs remains incompletely understood (Juckel et al., 2023). A precise assessment of how M protein glycosylation influences its function still requires the development of more specific and sensitive analytical methods. With the development of mass spectrometry technology, new protein enrichment and analysis techniques have greatly enhanced the sensitivity and accuracy of glycosylation modification site identification, which may help advance research on the aforementioned issues. Specifically, current research has proposed a new model suggesting that the M protein of MERS-CoV contains four α-helical, hydrophobic transmembrane domains. And both its N-terminal and C-terminal domains are oriented toward the extracellular side of the viral envelope (Alharbi and Alrefaei, 2021). At present, research confirming this inference is limited. If this hypothesis holds true, the additional (fourth) transmembrane domain may stabilize the binding of M protein to the viral envelope or the host cell membrane, thereby enhancing the overall structural stability of the viral envelope. Moreover, the additional transmembrane region and its associated ring-like structure may expose new binding surfaces that mediate specific interactions with other M proteins, auxiliary viral factors, or host membrane proteins, which may affect viral assembly or immune modulation.
The M protein holds significant importance and considerable potential value in immunological detection, the development of novel anti-CoV drugs, and vaccine research. In developing immunological targets for coronavirus detection, the M protein is typically used as an antigen in ELISA. In the Escherichia coli expression system, the M protein can be expressed at high levels, facilitating the preparation and standardization of ELISAs based on recombinant M protein (Elia et al., 2003). Furthermore, the availability of substantial quantities of recombinant M protein provides an opportunity to gain deeper insights into the biological functions of the M protein and its immunological role during CoV infection. In terms of research on drugs against CoVs, small molecules or polypeptide inhibitors can interfere with M-M protein interactions, thereby disrupting the formation of the viral envelope scaffold. By interfering with the interaction between the M and N proteins, this approach is expected to prevent recruitment of the viral genome to the assembly sites. When developing antiviral drugs, locking the M protein into a specific conformation to block virion assembly represents a key strategy (Laporte et al., 2025; Van Damme et al., 2025). The M protein is highly conserved, and drugs developed against it may have broad-spectrum activity against CoVs. In addition, the M protein holds significant potential as a supplementary antigen in vaccine development. Although the M protein exhibits a limited ability to induce neutralizing antibodies, it can elicit high-titer binding antibodies. Meanwhile, the M protein contains abundant and highly conserved T-cell epitopes that can strongly induce CD4 + and CD8 + T-cell responses. These responses are crucial for eliminating infected cells and establishing long-term immune memory. The emergence of the novel human SARS-CoV-2 virus has underscored the critical and urgent need for continuous monitoring of viral infections, the development of practical antivirals against CoVs, and the development of innovative and more effective vaccines (Chalupka et al., 2024; Polatoğlu et al., 2023; Zhu et al., 2022). At the same time, the structural similarities among the M proteins of different CoVs suggest that developing a common inhibitor targeting these CoVs is feasible. Research suggests that the mutation in the M protein may be induced by co-occurring mutations in the spike protein receptor-binding domain. This finding is of great significance for vaccine development and therapeutic strategies (Ye et al., 2024). Expression both the S and M proteins in a vaccine can trigger a stronger immune response than expressing only the S protein (Bellier et al., 2022). This stronger immune response is specifically manifested by vaccines containing both proteins, which induce high levels of serum IgA and mucosal IgA. A study has found that in the vaccine breakthrough infection (VBI) cohort, immunity against the wild-type SARS-CoV-2 and its S proteins declined after receiving the primary booster vaccine against COVID-19 (Paniskaki et al., 2022). This result indicates that the neutralizing efficacy of vaccines based solely on the S protein as an antigen is limited.
Currently, there are still many challenges in the research on M protein applications. For instance, the immunogenicity of the M protein is relatively weak, potent adjuvants or advanced delivery systems are needed; structural analysis—particularly of its transmembrane domain—remains technically challenging, hindering the rational design of targeted therapeutics; and clinical data on M protein-based vaccines is still insufficient, warranting further validation through well-designed trials. At present, research on the S protein of CoVs is more in-depth and extensive. As CoVs continue to mutate and new epidemics emerge, developing an M protein–based “second pillar” strategy represents a wise and prudent approach. This complementary strategy augments the current S protein-centric paradigm and is essential for constructing a robust, multi-layered defense system against CoVs.
This article reviews the structure, biological functions, and potential applications of the CoV M protein. Much remains unknown about the M protein and requires further study, such as the immunomodulatory function of distinct M protein mutants, the effect of the M protein on host cell metabolism, and its inhibitory effects on CoV infection. In addition to the major structural proteins, other accessory proteins are also present in the CoVs. Although these proteins are not always essential for viral replication, they may play certain roles in processes such as viral packaging and immune evasion. However, interactions between the M protein and these accessory proteins, as well as their precise functions, remain unclear and require rigorous experimental validation. Other critical questions related to the M protein are also worthy of exploration: Do M protein mutants elicit cross-reactive T-cell responses against heterologous CoVs? What is the therapeutic efficacy of M protein-targeted inhibitors in vivo? And how does M protein expression modulate antigen presentation or dendritic cell activation? Addressing these questions will highlight promising directions for future research. Furthermore, the M protein has demonstrated considerable value in laboratory studies and warrants further investigation into its potential clinical applications. In summary, this review synthesizes current knowledge on the CoV M protein, providing a foundational reference to enhance mechanistic understanding of its biological functions and to inform assessments of its potential as a therapeutic target and for vaccine development.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Aebi M. (2013). N-linked protein glycosylation in the ER. Biochim. Biophys. Acta 1833 2430–2437. 10.1016/j.bbamcr.2013.04.001 23583305 · doi ↗ · pubmed ↗
- 2Albalawi W. Thomas J. Mughal F. Kotsiri A. Roper K. Alshehri A. (2025). SARS-Co V-2 S, M, and E structural glycoproteins differentially modulate endoplasmic reticulum stress responses. Int. J. Mol. Sci. 26:1047. 10.3390/ijms 26031047 39940816 PMC 11816748 · doi ↗ · pubmed ↗
- 3Alharbi S. Alrefaei A. (2021). Comparison of the SARS-Co V-2 (2019-n Co V) M protein with its counterparts of SARS-Co V and MERS-Co V species. J. King Saud. Univ. Sci. 33:101335. 10.1016/j.jksus.2020.101335 33432259 PMC 7787911 · doi ↗ · pubmed ↗
- 4Amanat F. Krammer F. (2020). SARS-Co V-2 vaccines: Status report. Immunity 52 583–589. 10.1016/j.immuni.2020.03.007 32259480 PMC 7136867 · doi ↗ · pubmed ↗
- 5Arndt A. Larson B. Hogue B. G. (2010). A conserved domain in the coronavirus membrane protein tail is important for virus assembly. J. Virol. 84 11418–11428. 10.1128/JVI.01131-10 20719948 PMC 2953170 · doi ↗ · pubmed ↗
- 6Bai Z. Cao Y. Liu W. Li J. (2021). The SARS-Co V-2 nucleocapsid protein and its role in viral structure, biological functions, and a potential target for drug or vaccine mitigation. Viruses 13:1115. 10.3390/v 13061115 34200602 PMC 8227405 · doi ↗ · pubmed ↗
- 7Bellier B. Saura A. Luján L. Molina C. Luján H. Klatzmann D. A. (2022). Thermostable oral SARS-Co V-2 vaccine induces mucosal and protective immunity. Front. Immunol. 13:837443. 10.3389/fimmu.2022.837443 35281065 PMC 8913903 · doi ↗ · pubmed ↗
- 8Bhowmik D. Nandi R. Jagadeesan R. Kumar N. Prakash A. Kumar D. (2020). Identification of potential inhibitors against SARS-Co V-2 by targeting proteins responsible for envelope formation and virion assembly using docking based virtual screening, and pharmacokinetics approaches. Infect. Genet. Evol. 84:104451. 10.1016/j.meegid.2020.104451 32640381 PMC 7335633 · doi ↗ · pubmed ↗
