Introduction
Zika virus (ZIKV) is an endemic, neurovirulent arbovirus whose primary vectors are
Aedes aegypti and
Aedes albopictus mosquitoes. Its primary transmission mode is the horizontal transfer of mosquito-infectious saliva during blood feeding (
Musso and Gubler 2016). Currently, the Centers for Disease Control and Prevention has designated 86 countries as ZIKV risk areas (
Prevention 2022). ZIKV is popularly known for the 2015/2016 epidemic in Brazil; during that time, it was revealed that the virus can be vertically transmitted from an infected pregnant woman to their fetuses, which can lead to congenital disabilities such as microcephaly in utero (
Tang 2016). ZIKV can also cause debilitating symptoms in adults, including Guillain-Barré Syndrome, fever, rash, headaches, and joint pain, and can also be sexually transmitted (
Barzon et al. 2016;
Mansuy et al. 2016;
Tang 2016). Currently, there are no treatments or vaccines available (
USA Food and Drug Administration 2022). ZIKV possesses a positive-sense, non-segmented RNA genome of ∼10.4 kb, which is composed of a 5′ terminal region (TR) (∼108 nt), three structural genes—capsid (C), membrane (prM), and envelope (E)—and seven nonstructural genes—NS1, NS2A, NS2B, NS3, NS4A, NS4B, and NS5, followed by a long, 3′ TR (∼432 nt). The TRs play vital roles in viral genome cyclization, which regulates viral transcription and translation, and they alter host mRNA turnover, suppresses interferon response, and modulate the host-cell environment (
Donald et al. 2016;
Chavali et al. 2017;
Sirohi and Kuhn 2017;
Sanford et al. 2019;
Schneider and Wolfinger 2019;
Michalski et al. 2019;
Wolfinger 2021).
An area of interest for antiviral targeting is G-quadruplexes (G4s)—noncanonical structures identified in all domains of life and almost all viral groups in the Baltimore classification (
Ruggiero and Richter 2020). Viral G4s are known to play critical roles throughout regulatory processes in their life cycles, such as transcriptional and translational activities, as well as alternative splicing (
Patel et al. 2000;
Métifiot et al. 2014;
Murat et al. 2014;
Meinke et al. 2016). G4 structures consist of at least two consecutive guanine tetrads in close proximity Hoogsteen base pairing with each other in both DNA and RNA (
Wang and Patel 1993;
Burge et al. 2006;
Bhattacharyya et al. 2016), altogether stabilized by a monovalent cation (preferably K
+) at the coordinating site (
Fig. 1A). DNA G4s can adopt parallel, antiparallel, and hybrid conformations, whereas RNA G4s exclusively adopt propeller-type parallel topology due to the 2′-OH preventing syn-conformation torsion angles (
Fay et al. 2017) (
Fig. 1A).
It has been shown that many human proteins, including DExD-box helicases, interact with G4s (
Linder et al. 1989;
Brázda et al. 2014;
Zhang et al. 2021). The DExD-box family proteins are involved in a wide range of functions, including embryonic development, cell proliferation, hematopoiesis, metabolism, immune response, cancer pathogenesis, inflammation, autoimmune diseases, and influence of viral replication (
Meier-Stephenson et al. 2018;
Andrisani et al. 2022). We have previously demonstrated that the DEAD-box helicases, DDX3X
132-607 or DDX17
135-555, interact and unfold the intergenic region (IGR) and 5′ noncoding region of the Rift Valley fever virus and the Japanese encephalitis and Zika virus 5′ TRs, respectively (
Nelson et al. 2020;
Nelson et al. 2021). DDX3X has been reported as a critical factor required for dengue and hepatitis C viral replication and as an inhibitory factor for West Nile virus replication (
Ariumi et al. 2007;
Brai et al. 2019). It has also been shown to interact with RNA G4s. Conversely, it is unclear if DDX17 directly binds to G4s, since it is involved in G4 binding with other proteins, but no direct interaction has been demonstrated (
Fortuna et al. 2021;
Dardenne et al. 2014). A previous study on the identification of the ZIKV genomic RNA structure of both African and Asian/American lineages using selective 2′-hydroxyl acylation analyzed by primer extension identified a conserved, canonical stem-loop structure within the 3′ TR (classified as the 3′ stem-loop [SL]) (
Li et al. 2018). Interestingly, a study by
Fleming et al. (2016) found potential G-quadruplex sequences within ZIKV using the quadruplex-forming G-rich sequences Mapper, and they identified and characterized a G4 contained in the 3′ SL, albeit using a 25-mer oligo short-range interaction approach (
Kikin et al. 2006;
Fleming et al. 2016). From the perspective of long-range RNA interactions, it is still unclear if a G4 is maintained in the 3′ SL. Additionally, a previous study indicated that conformational switching could occur between G4 and canonical RNA structures, as in the case of this study, a hairpin (
Bugaut et al. 2012). These events are based on the proximity of specific mono and divalent cations, which are altogether generated by RNA liquid--liquid phase separation (
X. Liu et al. 2021). From these recent discoveries, the 3′ SL could exist as canonical and noncanonical states based on ion proximity.
Because of the lack of treatment options for ZIKV, investigation of alternative targets within the viral structure or life cycle is being explored, and G4s is one of them (
Majee et al. 2021). The G4 binding molecules, Braco-19, TMPyP4, Berberine, NiL, 360 A, and pyridostatin (PDS), have all been shown to interact with ZIKV G4s in vitro and cell culturing assays and showed that these molecules altogether reduce viral replication and protein production to varying degrees (
Majee et al. 2021;
Zou et al. 2021). It remains unclear if the 3′ TR G4 exists in a larger RNA structure containing long-range interactions, as previous studies focus on a short-range approach using short oligos. Moreover, deciphering the various human proteins that interact with this G4 could provide insight into its function during viral replication.
In this study, we investigated the presence of a G4 in the full-length ZIKV 3′ TR. For this purpose, we first used bioinformatics analyses to demonstrate that the 3′ SL G4 sequence is present in multiple ZIKV isolates. By implementing a long-range interaction approach with long RNA transcripts of the 3′ TR (432 nt) and 3′ SL (104 nt) (
Fig. 1B), we were able to identify the presence of a G4 in the 3′ TR transcript using the BG4 antibody, TMPyP4, and PDS, which all specifically bind to G4s. Subsequently, we elucidated that human DDX3X
132-607 and DDX17
135-555 interact with the 3′ TR and that DDX17
135-555 unwinds the G4 present in the 3′ SL in an ATP-dependent manner.
Materials and methods
Bioinformatics analysis of conserved G4s in the 3′ TR
ZIKV viral genome data were downloaded from NCBI GenBank on 14 January 2021, comprising 1028 ZIKV isolates. Filtering for isolates that cover the complete G4-containing 3′ SL regions was performed with an Infernal covariance model (
Nawrocki and Eddy 2013) constructed from the 3′ SL seed alignment of the Spondweni group featured in the viRNA GitHub repository (
https://github.com/mtw/viRNA), resulting in 113 ZIKV isolates. Filtering these for a nonredundant set of 3′ SL regions resulted in a total of 14 representative ZIKV isolates. The ZIKV isolates from Haiti used in this study (KU509998.3) are identical to strain MN577544.1, featured in our nonredundant set, throughout the entire 3′ SL region. A structural multiple sequence alignment of the 3′ SL region of 14 representative ZIKV isolates was computed with LocARNA (
Will et al. 2007) and visualized with Jalview (
Clamp et al. 2004).
RNA transcript expression of 3′ TR, 3′ SL, and G4 mutant
The ZIKV 3′ TR (10375–10807) (432 nt) sequence was derived from the ZIKV isolate from Haiti (GenBank: KU509998.3) (
Lednicky et al. 2016). This isolate also has a nearly identical sequence to the Asian/American lineage, one of the predominant lineages across the globe (
Beaver et al. 2018). We designed three plasmid constructs from this sequence containing the full 3′ TR, the 3′ SL, and a 3′ TR with the G4 sequence mutated (G4 mutant). The G4 mutant construct was designed wherein five nucleotide mutations were made (5′-GGCGGCCGGUGUGGGG-3′ → 5′-GACGACCGAUGUGAGA-3′) (
Fig. 1B) in the hypothesized G4 sequence to disrupt the G4 formation. All constructs were made using pUC57-Kan plasmids with a T7 promoter upstream of the desired transcript, with an
XbaI cut site at the end of the transcript to linearize the plasmid such that T7 polymerase dissociates during in vitro transcription (IVT) reactions. The 3′ TR and 3′ SL plasmid constructs were commercially synthesized by Integrated DNA Technologies, and BioBasic synthesized the G4 mutant.
The synthesized plasmids were transformed into competent Escherichia coli NEB5ɑ cells (New England Biolabs), and the cells were further propagated to replicate the plasmid DNA for IVTs. The plasmid DNA was extracted, linearized using XbaI restriction digestion, and subsequently used for IVT reaction. The T7 polymerase-based (expressed and purified in-house) IVT reaction was performed by mixing transcription buffer (200 mM Tris-Cl [pH 7.5], 75 mM MgCl2, 10 mM spermadine, and 50 mM NaCl), 100 mM DTT, 25 mM nucleoside triphosphates, 100 mM guanosine monophosphate, 0.5 U/µL IPPase, 10 µM T7 polymerase, Ribolock, and the digested plasmid and performed for 3 h at 37 °C. This was followed immediately by purification via size-exclusion chromatography (SEC) using a Cytiva Superdex 200 Increase 10/300 GL for the ZIKV 3′ SL and a Sephacryl S-400 HR for the 3′ TR and G4 mutant RNA (Fig. S1A/B). The columns were pre-equilibrated in G4 buffer (10 mM HEPES [pH 7.5] and 100 mM KCl). For purity analysis, the peak fractions were analyzed on a 2% agarose gel (Fig. S1C). The purified RNA was then ethanol precipitated at −80 °C in preparation for RNA labeling for subsequent experiments.
Expression and purification of BG4 antibody, as well as DDX17135-555 and DDX3132-607 proteins
The plasmid coding for the BG4 antibody was obtained from Addgene (pSANG10-3 F-BG4) and transformed into E. coli BL21 DE3 for protein expression. The transformed cells were grown at 37 °C until the optical density reached 0.6 at 600 nm. Subsequently, cells were induced with 1 mM isopropylthio-β-galactoside and were grown for 16 h at 16 °C, followed by cell harvesting through centrifugation. The harvested cells were resuspended in lysis buffer (50 mM Tris-Cl [pH 7.5], 500 mM NaCl, 0.2% tween, 5% glycerol, 5 mM β-mercaptoethanol, and 10 mM imidazole), supplemented with 0.1 mg/mL lysozyme, 200 units of DNase I (Thermofisher), 2 mM PMSF, as well as protease inhibitors (pepstatin, aprotinin, and AEBSF) from Biobasic. The cell suspension was sonicated and clarified using centrifugation (30 000× g); the clarified lysate was loaded onto a 1 mL Histrap high-performance column from Cytiva mounted on an Äkta Start system and purified using an elution gradient from 10 to 150 mM imidazole supplemented to the lysis buffer. The fractions were purified via SEC using a Superdex 75 increase 10/300 GL (Cytiva) pre-equilibrated with 1X PBS mounted on an Äkta Pure system (Fig. S2). The fractions were analyzed on a 12% SDS-PAGE gel, pooled, and concentrated to 14 µM and stored at −20 °C until further use.
DDX3X
132-607, DDX17
135-555, and DDX17 mutants are altogether truncated versions of the protein (Fig. S3A), and each was expressed according to protocols described previously using an
E. coli expression system similar to the BG4 antibody (
Nelson et al. 2020,
2021) (Fig. S3B). DDX17 mutant was also purified using a pET28a cloned vector with glycine mutations on amino acids which coordinate ATP hydrolysis; the Q motif, which interacts with the adenine of ATP; and motifs 1 and 2 in the ATP binding domain, which interact with the triphosphate (Fig. S3A,C) (
Ali 2021a;
Ali 2021b;
Tanner et al. 2003). The mutant assays were performed to prevent ATP hydrolysis, and consequently, helicase activity.
Alexa Fluor 488 labeling of the RNA
Each purified RNA transcript was labeled at the 5′ end with Alex fluor 488. Note that 1.25 mg of 1-ethyl-3- (3-dimethylaminopropyl) carbodiimide hydrochloride was mixed with 8 µL of concentrated RNA in water and 5 µL Alexa Fluor 488 in 0.2 M KCl, and 20 µL of 0.1 M imidazole (pH 6). All reactions were incubated at room temperature for 18 h. The RNA was then diluted in 460 µL of G4 buffer (10 mM HEPES [pH 7.5] and 100 mM KCl) and loaded onto a Cytiva Superdex 200 Increase 10/300 GL to purify the labeled RNA from the unlabeled fluorophore. Each RNA sample was then collected and heat-cooled at 95 °C for 5 min, followed by room temperature incubation for 15 min. The RNA was tested for fluorescence counts on the NanoTemper Technologies Monolith NT.115 microscale thermophoresis (MST) device to ensure optimal labeling for downstream experiments, similar to previous experiments that used Alex Fluor 488 for labeling their nucleic acids (
Yoo et al. 2004;
Johnson et al. 2006;
Kynast et al. 2004). The labeled RNA was then vitrified in liquid nitrogen and stored at −80 °C until downstream experimentation.
Interaction studies using microscale thermophoresis
A Monolith NT.115 MST from NanoTemper Technologies was used to evaluate the affinity of RNA constructs with their ligands. The ZIKV 3′ TR, G4 mutant, and 3′ SL RNA transcripts were evaluated at 80 nM at 100% excitation at medium MST power for the BG4 binding affinity and PDS binding check runs. For the MST binding studies, the samples were serially diluted two-fold in the MST capillaries with concentrations ranging from 6.8 µM – 0.8 nM for BG4, 7 µM – 14.6 nM for DDX3X132-607, and 19 µM – 0.58 nM for DDX17135-555. All MST data were collected in triplicate at room temperature using the Nano-Blue filter and medium IR-laser power for BG4, while high power was used for DDX3X132-607 or DDX17135-555. The dissociation constants were calculated using the Monolith NT.115 analysis software by plotting the ligand % bound, and for the PDS runs, the average response amplitude was recorded in triplicate, and the Monolith NT.115 analysis software provided the signal-to-noise ratio values.
Electrophoretic mobility shift assays (EMSAs) were performed to support the MST binding data (Fig. S4) with the ZIKV 3′ TR mixed with two-fold serial diluted concentrations of DDX3X132-607 or DDX17135-555. Furthermore, the oligonucleotides used in the MST helicase assay were also mixed with DDX3X132-607 or DDX17135-555 to ensure that the helicase assay results on the MST were a result of the 3′ SL being unwound, not the oligo binding to the protein. The labeled ZIKV RNA was scanned on Cy2, whereas the oligo was scanned on Cy5.
Unwinding assays of the ZIKV 3′ TR G4
To assess the ability of DDX3X
132-607 and DDX17
135-555 to unwind the noncanonical structured area in the 3′ SL, we used MST and a complementary RNA (5′-AGUUUCCACCACGCUGGCCGCCA-3′) that would only base pair to the G4 portion of the target RNA, if it was unfolded by DDX3X
132-607 or DDX17
135-555, as performed previously (
Nelson et al. 2020,
2021). When fluorescence and (or) migratory differences occur between the BSA control and the helicase, this provides evidence that the helicase unfolded the noncanonical region, as the oligo base pairing causes a change in fluorescence, shape, charge, and (or) hydration shell of the target RNA (
Huang and Zhang 2021). The reaction was performed using 1 µM ZIKV 3′ SL RNA sample, 4.25 µM ATP, 40 nM Cy-5 DNA Oligo, and 20 µM of either DDX3X
132-607 or DDX17
135-555. BSA was selected as a negative control. The reaction was incubated for 15 min prior to the measurements. The experiments were performed eight times in total over two binding check runs on the Nanotemper Monolith NT.115. At 50% excitation and medium MST power, the signal-to-noise ratio was assessed for each run to determine successful unwinding activity, where, again, a signal-to-noise ratio of ≥5 indicates that binding is occurring.
A spectrofluorimetric unwinding assay was also performed using a Quanta Master 60 fluorescence spectrometer (Photon Technology International). The reactions were performed using a mixture of 2 µM protein, 50 nM RNA, and 3.5 µM ThT—a well-characterized G4 binding biosensor (
Xu et al. 2016). The mixture was incubated for 5 min to allow ThT to bind to the G4 on the RNA, followed by a fluorescence measurement taken at 425 nm excitation and emission collected from 450–510 nm with a step size of 1 nm increments. Following initial data collection, 60 µM of ATP was titrated into the mixture, followed by a 5 min incubation to allow the helicase to unfold the G4, preventing ThT from binding, as the G4s is unfolded, and the same fluorescence parameters were used to collect the emission. The change in fluorescence was then plotted (
n = 3), and a
t-test was performed to check for statistical significance. DDX17
135-555, DDX3X
132-607, and a control, BSA, were used to validate the MST results of unwinding.
Discussion
Our work demonstrates the presence of G4 in ZIKV 3′ TR and establishes that human helicase DDX17
135-555 can unwind the G4 structure. These results unveil a previously unreported function of DDX17. Further work is required to elucidate the function(s) of the 3′ TR G4 in the ZIKV life cycle. DDX3X has been extensively studied for its ability to interact with various RNA structures, including G4 (
Herdy et al. 2018). Consistent with this finding, our previous study demonstrated the capacity of DDX3X
132-607 unwinding secondary structure within the ZIKV and Japanese Encephalitis virus 5′ TRs (
Nelson et al. 2021). We hypothesize that the observed inability to unwind G4 structures might be attributed to a structural feature within the substrate rather than the absence of a functional domain within the helicase. However, it is still unclear whether DDX3X can unwind G4s (
H. Liu et al. 2021;
Caterino and Paeschke 2022). Previous literature porposes hypotheses into the role of G4s in the 3′ TR of the viral genome, such as serving as a “capped” degradation-resistant region comparable to G overhangs in telomeres, evasion of the host intracellular immune response, assisting in viral transcription initiation, or act as another element in host protein “sponging,” as this structure is contained in sub-flaviviral RNA (
Moon et al. 2015;
Lista et al. 2017;
Reznichenko et al. 2019;
Michalski et al. 2019;
Bryan 2020). Moreover, protein sponging could also be assisted with a G4 serving as a nucleation point for RNA phase separation, potentially giving the virus more control of ion proximity and structural switching between a G4 and stem-loop (
Bugaut et al. 2012;
X. Liu et al. 2021). This would potentially provide ZIKV an extension of the number of human proteins that can be recruited to the terminal regions as well, thus providing a more streamlined replication. Furthermore, it has also been shown that the 12 regions of the 3′ TR contain N6-methyladenosines (m
6A) enrichment, and eight out of the 12 m
6A regions were on the 3′ SL G4. Previous literature showed that in the case of ZIKV, when m
6A was suppressed, it increased viral replication, suggesting that m
6A could be a mechanism the cell uses to slow viral replication (
Fleming et al. 2019). Overall, further investigation into the role of G4 from ZIKV 3′ TR is required.
ZIKV remains a significant public health concern globally, and due to concerns of potential antibody-dependent enhancement caused by antibodies generated from flaviviral vaccines or infection, novel treatment avenues to stop viral replication are urgently required. To this end, our study demonstrated that ZIKV contains a G4 within the 3′ SL of the 3′ TR that is maintained across multiple ZIKV isolates and is unfolded by human DDX17135-555. This discovery allows for further investigation to exploit this structured region by providing potential treatments using G4 binding molecules or other molecules that disrupt human DDX17–3′ SL interaction. Further studies of the function of the 3′ SL G4 during viral replication are essential, but the foundation of the identification of the G4 and two RNA-binding proteins allows for a benchmark to start exploring this region as a target to attempt to stop ZIKV replication, providing a novel treatment for ZIKV infection.