In Silico Characterisation of a Type D Betaretrovirus from Malayan Flying Fox, Pteropus vampyrus
Received Date: Mar 06, 2018 / Accepted Date: Mar 27, 2018 / Published Date: Mar 30, 2018
Bats are well known for their role as reservoirs of viral diseases. Previous studies have reported the presence of endogenous gamma and betaretrovirus in diverse species of bats. Here, we characterize one of these lineages- Pteropus vampyrus endogenous betaretrovirus namely PvEB in depth describing the genomic organization, age estimation and phylogenetic approaches to examine the origin and history of this virus. Phylogenetic analysis of gag, pol and full genome sequence has placed PvEB robustly within the betaretrovirus genus. Subsequently, PvEB was estimated to invade the bat genome since 1.5-8.8mya. The results of bioinformatics analysis were verified through amplification of the PvEB reverse transcriptase gene with the size of 700bp. With these findings together with the previous report, it is evident that PvEB is a member of type D betaretrovirus group.
Keywords: Asian bat; Pteropus vampyrus ; Endogenous betaretrovirus
AMV: Avian Myeloblastosis Virus; BFV: Bovine Foamy Virus; BIV: Bovine Immunodeficiency Virus; BLV: Bovine Leukemia Virus; CAEV: Caprine Arthritis-Encephalitis Virus; EFV: Equine Foamy Virus; EIAV: Equine Infectious Anemia Virus; FFV: Feline Foamy Virus; FIV: Feline Immunodeficiency Virus; HFV: Human Foamy Virus; HIV1: Human Immunodeficiency Virus; HIV2: Human Immunodeficiency Virus II, HTLV1: Human T-lymphotropic Virus-1; HTLV2: Human Tlymphotropic Virus-2; HTLV3: Human T-lymphotropic Virus-3; MMTV: Mouse Mammary Tumor Virus; MSFV: Macaque Simian Foamy Virus; PCOV: Porcine Type-C Oncovirus; RELIK: Rabbit Endogenous Lentivirus Type-K; REV: Reticuloendotheliosis Virus; RSV: Rous Sarcoma Virus; SIV: Simian Immunodeficiency Virus; VISNA: Visna-Maedi Virus; WDH1: Walleye Dermal Hyperplasia 1; WDH2: Walleye Dermal Hyperplasia 2; WDSV: Walleye Dermal Sarcoma Virus.
Retroviruses that are coming from the Retroviridae family infect vertebrates and are characterized by a replication strategy in which their RNA genomes are reverse transcribed into a DNA copy and integrated into the genome of the infected cell. Occasionally, this process occurs in germ line cells, such that integrated retrovirus DNA can be vertically inherited as genomic loci called endogenous retroviruses (ERVs) . ERVs can reach fixation in their host species, however, most are defective and incapable of producing infectious particles . Many mammalian species harbor ERVs are closely related to the exogenous gamma and betaretrovirus genera including rodents and bats.
Bats are the second most species-rich group of mammals (after rodents) and are broadly divided into two groups, Microchiroptera and Megachiroptera. The latter group comprises single family, called Pteropodidae that includes all flying foxes and Old World fruit bats. The Microchiroptera and Megachiroptera are estimated to have diverged approximately 45-50.2 million years ago (mya) -within the range of available fossil data for bats [3,4]. The lineage that gave rise to the contemporary Megachiropteran species, Pteropus vampyrus is estimated to have diverged from the main Megachiropteran branch approximately 31.9 mya .
Bats are known for gregarious roosting lifestyle which makes them as an ideal host for numerous pathogens  including Hendra virus , Nipah virus [7,8] and Ebola virus  that conveyed significant health and economic impact as humans and livestock are jeopardized by these viral pathogens. Previous study has discovered bats to harbor diverse endogenous gammaretroviruses  and endogenous betaretroviruses . Hayward et al. has revealed the integration time of endogenous retrovirus into bat genome which was based on the number of nucleotide variations between 5’ and 3’ LTR of each betaretrovirus. It was estimated to take place within 3.2mya to 36.3mya with one element closely to PvEB, called PvERV-βJ was likely to integrate at 6.3mya.
In addition, Betaretroviruses can be further divided into Type B and Type D based on distinctive morphologies of its retroviral particle  and genomic organisation.
Under the genomic organization, some of these betaretroviruses, could be infectious such as Type B Mouse Mammary Tumor Virus (MMTV) which possess superantigen (sag) protein as a virulence factor and Type D (Mason Pfizer Monkey Virus, MPMV and Jaagsiekte Sheep Retrovirus, JSRV) which encode oncogenic gene at its envelope region. In this study, we analyze PvEB with phylogenetic analysis of their viral genes (gag, pol, env), dating the invasion time of PvEB into the Pteropus vampyrus genome and verified its presence using PCR amplification of its reverse transcriptase gene.
Materials and Methods
Input sequences used in bioinformatic analysis consisted of nucleotides and amino acids sequences. These sequences were obtained from published studies and are available in GenBank and UniProt databases. Sequences of gag, pol and env were tested as query sequences in BLAST, sequence alignment and phylogenetic tree analysis. Among the sequences used were HIV-1 (K03455), HIV-2 (M30502), VISNA (L06906), EIAV (AF033820), FIV (P16088), CAEV (ACN82422), BIV (AAA42767), SIV (M30931), MMTV (P03365.2), JSRV (P31623), SMRV (M23385), MPMV (M12349), RSV (NC_001407), AMV (Q83135), BLV (K02120), HTLV-1 (NC_001436), HTLV-2 (NC_001488), HTLV-3 (DQ093792), BFV (NC_001831), EFV (Q9J4C88), FFV (Y08851), HFV (Y07725), MSFV (H6V7I7), WDH1 (Q9WHI7), WDH2 (Q9WHJ2), WDSV (A4GTG2), REV (KJ909531), XMRV (EF185282), MULV (AF010170), PCOV (AJ279057), GALV (NC_001885), Gypsy Ty3 retrotransposon (U15406). In addition, sequences of Rabbit Endogenous Lentivirus Type-K (RELIK) were acquired from Katzourakis et al., and Keckesova et al.
During genomic mining analysis of Pteropus vampyrus , 26 solo LTRs and 36 partial or full proviruses were successfully retrieved. Additionally, when determined through BLAST analysis, PvEB was shown to have higher percentage of nucleotide similarity with type D betaretroviruses, such as MPMV (60% identity; E-value 0.0) and SMRV (59% identity; E-value 0.0) rather than to type B betaretroviruses, MMTV (50% identity; E-value 0.0) and JSRV (54% identity; E-value 0.0).
The LTRs (5’ and 3’) were searched based on sequence similarity between the upstream of gag and downstream of env. The difference between the pair of the LTRs were observed and calculated in percentage in order to predict whether it is a recent integration of retrovirus into the host genome. The LTRs were also searched for TATA box, primer binding sites (PBS) at its 5’ and polypurine tract (PPT) at its 3’.
Phylogenetic tree reconstruction
In order to determine the phylogenetic relationships between PvEB and other groups of retroviruses, maximum likelihood (ML) trees were constructed from the ClustalW alignments of gag, pol, env and fulllength sequences. Using MEGA 5.1 software , the ML trees were constructed using selected substitution model . The best model substitution for each gene together with the AIC and log likelihood value were listed in Table 1 below.
|Gene||AIC value||Log likelihood value||Best model|
Table 1: Best model for each major viral gene and whole genome sequence.
Whelan and Goldman (WAG) substitution matrix  was selected as the best fit model for gag and env whereas General Reverse Transcriptase (rtREV) model  was selected for pol and whole genome sequences (Table 1). These models were selected based on lowest AIC value and high log likelihood value  when compared to other listed models. This is expected since different genomic regions may have different selective pressures and evolutionary constraint . Apart from that, all positions containing gaps were eliminated. In order to determine the strength of support for the placement of PvEB and other groups of retroviruses, 100 of bootstrap replication were used for each viral gene. Thirty two sequences of established retroviruses genera plus two sequences of established endogenous betaretrovirus (bat and guinea pig) were used as ingroup taxa whereas one gypsy retrotransposon was used as outgroup .
Estimation of integration time of invasion to host genome
The integration time of PvEB in bats was estimated using LTR analysis with the following formula T= (D/R)/2 depicted from  and . The differences between the nucleotides of 5’ and 3’ LTR was abbreviated as ‘D’ whereas the genomic substitution rate was indicated as ‘R’. Both LTRs of PvEB were assumed to be indistinguishable at the time of integration. Through comparisons of both LTRs, the pair differed by 0.007 substitutions per site whereas the rate of neutral substitution for bat has been estimated at ~3.968 × 10-10 .
DNA extraction and PCR amplification
We used polymerase chain reaction (PCR) approach to amplify the reverse transcriptase gene of PvEB and confirm its presence in the Pteropus vampyrus genome. P. vampyrus tissue sample (MZU/M/199) was obtained from Universiti Malaysia Sarawak (UNIMAS) Zoological Museum. Since the tissue was highly degraded, we extracted the genomic DNA using modified method of CTAB extraction and PCR purification . Tissue sample was ground together with CTAB extraction buffer (7.5 g CTAB powder, 1 M Tris, 0.5M EDTA, 5M NaCl and distilled water) and 20 μl of Proteinase K was added into microcentrifuge tube. The sample was incubated at 55˚C for 30 minutes. The extraction solution was centrifuged at 13000rpm for 5 minutes and later the supernatant was transferred into new tube. The supernatant was then treated as per manual in QIAquick PCR Purification Kit. The eluate was used as template in PCR amplifications.
PCR products were generated from 4 ul of DNA template, 1 × PCR buffer, 200 μM dNTPs and 0.25U Taq DNA polymerase (OneTaq). The primers used were designed from pol region of PvEB with its forward primer 5’TTTGAGGGCTTACTTGATTCG3’ and reverse primer 5’TTACATGGATGATATTCTTGTAGC3’ making the final volume of 25 μl. Amplifications were performed on BioRAD thermocycler under the following conditions: 1 cycle of 95˚C for 2 minutes, 35 cycles of 94˚C for 15 seconds, 63˚C for 20 seconds and 72˚C for 2 minutes and 1 cycle of 72˚C for 10 minutes. The PCR product was then separated on 1% agarose gel in 1 × TAE buffer with 1 kb DNA ladder (NorgenBiotek) as DNA marker. Genomic DNA extraction was performed according to the method described in  with few modifications since the tissue sample was ancient and has been preserved in 95% ethanol for quite some time.
Results and Discussion
PvEB is a type-D endogenous betaretrovirus based on genomic organisation
To obtain further insight into the genetic structure of PvEB, we analyzed the full sequence of PvEB from our previous study  and showed the gag, pol and env genes into a summarized schematic diagram (Figure 1) based on their location in open reading frames. All viral genes present in PvEB were included together with their length as in Figure 1a and b.
Figure 1: (A) Genomic structure of PvEB. Panel 1 shows complete genomic organization of PvEB; 5’LTR-gag-dUTPase-pol-env-orf1-3’LTR. Orf 1 was also present in PvEB. Panel II shows the presence of TATA box, Primer binding site (PBS) at the 5’end and polypurine tract (PPT) at the 3’ end. (B) Open reading frame map of PvEB. Full length bar indicates stop codon and half length bar indicates start codon.
A full genome characterisation of PvEB has indicated that it has a typical betaretrovirus genome organization with three distinct features of a type D betaretrovirus. Firstly, PvEB has all the viral gene with gag (744aa), dUTPase-protease (256aa), pol (1120aa) and env (663aa) coding domain. The size is in the range of other type D betaretrovirus such as Squirrel Monkey Retrovirus (SMRV), Mason-Pfizer Monkey Virus (MPMV) and Jaagsiekte Sheep Retrovirus (JSRV) as shown in Table 2 below.
|Virus||Gag||Size (aa) Pol||Env||Total length (bp)|
Table 2: Comparison of viral gene sizes between PvEB and other Betaretrovirus.
Secondly, dUTPase and protease genes are located upstream of pol gene of different frame. The location of dUTPase in PvEB is similar to other type D betaretroviruses such as MPMV and SMRV which is upstream of pol gene. In comparison, lentiviral dUTPase genes are located in the pol coding domain in between the genes encoding reverse transcriptase and integrase.
Studies on the varying dUTPase acquisitions also showed that retroviral dUTPase genes found in protease frame have a monophyletic origin and were acquired by alpha-like retroviruses earlier in evolution, before or during the formation of betaretroviruses, between reverse transcriptase and integrase . In previous study by Baldo and McClure, (1999), betaretroviruses was shown to possess dUTPase at the upstream of protease while the lentiviruses possessed dUTPase in between reverse transcriptase and integrase of pol gene. Additionally, the motifs found in dUTPase were almost congruent to the motifs described in by McGeoch (1990) of dUTPase of betaretroviral families . The five dUTPase motifs found in PvEB are SGTIL, GRAS, SVVDADYEGEIK, RIAQ and RGASNPGSS. dUTPase in betaretrovirus has a fundamental role in DNA metabolism and repair , viability and infectivity of viruses . Additionally, non-primate lentiviruses and betaretroviruses possess dUTPase gene at distinct genomic location where betaretroviruses dUTPase gene located at the upstream of the protease gene  which is in congruent with the result discussed above .
Thirdly, as retroviruses were previously categorized based on the specific tRNA that anneals to their PBS required for initiation of reverse transcription, we determined the specific tRNA used by PvEB. It was found to harbour a PBS complementary to tRNALys2 sequence typical of a type D betaretrovirus.
Other features of PvEb include the pol and env retroviral elements have several stop codons and are thus expected to yield truncated gene product. However, the open reading frames (ORFs) for gag and pro reading frames are intact, suggesting that they might code for functional proteins. The gag ORF contains a predicted retroviral matrix protein (gag p10; 73.3% identity; E-value 1.2e-22) and core nucleocapsid protein (gag p24; 134% identity; E-value 3.9e-40) while the protease region has a trimeric dUTP diphosphate domain (dUTPAse; 61.7% identity; E-value 4.1e-17).
PvEB close to mammalian endogenous type D betaretrovirus based on phylogenetic tree analysis
PvEB grouped within the type D betaretrovirus clade from mammalian (i.e. MPMVand SMRV) with 68% and 99% bootstrap support respectively in phylogenies constructed using both gag and pol sequences (Figure 2a and 2b). Moreover, these findings are consistent with the findings in  that both genes are closely related to SMRV and MPMV, a type D betaretroviruses. Similarly, additional analysis using the full-length sequence of PvEB with other retroviruses also showed significant bootstrap support (74%) for the inclusion of PvEB within the betaretrovirus clade (Figure 2c). However, phylogenetic tree of env gene showed that PvEB is located within the gammaretroviruses with 84% bootstrap support (Figure 2d). This slight different grouping pattern is expected since the env region is highly divergent in nature . This could also be due to recombinant origin, that possess gamma-beta type env . In addition, the relationship of PvEB with other betaretroviruses is in parallel to the findings in  which reported that common vampire bat (Desmodus rotundus) was related to primate and rodent betaretroviruses.
PvEB has diverged from its host since 8.8mya
The integration time of PvEB in bats was estimated using LTR analysis with the following formula T= (D/R)/2 depicted from  and . The differences between the nucleotides of 5’ and 3’ LTR was abbreviated as ‘D’ whereas the genomic substitution rate was indicated as ‘R’. Both LTRs of PvEB were assumed to be indistinguishable at the time of integration. Through comparisons of both LTRs, the pair differed by 0.007 substitutions per site whereas the rate of neutral substitution for bat has been estimated at ~3.968 × 10-10 . Therefore, the estimated age of PvEB genome invasion is approximately 8.8mya. Though the exact invasion of PvERV-βJ occurred more recent that is 6.3mya, the divergence time of PvEB is still congruent to the findings of , whom reported the invasion of endogenous betaretrovirus in the origins of modern bats have occurred within 3.2 to 36.3mya. Nevertheless, when applying the average mammalian neutral substitution rate, 2.292 × 10-9 , PvEB was estimated to have a minimum integration date of 1.5mya. This finding extends the range of time since divergence of bat endogenous betaretrovirus that is from 1.5-36.3mya.
Estimated size of amplified reverse transcriptase gene is 700bp
As shown in Figure 3, the estimated size of PCR product was 700bp. The PCR primer was designed from domain 1 to V of reverse transcriptase gene from our previous bioinformatics work on this sample. This PCR product is in congruent to most endogenous retrovirus and retroelements obtained from previous study.
PvEB possesses a typical type D betaretrovirus genome organization and phylogenetic tree analysis of gag, pol and full-length sequence data placed PvEB robustly within a type D betaretrovirus group. It was estimated that the invasion of bat genome by PvEB dated back from 1.5 to 8.8 mya. PCR amplification of the PvEB reverse transcriptase gene confirmed its presence in the Pteropus vampyrus genome. Therefore, the findings of PvEB using bioinformatics analysis is supported by the phylogenetic analysis and later, the molecular work verified the presence of a type D betaretrovirus in the Pteropus vampyrus genome.
We thank Faculty of Applied Sciences (FSG) and Research Management Institute UiTM for the research grant. This study was supported by research excellence fund (600-RMI/DANA 5/3/RIF (146/2014). We would also like to thank Prof Mohd Tajuddin Abdullah and postgraduate students from Faculty of Resources Sciences and Technology, UNIMAS. The research permit was issued by Sarawak Forestry Department, NCCD.907.4.4 (Jld.8)-52 and Park Permit-339/2014.
- Kurth R, Bannert N (2010) Retroviruses: Molecular Biology, Genomics and Pathogenesis. Caister Academic Press, Norfolk, UK, 72-107.
- Denner J (2010) Endogenous Retrovirus. Retroviruses: Molecular Biology, Genomics and Pathogenesis” Caister Academic Press, Norfolk, UK, Edited by R. Kurth and N. Bannert 35-70.
- Bastian JST, Tanaka K, Anunciado RVP, Natural NG, Sumalde AC (2001) Phylogenetic Relationships among Megachiropteran species from the Two Major Islands of the Philippines, Deduced from DNA Sequences of the Cytochrome b gene. Can J Zool 79: 1671-1677.
- Ryan-R JJ, Guan AKH, Kumaran JV, Esa YB, Sallehin AA (2008) Malaysian Fruit Bats Phylogeny Inferred Using Ribosomal RNA. Pertanika J Trop Agric Sci 31: 67-77.
- Breed AC, Field HE, Smith CS, Edmonston J, Meers J (2010) Bats without borders: Long-distance movements and implications for disease risk management. Eco Health 7: 204-212.
- Halpin K, Young PL, Field HE, Mackenzie JS (2000) Isolation of Hendra virus from pteropid bats: a natural reservoir of Hendra virus. J Gen Virol 81: 1927-1932.
- Yob JM, Field H, Rashdi AM, Morrissy C, vanderHeide B (2001) Nipah virus Infection in Bats (order Chiroptera) in Peninsular Malaysia. Emerg Infect Dis 7: 439-441.
- Rahman SA, Hassan L, Epstein JH, Mamat ZC, Yatim AM (2013) Risk Factors for Nipah virus infection among pteropid bats, Peninsular Malaysia”. Emerg Infect Dis 19: 51-60.
- Leroy EM, Kumulungui B, Pourrut X, Rouquet P, Hassanin A, et al. (2005) Fruit bats as reservoirs of Ebola virus. Nature 438: 575-576.
- Cui J, Tachedjian M, Wang L, Tachedjian G, Wang LF, et al. (2012) Discovery of retroviral homologs in bats: implications for the origin of mammalian gammaretroviruses. J Virol 86: 4288-4293.
- Hayward JA, Tachedjian M, Cui J, Field H, Holmes EC, et al. (2013) Identification of Diverse Full-Length Endogenous Betaretroviruses in Megabats and Microbats. Retrovirology 10: 35.
- Coffin JM, Hughes SH, Varmus HE (1997) Purification, Composition, and Morphology of Virions, Retroviruses, Cold Spring Harbor Laboratory Press.
- Posada D (2003) Selecting models of evolution. The phylogenetic handbook: A practical approach to DNA and protein phylogeny. Cambridge University Press, Cambridge 256-282.
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M (2011) MEGA5: Molecular Evolutionary Genetics Analysis using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods. Mol Biol Evol 28: 2731-2739.
- Whelan S, Goldman N (2001) A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol 18: 691-699.
- Cui J, Holmes EC (2012) Endogenous lentiviruses in the ferret genome. J Virol 86: 3383-3385.
- Henzy JE, Johnson WE (2013) Pushing the endogenous envelope. Philos Trans R Soc Lond B Biol Sci 368: 20120506.
- Dimmic MW, Rest JS, Mindell DP, Goldstein RA (2002) rtREV: An amino acid substitution matrix for inference of retrovirus and reverse transcriptase phylogeny. J Mol Evol 55: 65-73.
- Escalera-Zamudio M, Mendoza MLZ, Heeger F, Loza-Rubio E, Rojas-Anaya E (2015) A novel endogenous betaretrovirus in the common vampire bat (Desmodus rotundus) suggests multiple independent infection and cross-species transmission events. J Virol 89: 5180-5184.
- Yang DY, Eng B, Waye JS, Dudar JC, Saunders SR (1998) Technical Note: Improved DNA Extraction from Ancient Bones Using Silica-Based Spin Columns. Am J Phys Anthropol 105: 539-543.
- Razlan NFA, Kambol R (2014) Computer Based Characterisation of Pteropus vampyrus Endogenous Betaretrovirus (PvEB) from Bat Genome Database, Computer Applications and Industrial Electronics (ISCAIE) Symposium, IEEE 6-11.
- Jern P, Sperber GO, Blomberg J (2005) Use of Endogenous Retroviral Sequences (ERVs) and Structural Markers for Retroviral Phylogenetic Inference and Taxonomy. Retrovirology 2: 50.
- McGeoch DJ (1990) Protein Sequence Comparisons Show that the Pseudoproteases Encoded by Poxyviruses and Certain Retroviruses belong to dUTP ase Family. Nucleic Acids Res 18: 4105-4110.
- Barabas O, Rumlova M, Erdei A, Pongracz V, Pichova I (2003) dUTPase and Nucleocapsid Polypeptides of the Mason-Pfizer monkey Virus Form A Fusion Protein in the Virion with Homotrimeric Organization and Low Catalytic Efficiency. J Biol Chem 278: 38803-38812.
- Pongracz-NV, Barabas O, Fuxreiter M, Simon I, Pichova I, et al. (2007) Flexible Segments Modulate Co-Folding of dUTPase and Nucleocapsid Proteins. Nucleic Acids Res 35: 495-505.
- Baldo AM, McClure MA (1999) Evolution and horizontal transfer of dUTPase-encoding genes in viruses and their hosts. J Virol 73: 7710-7721.
- Keckesova Z, Ylinen LMJ, Towers GJ, Gifford RJ, Katzourakis A (2009) Identification of a RELIK orthologue in the European hare (Lepus europaeus) reveals a minimum age of 12 million years for the lagomorph lentiviruses. Virology 384: 7-11.
- Katzourakis A, Tristem M, Pybus OG, Gifford RJ (2007) Discovery and analysis of the first endogenous lentivirus. Proc Natl Acad Sci 104: 6261-6265.
- Kumar S, Subramanian S (2002) Mutation Rates in Mammalian Genomes. Proc Natl Acad Sci USA 99: 803-808.
Citation: Kambol R, Mahandran NFAR, Yunus NM (2018) In Silico Characterisation of a Type D Betaretrovirus from Malayan Flying Fox, Pteropus vampyrus. J Phylogenetics Evol Biol 6:196. Doi: 10.4172/2329-9002.1000196
Copyright: © 2018 Kambol R, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Select your language of interest to view the total content in your interested language
Share This Article
- Total views: 126
- [From(publication date): 0-2018 - Jun 21, 2018]
- Breakdown by view type
- HTML page views: 107
- PDF downloads: 19