Recent Innovation Infomation Technology| OMICS International | Journal Of Information Technology And Software Engineering

Our Group organises 3000+ Global Conferenceseries Events every year across USA, Europe & Asia with support from 1000 more scientific societies and Publishes 700+ Open Access Journals which contains over 50000 eminent personalities, reputed scientists as editorial board members.


The deoxyribonucleic acid (DNA) constitutes the physical medium in which all properties of living organisms are encoded. The understanding of its sequence is primary concern in molecular biology. Some important molecular biology databases (ERIBL, GenBank, DDJB) are developed around the world to accumulate nucleotide sequences (DNA, RNA) and amino-acid sequences of proteins. It is well acknowledged that their size increases nowadays exponentially fast. Not as big yet as some other scientific databases, their size is in hundreds of GB. For complete genomes, these texts can be very elongated. The human genome for example contains three billions characters over twenty-three pairs of chromosomes. It contains all the genetic substance of the human beings. With escalating number of genome sequences being made available, the difficulty of storing and using databases has to be addressed. The compression of genetic information as a result constitutes a very important job. Another factor which is also to be considered is the prediction of certain kind of disease by applying the searching a pattern in the compressed domain. The importance of common compressibility for identifying patterns of interest from genomes is recognized and it is also established that compressibility is a well dimension of relatedness among sequences. Many compression algorithms use the characteristics of DNA like point mutation or reverse complement to achieve a god compression rate. Many compression methods have been exposed to compress DNA sequences. Invariably, all the compression methods instigate so far take benefit of the fact that DNA sequences are made of only four alphabets, together with schemes to exploit the repetitive nature of DNA. Some methods have the capability to search the disease but none of the methods investigate the prediction of diseases by searching a pattern in compressed DNA sequences. (Prediction of Disease in Compressed DNA Sequences: An Open Problem, Ashutosh Gupta)
  • Share this page
  • Facebook
  • Twitter
  • LinkedIn
  • Google+
  • Pinterest
  • Blogger

Last date updated on January, 2021