Please use this identifier to cite or link to this item: https://repository.cihe.edu.hk/jspui/handle/cihe/2302
DC Field | Value | Language
dc.contributor.author | Siu, Wan Chi | en_US
dc.contributor.other | Cheng, K.-O. | -
dc.contributor.other | Wu, P. | -
dc.contributor.other | Law, N.-F. | -
dc.date.accessioned | 2022-02-17T05:49:00Z | -
dc.date.available | 2022-02-17T05:49:00Z | -
dc.date.issued | 2015 | -
dc.identifier.uri | https://repository.cihe.edu.hk/jspui/handle/cihe/2302 | -
dc.description.abstract | Traditionally, intra-sequence similarity is exploited for compressing a single DNA sequence. Recently, remarkable compression performance for an individual DNA sequence from the same population has been achieved by encoding its difference from a nearly identical reference sequence. Nevertheless, there is a lack of general algorithms that also allow less similar reference sequences. In this work, we extend intra-sequence similarity to inter-sequence similarity, in that approximate matches of subsequences are found between the DNA sequence and a set of reference sequences. Hence, a set of nearly identical DNA sequences from the same population, or a set of partially similar DNA sequences such as chromosome sequences and DNA sequences of related species, can be compressed together. For practical compressors, the compressed size is usually influenced by the compression order of the sequences. Fast search algorithms for the optimal compression order are thus developed for multiple-sequence compression. Experimental results on artificial and real datasets demonstrate that our proposed multiple-sequence compression methods with fast compression order search achieve good compression performance under different levels of similarity among the multiple DNA sequences. | en_US
dc.language.iso | en | en_US
dc.publisher | IEEE | en_US
dc.relation.ispartof | IEEE/ACM Transactions on Computational Biology and Bioinformatics | en_US
dc.title | Compression of multiple DNA sequences using intra-sequence and inter-sequence similarities | en_US
dc.type | journal article | en_US
dc.identifier.doi | 10.1109/TCBB.2015.2403370 | -
dc.contributor.affiliation | School of Computing and Information Sciences | en_US
dc.relation.issn | 1557-9964 | en_US
dc.description.volume | 12 | en_US
dc.description.issue | 6 | en_US
dc.description.startpage | 1322 | en_US
dc.description.endpage | 1332 | en_US
dc.cihe.affiliated | No | -
item.languageiso639-1 | en | -
item.fulltext | No Fulltext | -
item.openairetype | journal article | -
item.grantfulltext | none | -
item.openairecristype | http://purl.org/coar/resource_type/c_6501 | -
item.cerifentitytype | Publications | -
crisitem.author.dept | Yam Pak Charitable Foundation School of Computing and Information Sciences | -
crisitem.author.orcid | 0000-0001-8280-0367 | -
Appears in Collections: CIS Publication

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.