International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 76 - Issue 15
Published: August 2013
Authors: Moin Mahmud Tanvee, Shaikh Jeeshan Kabeer, Tareque Mohmud Chowdhury, Asif Ahmed Sarja, Md. Tayeb Hasan Shuvo
Moin Mahmud Tanvee, Shaikh Jeeshan Kabeer, Tareque Mohmud Chowdhury, Asif Ahmed Sarja, Md. Tayeb Hasan Shuvo. Mining Maximal Adjacent Frequent Patterns from DNA Sequences using Location Information. International Journal of Computer Applications. 76, 15 (August 2013), 26-32. DOI=10.5120/13322-0819
@article{ 10.5120/13322-0819, author = { Moin Mahmud Tanvee,Shaikh Jeeshan Kabeer,Tareque Mohmud Chowdhury,Asif Ahmed Sarja,Md. Tayeb Hasan Shuvo }, title = { Mining Maximal Adjacent Frequent Patterns from DNA Sequences using Location Information }, journal = { International Journal of Computer Applications }, year = { 2013 }, volume = { 76 }, number = { 15 }, pages = { 26-32 }, doi = { 10.5120/13322-0819 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2013 %A Moin Mahmud Tanvee %A Shaikh Jeeshan Kabeer %A Tareque Mohmud Chowdhury %A Asif Ahmed Sarja %A Md. Tayeb Hasan Shuvo %T Mining Maximal Adjacent Frequent Patterns from DNA Sequences using Location Information %J International Journal of Computer Applications %V 76 %N 15 %P 26-32 %R 10.5120/13322-0819 %I Foundation of Computer Science (FCS), NY, USA
The rapid development of bioinformatics has resulted in an explosion of DNA sequence data, which is characterized by a large number of items. Studies have shown that biological functions are dictated by contiguous portions of the DNA sequence. Finding contiguous frequent patterns in long data sequences such as DNA sequences is a particularly challenging task and can pave the way towards new breakthroughs. Apriori-based techniques were among the first to be used in contiguous frequent pattern mining. Improved approaches such as GSP and PrefixSpan were applied later, but they either required a large number of sequence scans, generated a large number of candidates, or produced a large number of intermediate sequential patterns. This paper proposes an improvement of the positional-based approach to contiguous frequent pattern mining in DNA sequences. The proposed algorithm improves the existing positional-based approach by introducing a new amalgamated sorting and joining technique that helps to reduce time and space complexity. The proposed approach outperforms existing contiguous frequent pattern mining approaches.
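To make the idea of mining with location information concrete, the following is a minimal Python sketch of a generic position-based miner. It is not the algorithm proposed in the paper and does not include its amalgamated sorting and joining technique. The function names `mine_contiguous_patterns` and `maximal_patterns` are hypothetical, and the sketch assumes that support means the number of (possibly overlapping) occurrences within a single sequence.

```python
from collections import defaultdict


def mine_contiguous_patterns(sequence, min_support):
    """Map each frequent contiguous pattern to its sorted start positions.

    Assumption: support = number of (possibly overlapping) occurrences
    of the pattern in one sequence.
    """
    # Length-1 patterns: one scan records every start position per symbol.
    positions = defaultdict(list)
    for i, symbol in enumerate(sequence):
        positions[symbol].append(i)
    current = {p: locs for p, locs in positions.items()
               if len(locs) >= min_support}
    frequent = dict(current)

    # Grow each frequent pattern by one symbol, visiting only the
    # positions already stored for it rather than rescanning the sequence.
    while current:
        extended = defaultdict(list)
        for pattern, locs in current.items():
            k = len(pattern)
            for start in locs:
                nxt = start + k
                if nxt < len(sequence):
                    extended[pattern + sequence[nxt]].append(start)
        current = {p: locs for p, locs in extended.items()
                   if len(locs) >= min_support}
        frequent.update(current)
    return frequent


def maximal_patterns(frequent):
    """Keep only patterns that are not part of a longer frequent pattern."""
    return sorted(p for p in frequent
                  if not any(p != q and p in q for q in frequent))


if __name__ == "__main__":
    found = mine_contiguous_patterns("ATGCATGCAT", min_support=2)
    print(maximal_patterns(found))   # ['ATGCAT']
    print(found["ATGCAT"])           # [0, 4]
```

Because every extension step reads only the stored start positions, the sequence itself is scanned in full once. The maximality filter here is a simple quadratic check, chosen for readability rather than speed.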