Application of the Smith Waterman and Jukes Cantor Algorithm in the Arrangement of the SARS CoV-2 Virus

Tony Yulianto*    -  Department of Mathematics, Universitas Islam Madura, Indonesia
Mohamad Tafrikan    -  Department of Mathematics, Universitas Islam Negeri Walisongo Semarang, Indonesia
Rica Amalia    -  Department of Mathematics, Universitas Islam Madura, Indonesia
Emi Yunita    -  Department of Midwifery, Universitas Islam Madura, Indonesia
Moch. Haikal    -  Department of Biology Education, Universitas Islam Madura, Indonesia
Fathorrozi Ariyanto  -  Department of Informatics Engineering, Universitas Islam Madura, Indonesia
Zuhrotul Hasanah  -  Department of Mathematics, Universitas Islam Madura, Indonesia

(*) Corresponding Author

In early 2020, the world was shocked by an outbreak of a new pneumonia that started in Wuhan, Hubei Province, which then spread rapidly to more than 190 countries and territories. This outbreak was named coronavirus disease 2019 (COVID-19) caused by Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2). The spread of this disease has had a wide social and economic impact. There is still a lot of controversy surrounding this disease, including in the aspects of diagnosis, treatment, and prevention. Therefore, a study was carried out on studies related to COVID-19 that have been widely published since the beginning of 2020 until the end of March 2020.So to overcome this problem, the Smith Waterman Jukes Cantor Algorithm was made to align Covid19 by taking the a pair of DNA and RNA sequencesto align protein sequences. From this alignment, the percentage of identical and mutations will be known. The identical percentage in the genetic code will prove that although the symptoms caused by the disease are almost the same, the protein sequences are not necessarily the same. Based on the simulation results of the distance between sequences that produce a phylogenetic tree using the jukes cantor method, it was obtained that 4 groups of 26 sequences were divided into groups, namely, group 1 consists of 16 sequences, group 2 consists of 6 sequences, group 3 consists of 2 sequences, group 4 consists of 2 sequences. Based on these groups, it turns out that the China Wuhan sequence (sequence MT291826) is located in group 1 and other countries that are almost similar to the sequence in China Wuhan, namely the country of Timoe Leste with the sequence MT641766 also located in group 1.

©2022 JNSMR UIN Walisongo. All rights reserved.

Keywords: Covid19; DNA and RNA; Jukes Cantor; Smith Waterman Algorithm

  1. Y. S. Ismail, Febrian, C. Yulvizar and R. Ramadhani, "Identification Of The Bacterium Isolate From Mackerel Fish (Rastrelliger sp.) Using 16S rRNA Gene," IOP Conference Series: Earth and Environmental Science, 2019.
  2. . Sundari and . Khadijah, "The Application Of Barcode DNA RbcL Gene For Identification Of Medicinal," IOP Conf. Series: Journal of Physics: Conf. Series, 2019.
  3. C. Kirana and Samsu, "The Effect of Climate On The Outbreak Of Covid-19: A Review," IOP Conf. Series: Earth and Environmental Science, 2021.
  4. V. Gallego, H. Nishiura, R. Sah and A. J. R. Morales, "The COVID-19 outbreak and implications for the Tokyo 2020 Summer Olympic Games," Travel Med Infect Dis, vol. 34, no. 101604, 2020.
  5. M. Gupta, A. Abdelmaksoud, M. Jafferany, T. Lotti, R. Sadoughifar and M. Goldust, "COVID-19 and economy," Dermatologic Therapy, p. 1, 2020.
  6. W. C. W. Chan, "Nano Research for COVID-19," ACS Nano, vol. 14, no. 4, pp. 3719-3720, 2020.
  7. . World Health Organization, COVID-19 Weekly Epidemiological Update, All The World: National Authorities, 2020.
  8. A. R. Poetsch, "The genomics of oxidative DNA damage, repair, and resulting mutagenesis," Computational and Structural Biotechnology Journal, vol. 18, pp. 207-219, 2020.
  9. T. R. F. Smith, A. Patel, S. Ramos, D. Elwood, X. Zhu, J. Yan, E. N. Gary, S. N. Walker, K. Schultheis, M. Purwar, . Z. Xu, J. Walters, P. Bhojnagarwala, M. Yang, . N. Chokkalingam, P. Pezzoli, E. Parzych, E. L. Reuschel, A. Doan, N. Tursi, M. Vasquez, . J. Choi, E. T. Ruiz, I. Maricic, . M. A. Bah, Y. Wu, D. Amante, D. H. Park, Y. Dia, A. R. Ali, F. I. Zaidi, A. Generotti, K. Y. Kim, T. A. Herring, S. Reeder, V. M. Andrade, K. Buttigieg, G. Zhao, . J.-M. Wu, D. Li, L. Bao, J. Liu, W. Deng, C. Qin, A. S. Brown, M. Khoshnejad, N. Wang, J. Chu, D. Wrapp, J. S. McLellan, K. Muthumani, B. Wang, M. W. Carroll, J. J. Kim, J. Bover, D. W. Kulp, L. M. P. F. Humeau, D. B. Weiner and K. E. Broderick, "Immunogenicity of a DNA vaccine candidate for COVID-19," Nature Communications, vol. 11, no. 2601, pp. 1-13, 2020.
  10. Y. Kwon, J. M. Daley and P. Sung, "Reconstituted System for the Examination of Repair DNA Synthesis in Homologous Recombination," Methods in Enzymology, vol. 591, pp. 307-325, 2017.
  11. A. M. Fleming, Y. Ding and C. J. Burrows, "Sequencing DNA for the Oxidatively Modified Base 8-Oxo-7,8-Dihydroguanine," Methods in Enzymology, vol. 591, pp. 187-210, 2017.
  12. Y. Zhang, J. Wu, M. Li, J. Lin and Z. Wang, "A Three-Level Scoring System for Fast Similarity Evaluation Based on Smith-Waterman Algorithm," 2020 IEEE International Symposium on Circuits and Systems (ISCAS), pp. 1-5, 2020.
  13. . Alhadi, G. Ardaneswari, H. Tasman and D. Lestari, "Performance evaluation of fast smith-waterman algorithm for sequence database searches using CUDA GPU-based parallel computing," Journal of Next Generation Information Technology, vol. 5, no. 2, pp. 38-46, 2014.
  14. Z. Xia, Y. Cui, A. Zhang, T. Tang, L. Peng, C. Huang, C. Yang and X. Liao, "A Review of Parallel Implementations for the Smith–Waterman Algorithm," Interdisciplinary Sciences: Computational Life Sciences, pp. 1-14, 2021.
  15. R. Barnes, A Review of the Smith-Waterman GPU Landscape, Berkeley: Electrical Engineering and Computer Sciences University of California, 2020, pp. 1-23.
  16. L. Li, J. Lin and Z. Wang, "PipeBSW: A Two-Stage Pipeline Structure for Banded Smith-Waterman Algorithm on FPGA," 2021 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), 2021.
  17. K. Hammad, Z. Wu, E. G. Zadeh and S. Magierowski, "A Scalable Hardware Accelerator for Mobile DNA Sequencing," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 29, no. 2, pp. 273 - 286, 2021.
  18. M. G. Awan, J. Deslippe, A. Buluc, O. Selvitopi, S. Hofmeyr, L. Oliker and K. Yelick, "ADEPT: a domain independent sequence alignment strategy for gpu architectures," BMC Bioinformatics, vol. 21, no. 406, pp. 1-29, 2020.
  19. M. J. Pallen, "Microbial Bioinformatics 2020," Microbial Biotechnology, vol. 9, no. 5, pp. 681-686, 2016.
  20. M. I. Irawan, I. Mukhlash, A. Rizky and A. R. Dewi, "Application of Needleman-Wunch Algorithm To," IOP Conf. Series: Journal of Physics: Conf. Series, 2019.
  21. R. A. Purba, S. Suparno and M. Giatman, "The Optimalization of Cosine Similarity Method in Detecting Similarity," IOP Conf. Series Materials Science and Engineering, 2020.
  22. D. Rahmalia, T. Herlambang, A. M. Rohmah and A. Muhith, "Weights Optimization Using Firefly Algorithm On," Journal of Physics Conference Series, 2020.
  23. K. N. Goswami and K. A. Srivastav, "Mathematical Modeling of Zika Virus Diasease With Non Linear Incidence and Optimal Control," IOP Conf. Series: Journal of Physics: Conf. Series, 2018.
  24. Q. Zou, G. Lin, X. Jiang, X. Liu and X. Zeng, "Sequence clustering in bioinformatics: an empirical study," Briefings in Bioinformatics, vol. 21, no. 1, pp. 1-10, 2018.
  25. J. J. Davis, . A. . R. Wattam, . R. K. Aziz, T. Brettin, R. Butler, R. M. Butler, . P. Chlenski, N. Conrad, A. Dickerman, E. M. Dietrich, J. L. Gabbard, S. Gerdes, A. Guard, R. W. Kenyon, D. Machi, C. Mao, . D. M. Olson, M. Nguyen, E. K. Nordberg, G. J. Olsen, R. D. Olson, . J. C. Overbeek, . R. Overbeek, B. Parrelloh, G. D. Pusch, M. Shukla, C. Thomas, M. VanOeffelen, V. Vonstein, A. S. Warren, F. Xia, D. Xie, H. Yoo and R. Stevens, "The PATRIC Bioinformatics Resource Center: expanding data and analysis capabilities," Nucleic Acids Research, vol. 48, no. 1, p. 606–612, 2020.
  26. F. Gabler, S. Z. Nam, S. Till, M. Mirdita, M. Steinegger, J. Söding, A. N, L. and V. Alva, "Protein Sequence Analysis Using the MPI Bioinformatics Toolkit," Current Protocols in Bioinformatics, vol. 72, no. 108, pp. 1-30, 2020.
  27. A. Poran, D. Harjanto, M. Malloy, C. M. Arieta, D. A. Rothenberg, D. Lenkala, M. M. v. Buuren, T. A. Addona, M. S. Rooney, L. Srinivasan and R. B. Gaynor, "Sequence-based prediction of SARS-CoV-2 vaccine targets using a mass spectrometry-based bioinformatics predictor identifies immunogenic T cell epitopes," Genome Medicine, vol. 12, no. 70, pp. 1-15, 2020.
  28. B. Robson, "Computers and viral diseases. Preliminary bioinformatics studies on the design of a synthetic vaccine and a preventative peptidomimetic antagonist against the SARS-CoV-2 (2019-nCoV, COVID-19) coronavirus," Computers in Biology and Medicine, vol. 119, no. 103670, pp. 1-19, 2020.
  29. B. Xu, C. Li, H. Zhuang, J. Wang, Q. Wang and X. Zhou, "Efficient Distributed Smith-Waterman Algorithm Based on Apache Spark," 2017 IEEE 10th International Conference on Cloud Computing (CLOUD), pp. 608-615, 2017.
  30. Y. Liu, T.-T. Tran, F. Lauenroth and B. Schmidt, "SWAPHI-LS: Smith-Waterman Algorithm on Xeon Phi coprocessors for Long DNA Sequences," 2014 IEEE International Conference on Cluster Computing (CLUSTER), pp. 257-265, 2014.
  31. S. K. Zahid, L. Hasan, A. A. Khan and S. Ullah, "A novel structure of the Smith-Waterman Algorithm for efficient sequence alignment," 2015 Third International Conference on Digital Information, Networking, and Wireless Communications (DINWC), pp. 6-9, 2015.
  32. F. Muhamad, R. Ahmad, S. Asi and M. Murad, "Performance Analysis Of Needleman-Wunsch Algorithm (Global) And Smith-Waterman Algorithm (Local) In Reducing Search Space And Time For Dna Sequence Alignment," Journal of Physics: Conference Series, vol. 1019, no. 012085, pp. 1-8, 2018.
  33. S. A. M. A. Junid, M. F. M. Idros, A. H. A. Razak, F. N. Osman and N. M. Tahir, "Parallel processing cell score design of linear gap penalty smith-waterman algorithm," 2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA), pp. 299-302, 2017.
  34. S. Röhling, A. Linne, J. Schellhorn, M. Hosseini, T. Dencker and B. Morgenstern, "The number of k-mer matches between two DNA sequences as a function of k and applications to estimate phylogenetic distances," PLOS ONE, pp. 1-18, 2020.
  35. P. K. Pandey, Y. S. Singh, P. S. Tripathy, R. Kumar, S. K. Abujam and J. Parhi, "DNA barcoding and phylogenetics of freshwater fish fauna of Ranganadi River, Arunachal Pradesh," Gene, vol. 754, no. 144860, pp. 1-28, 2020.
  36. J. Yavarian, N. Z. S. Jandaghi, K. Sadeghi, S. S. Malekshahi, V. Salimi, A. Nejati, F. A. Minejad, N. Ghavvami, F. Saadatmand, S. Mahfouzi, G. Fateminasab, N. Parhizgari, A. Ahmadi, K. Razavi, S. Ghabeshi, M. Saberian, E. Zanjani, F. Namazi, T. Shahbazi, F. Rezaie, H. Erfani, M. M. Gouya, M. N. Dadras and T. M. Azad, "First Cases of SARS-CoV-2 in Iran, 2020: Case Series Report," Iran Journal Public Health, vol. 49, no. 8, pp. 1564-1568, 2020.
  37. S. Awasthi, A. K. Mahadani, G. Sanyal and P. Bhattacharjee, "Modified indel treatment for accurate Phylogenetic Tree construction," 2020 International Conference on Computation, Automation and Knowledge Management (ICCAKM), 2020.
  38. J. Rusinko and M. McPartlon, "Species tree estimation using Neighbor Joining," Journal of Theoretical Biology, vol. 414, no. 7, pp. 5-7, 2017.
  39. T. Le, A. Sy, E. K. Molloy, Q. Zhang, S. Rao and T. Warnow, "Using Constrained-INC for Large-Scale Gene Tree and Species Tree Estimation," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 18, no. 1, pp. 2-15, 2020.
  40. H. Prasetya, Performance Comparison Between Kimura 2-Parameters and Jukes-Cantor Model in Constructing Phylogenetic Tree of Neighbour Joining (NJ), Bogor: IPB (Bogor Agricultural University), 2011.

Open Access Copyright (c) 2022 Journal of Natural Sciences and Mathematics Research
Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Journal of Natural Sciences and Mathematics Research
Published by Faculty of Science and Technology
Universitas Islam Negeri Walisongo Semarang

Jl Prof. Dr. Hamka Kampus III Ngaliyan Semarang 50185
Website: https://journal.walisongo.ac.id/index.php/JNSMR
Email:jnsmr@walisongo.ac.id

ISSN: 2614-6487 (Print)
ISSN: 2460-4453 (Online)

View My Stats

Lisensi Creative Commons

This work is licensed under a Creative Commons Lisensi Creative Commons .

apps