Kombucha origin clustering based on 16S metabarcoding datasets analysis
DOI:
https://doi.org/10.21580/jnsmr.v11i2.23949Keywords:
Bioinformatics, Fermented Product, Bootstrapped Hierarchical Clustering, Microbiome, Product OriginAbstract
Consumers of fermented products increasingly demand detailed information on product ingredients, quality, health benefits, and origin. Herein, we have chosen kombucha as a model for a fermented product. This study aims to establish the origin information of kombucha using clustering analysis of 16S metabarcoding datasets. We have downloaded and analysed datasets of kombucha 16S metabarcoding originating from 5 distinct places: Brazil, the United States, the United Kingdom, Turkey, and Thailand. We randomly selected datasets from the collection (n = 32) and analyzed them on the SHAMAN server to develop an initial microbiome profile. We implemented hierarchical agglomerative Clustering and found that Ward's method and the Chao distance produced the best cluster tree, which consistently separates kombucha into five distinct clades, reflecting their origin. We have extended our examination to include more datasets (n=13) to build the final cluster tree (total n = 45). We have also assessed the uncertainty of the final Clustering by pvclust in R. The pvclust cluster tree is comparable in topology to the final cluster tree built using Ward's method and Chao distance. The pvclust cluster tree features stable clades that are highly supported by AU (Approximately Unbiased) values (p-value ≥ 95%). Each kombucha was also placed correctly and consistently according to its respective origin. We have successfully conducted analyses and demonstrated that a simple clustering method, combining Ward's method and the Chao distance, is the most effective for classifying kombucha by origin using a 16S metabarcoding dataset.
Downloads
References
Alotaibi, K. (2015). Non-metric multi-dimensional scaling for distance-based privacy-preserving data mining [Doctoral Thesis, University of East Anglia]. https://ueaeprints.uea.ac.uk/id/eprint/52228
Chiu, C.-H., & Chao, A. (2014). Distance-based functional diversity measures and their decomposition: A framework based on hill numbers. PLoS ONE, 9(7), e100014. https://doi.org/10.1371/journal.pone.0100014
Dash, S., Shakyawar, S. K., Sharma, M., & Kaushik, S. (2019). Big data in healthcare: Management, analysis and future prospects. Journal of Big Data, 6(1), 54. https://doi.org/10.1186/s40537-019-0217-0
Dechakhamphu, A., Wongchum, N., Chumroenphat, T., Tanomtong, A., Pinlaor, S., & Siriamornpun, S. (2023). In vitro and in vivo evaluation for antioxidant and anti-diabetic properties of cyperus rotundus l. Kombucha. Foods, 12(22), 4059. https://doi.org/10.3390/foods12224059
Gower, J. C. (2014). Principal coordinates analysis. In R. S. Kenett, N. T. Longford, W. W. Piegorsch, & F. Ruggeri (Eds.), Wiley StatsRef: Statistics Reference Online (1st ed.). Wiley. https://doi.org/10.1002/9781118445112.stat05670
Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: A review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2065), 20150202. https://doi.org/10.1098/rsta.2015.0202
Júnior, J. C. D. S., Meireles Mafaldo, Í., De Lima Brito, I., & Tribuzy De Magalhães Cordeiro, A. M. (2022). Kombucha: Formulation, chemical composition, and therapeutic potentialities. Current Research in Food Science, 5, 360–365. https://doi.org/10.1016/j.crfs.2022.01.023
Kaashyap, M., Cohen, M., & Mantri, N. (2021). Microbial diversity and characteristics of kombucha as revealed by metagenomic and physicochemical analysis. Nutrients, 13(12), 4446. https://doi.org/10.3390/nu13124446
Laavanya, D., Shirkole, S., & Balasubramanian, P. (2021). Current challenges, applications and future perspectives of SCOBY cellulose of Kombucha fermentation. Journal of Cleaner Production, 295, 126454. https://doi.org/10.1016/j.jclepro.2021.126454
Landis, E. A., Fogarty, E., Edwards, J. C., Popa, O., Eren, A. M., & Wolfe, B. E. (2022). Microbial diversity and interaction specificity in kombucha tea fermentations. mSystems, 7(3), e00157-22. https://doi.org/10.1128/msystems.00157-22
Li, C., Chen, Y., & Shang, Y. (2022). A review of industrial big data for decision making in intelligent manufacturing. Engineering Science and Technology, an International Journal, 29, 101021. https://doi.org/10.1016/j.jestch.2021.06.001
Martínez Leal, J., Valenzuela Suárez, L., Jayabalan, R., Huerta Oros, J., & Escalante-Aburto, A. (2018). A review on health benefits of kombucha nutritional compounds and metabolites. CyTA - Journal of Food, 16(1), 390–399. https://doi.org/10.1080/19476337.2017.1410499
Murtagh, F., & Legendre, P. (2014). Ward’s hierarchical agglomerative clustering method: Which algorithms implement ward’s criterion? Journal of Classification, 31(3), 274–295. https://doi.org/10.1007/s00357-014-9161-z
Ricotta, C., & Podani, J. (2017). On some properties of the Bray-Curtis dissimilarity and their ecological meaning. Ecological Complexity, 31, 201–205. https://doi.org/10.1016/j.ecocom.2017.07.003
Shimodaira, H. (2002). An approximately unbiased test of phylogenetic tree selection. Systematic Biology, 51(3), 492–508. https://doi.org/10.1080/10635150290069913
Shimodaira, H. (2004). Approximately unbiased tests of regions using multistep-multiscale bootstrap resampling. The Annals of Statistics, 32(6). https://doi.org/10.1214/009053604000000823
Suzuki, R., & Shimodaira, H. (2006). Pvclust: An R package for assessing the uncertainty in hierarchical clustering. Bioinformatics, 22(12), 1540–1542. https://doi.org/10.1093/bioinformatics/btl117
Volant, S., Lechat, P., Woringer, P., Motreff, L., Campagne, P., Malabat, C., Kennedy, S., & Ghozlane, A. (2020). SHAMAN: A user-friendly website for metataxonomic analysis from raw reads to statistical analysis. BMC
Bioinformatics, 21(1), 345. https://doi.org/10.1186/s12859-020-03666-4
Xia, X., Dai, Y., Wu, H., Liu, X., Wang, Y., Yin, L., Wang, Z., Li, X., & Zhou, J. (2019). Kombucha fermentation enhances the health-promoting properties of soymilk beverage. Journal of Functional Foods, 62, 103549. https://doi.org/10.1016/j.jff.2019.103549
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Journal of Natural Sciences and Mathematics Research

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Copyright
The copyright of the received article shall be assigned to the publisher of the journal. The intended copyright includes the right to publish the article in various forms (including reprints). The journal maintains the publishing rights to published articles. Authors are allowed to use their articles for any legal purposes deemed necessary without written permission from the journal, but with an acknowledgment to this journal of initial publication.
Licensing
