Kombucha origin clustering based on 16S metabarcoding datasets analysis

Authors

  • Imam Bagus Nugroho Universitas Gadjah Mada, Indonesia https://orcid.org/0000-0002-3252-4920
  • Darmawan Ari Nugroho Universitas Gadjah Mada, Indonesia
  • Abdul Rahman Siregar Universitas Gadjah Mada, Indonesia

DOI:

https://doi.org/10.21580/jnsmr.v11i2.23949

Keywords:

Bioinformatics, Fermented Product, Bootstrapped Hierarchical Clustering, Microbiome, Product Origin

Abstract

Consumers of fermented products increasingly demand detailed information on product ingredients, quality, health benefits, and origin. Herein, we have chosen kombucha as a model for a fermented product. This study aims to establish the origin information of kombucha using clustering analysis of 16S metabarcoding datasets. We have downloaded and analysed datasets of kombucha 16S metabarcoding originating from 5 distinct places: Brazil, the United States, the United Kingdom, Turkey, and Thailand. We randomly selected datasets from the collection (n = 32) and analyzed them on the SHAMAN server to develop an initial microbiome profile. We implemented hierarchical agglomerative Clustering and found that Ward's method and the Chao distance produced the best cluster tree, which consistently separates kombucha into five distinct clades, reflecting their origin. We have extended our examination to include more datasets (n=13) to build the final cluster tree (total n = 45). We have also assessed the uncertainty of the final Clustering by pvclust in R. The pvclust cluster tree is comparable in topology to the final cluster tree built using Ward's method and Chao distance. The pvclust cluster tree features stable clades that are highly supported by AU (Approximately Unbiased) values (p-value ≥ 95%). Each kombucha was also placed correctly and consistently according to its respective origin. We have successfully conducted analyses and demonstrated that a simple clustering method, combining Ward's method and the Chao distance, is the most effective for classifying kombucha by origin using a 16S metabarcoding dataset.

Downloads

Download data is not yet available.

Author Biography

Imam Bagus Nugroho, Universitas Gadjah Mada

Department of Agroindustrial Technology

References

Alotaibi, K. (2015). Non-metric multi-dimensional scaling for distance-based privacy-preserving data mining [Doctoral Thesis, University of East Anglia]. https://ueaeprints.uea.ac.uk/id/eprint/52228

Chiu, C.-H., & Chao, A. (2014). Distance-based functional diversity measures and their decomposition: A framework based on hill numbers. PLoS ONE, 9(7), e100014. https://doi.org/10.1371/journal.pone.0100014

Dash, S., Shakyawar, S. K., Sharma, M., & Kaushik, S. (2019). Big data in healthcare: Management, analysis and future prospects. Journal of Big Data, 6(1), 54. https://doi.org/10.1186/s40537-019-0217-0

Dechakhamphu, A., Wongchum, N., Chumroenphat, T., Tanomtong, A., Pinlaor, S., & Siriamornpun, S. (2023). In vitro and in vivo evaluation for antioxidant and anti-diabetic properties of cyperus rotundus l. Kombucha. Foods, 12(22), 4059. https://doi.org/10.3390/foods12224059

Gower, J. C. (2014). Principal coordinates analysis. In R. S. Kenett, N. T. Longford, W. W. Piegorsch, & F. Ruggeri (Eds.), Wiley StatsRef: Statistics Reference Online (1st ed.). Wiley. https://doi.org/10.1002/9781118445112.stat05670

Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: A review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2065), 20150202. https://doi.org/10.1098/rsta.2015.0202

Júnior, J. C. D. S., Meireles Mafaldo, Í., De Lima Brito, I., & Tribuzy De Magalhães Cordeiro, A. M. (2022). Kombucha: Formulation, chemical composition, and therapeutic potentialities. Current Research in Food Science, 5, 360–365. https://doi.org/10.1016/j.crfs.2022.01.023

Kaashyap, M., Cohen, M., & Mantri, N. (2021). Microbial diversity and characteristics of kombucha as revealed by metagenomic and physicochemical analysis. Nutrients, 13(12), 4446. https://doi.org/10.3390/nu13124446

Laavanya, D., Shirkole, S., & Balasubramanian, P. (2021). Current challenges, applications and future perspectives of SCOBY cellulose of Kombucha fermentation. Journal of Cleaner Production, 295, 126454. https://doi.org/10.1016/j.jclepro.2021.126454

Landis, E. A., Fogarty, E., Edwards, J. C., Popa, O., Eren, A. M., & Wolfe, B. E. (2022). Microbial diversity and interaction specificity in kombucha tea fermentations. mSystems, 7(3), e00157-22. https://doi.org/10.1128/msystems.00157-22

Li, C., Chen, Y., & Shang, Y. (2022). A review of industrial big data for decision making in intelligent manufacturing. Engineering Science and Technology, an International Journal, 29, 101021. https://doi.org/10.1016/j.jestch.2021.06.001

Martínez Leal, J., Valenzuela Suárez, L., Jayabalan, R., Huerta Oros, J., & Escalante-Aburto, A. (2018). A review on health benefits of kombucha nutritional compounds and metabolites. CyTA - Journal of Food, 16(1), 390–399. https://doi.org/10.1080/19476337.2017.1410499

Murtagh, F., & Legendre, P. (2014). Ward’s hierarchical agglomerative clustering method: Which algorithms implement ward’s criterion? Journal of Classification, 31(3), 274–295. https://doi.org/10.1007/s00357-014-9161-z

Ricotta, C., & Podani, J. (2017). On some properties of the Bray-Curtis dissimilarity and their ecological meaning. Ecological Complexity, 31, 201–205. https://doi.org/10.1016/j.ecocom.2017.07.003

Shimodaira, H. (2002). An approximately unbiased test of phylogenetic tree selection. Systematic Biology, 51(3), 492–508. https://doi.org/10.1080/10635150290069913

Shimodaira, H. (2004). Approximately unbiased tests of regions using multistep-multiscale bootstrap resampling. The Annals of Statistics, 32(6). https://doi.org/10.1214/009053604000000823

Suzuki, R., & Shimodaira, H. (2006). Pvclust: An R package for assessing the uncertainty in hierarchical clustering. Bioinformatics, 22(12), 1540–1542. https://doi.org/10.1093/bioinformatics/btl117

Volant, S., Lechat, P., Woringer, P., Motreff, L., Campagne, P., Malabat, C., Kennedy, S., & Ghozlane, A. (2020). SHAMAN: A user-friendly website for metataxonomic analysis from raw reads to statistical analysis. BMC

Bioinformatics, 21(1), 345. https://doi.org/10.1186/s12859-020-03666-4

Xia, X., Dai, Y., Wu, H., Liu, X., Wang, Y., Yin, L., Wang, Z., Li, X., & Zhou, J. (2019). Kombucha fermentation enhances the health-promoting properties of soymilk beverage. Journal of Functional Foods, 62, 103549. https://doi.org/10.1016/j.jff.2019.103549

Downloads

Published

2025-12-10

How to Cite

Nugroho, I. B., Darmawan Ari Nugroho, & Abdul Rahman Siregar. (2025). Kombucha origin clustering based on 16S metabarcoding datasets analysis. Journal of Natural Sciences and Mathematics Research, 11(2), 103–112. https://doi.org/10.21580/jnsmr.v11i2.23949

Issue

Section

Original Research Articles

Similar Articles

1 2 > >> 

You may also start an advanced similarity search for this article.