BBG52: A New Dataset for Plant Species Recognition in the Balikpapan Botanical Gardens, Borneo Island

Authors

DOI:

https://doi.org/10.35718/iiair.v1i1.1277

Keywords:

computer vision, plant classification, balikpapan botanical gardens, residual network, transfer learning, deep learning, machine learning

Abstract

The Balikpapan Botanical Garden serves as conservation area in Indonesia for preserving biodiversity, particularly the endemic species of Kalimantan. Accurate and efficient identification and classification of plant species are crucial for conservation efforts. However, traditional methods are often time-consuming and require an expert necessitating the need for an automated approach. In this study, we manually collected a dataset of natural images in the Balikpapan Botanical Garden that simulate real-world conditions which contains 5200 image samples of 52 different plant species named BBG52. We compared the manual train-test data splitting by considering intra-class variants againts random splitting to evaluate the performance differences. To construct classification model, we employed ResNet variants as pre-trained models—ResNet-34, ResNet-50, and ResNet-10—and examined the effect of the hidden layer in the classification part of the model. Our empirical results demonstrate that manual data splitting yields better performance than random splitting. Furthermore, the ResNet-50 model without additional hidden layers achieved the best performance with an accuracy of 96.88% and F1-score of 0.9689. The computational analysis provided empirical evidence that the model requires 0.1379 seconds on a CPU and 0.0861 seconds on a GPU demonstrating the model’s efficiency in the constrained device. The BBG52 dataset is openly accessible at https://github.com/inidhanii/BBG52

References

T. D. Siboro, “Manfaat Keanekaragaman Hayati Terhadap Lingkungan,” JURNAL ILMIAH SIMANTEK ISSN. 2550-0414, vol. 3, Feb. 2019.

A. Antonelli et al., State of the World’s Plants and Fungi 2023. 2023. doi: 10.34885/wnwn-6s63.

D. W. Purnomo, M. Siregar, J. R. Witono, and D. Usmadi, “Rencana 10 Tahun (2020-2030) Pengembangan Kebun Raya di Indonesia,” 2020.

J. Wäldchen, M. Rzanny, M. Seeland, and P. Mäder, “Automated plant species identification—Trends and future directions,” Apr. 01, 2018, Public Library of Science. doi: 10.1371/journal.pcbi.1005993.

A. Lawrence and W. Hawthorne, “Plant identification: Creating user-friendly field guides for biodiversity management,” Plant Identification: Creating User-Friendly Field Guides for Biodiversity Management, pp. 1–276, Oct. 2013, doi: 10.4324/9781849772198.

L. Alzubaidi et al., “Review of deep learning: concepts, CNN architectures, challenges, applications, future directions,” J Big Data, vol. 8, no. 1, Dec. 2021, doi: 10.1186/s40537-021-00444-8.

K. He, X. Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition,” 2015. [Online]. Available: http://image-net.org/challenges/LSVRC/2015/

A. Atique, S. Karim, S. Shahid, and Z. Alamgir, “Identification of Plant Species through Leaf Vein Morphometric and Deep Learning,” Pak J Bot, vol. 54, no. 6, pp. 2195–2202, 2022, doi: 10.30848/PJB2022-6(38).

R. Pujiati and N. Rochmawati, “Identifikasi Citra Daun Tanaman Herbal Menggunakan Metode Convolutional Neural Network (CNN),” Journal of Informatics and Computer Science, vol. 03, 2022.

Y. Sun, Y. Liu, G. Wang, and H. Zhang, “Deep Learning for Plant Identification in Natural Environment,” Comput Intell Neurosci, vol. 2017, 2017, doi: 10.1155/2017/7361042.

V. Bodhwani, D. P. Acharjya, and U. Bodhwani, “Deep residual networks for plant identification,” in Procedia Computer Science, Elsevier B.V., 2019, pp. 186–194. doi: 10.1016/j.procs.2019.05.042.

Y. Gong, G. Liu, Y. Xue, R. Li, and L. Meng, “A survey on dataset quality in machine learning,” Inf Softw Technol, vol. 162, p. 107268, Oct. 2023, doi: 10.1016/j.infsof.2023.107268.

B. Falahkhi, E. F. Achmal, M. Rizaldi, R. Rizki, and N. Yudistira, “Perbandingan Model AlexNet dan ResNet dalam Klasifikasi Citra Bunga Memanfaatkan Transfer Learning Comparison of AlexNet and ResNet Models in Flower Image Classification Utilizing Transfer Learning,” 2022, [Online]. Available: http://journal.ipb.ac.id/index.php/jika

M. S. I. Musyaffa, N. Yudistira, M. A. Rahman, and J. Batoro, “IndoHerb: Indonesia Medicinal Plants Recognition using Transfer Learning and Deep Learning,” Aug. 2023, [Online]. Available: http://arxiv.org/abs/2308.01604

T. Nguyen and V. Truong Hoang, “VNPlant-200 – A Public and Large-Scale of Vietnamese Medicinal Plant Images Dataset,” 2021, pp. 406–411. doi: 10.1007/978-3-030-49264-9_37.

E. D. Cubuk, B. Zoph, J. Shlens, and Q. V. Le, “RandAugment: Practical automated data augmentation with a reduced search space,” Sep. 2019, [Online]. Available: http://arxiv.org/abs/1909.13719

K. He, X. Zhang, S. Ren, and J. Sun, “Identity Mappings in Deep Residual Networks,” Mar. 2016, [Online]. Available: http://arxiv.org/abs/1603.05027

A. Paszke et al., “PyTorch: An Imperative Style, High-Performance Deep Learning Library,” in Advances in Neural Information Processing Systems 32, Curran Associates, Inc., 2019, pp. 8024–8035.

Sample images from the BBG52 dataset

Published

30-04-2025

How to Cite

Ramadhani, R., Gusti Ahmad Fanshuri Alfarisy, & Boby Mugi Pratama. (2025). BBG52: A New Dataset for Plant Species Recognition in the Balikpapan Botanical Gardens, Borneo Island. Innovative Informatics and Artificial Intelligence Research, 1(1), 19–25. https://doi.org/10.35718/iiair.v1i1.1277