AN ECOLINGUISTIC STUDY OF NIAS MEDICINAL PLANTS USING AN AI-BASED FRAMEWORK
Abstract
Indigenous medicinal plant knowledge is a crucial component of ecolinguistic systems because it is embedded in linguistic expressions that reflect ecological relationships, healing practices, and cultural values. However, this knowledge is increasingly threatened by language shift and insufficient documentation, particularly in low-resource indigenous communities. This study develops an AI-based ecolinguistic framework to systematically document and represent Nias ethnomedicinal knowledge by integrating ethnobotanical field data with culturally grounded artificial intelligence approaches. Qualitative data were obtained through semi-structured interviews with traditional Nias healers, which identified fifteen commonly used medicinal plant species. To assess cultural salience and communal consensus, the study applied the Relative Frequency of Citation (RFC) index, the proportion of informants who cite a given species. The quantitative findings reveal an uneven distribution of cultural prominence among the documented species: Gundre and Mbulu Nazalöu were the most frequently cited plants (frequency of citation, FC = 14; RFC = 0.93 each), indicating their central role within the Nias ethnomedicinal knowledge system. The documented knowledge was then structured as a Knowledge Graph and coupled with a Retrieval-Augmented Generation (RAG) architecture to enable contextualized, culturally sensitive knowledge representation. The proposed framework demonstrates how artificial intelligence can support the preservation, organization, and revitalization of endangered indigenous medicinal knowledge while maintaining its ecolinguistic integrity.
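The RFC scoring and the graph-style representation described in the abstract can be illustrated with a minimal sketch. It is not the study's implementation. It assumes the standard definition RFC = FC / N and a panel of N = 15 informants, a figure the abstract does not state but which is consistent with FC = 14 and RFC = 0.93. The predicate names `citedBy` and `hasRFC` and the helper functions are hypothetical, chosen only for illustration.

```python
# ASSUMPTION: 15 informants. The abstract does not give this number; it is
# inferred from FC = 14 and RFC = 0.93, since 14 / 15 rounds to 0.93.
N_INFORMANTS = 15

# Citation counts reported in the abstract for the two top-ranked plants.
citations = {"Gundre": 14, "Mbulu Nazalöu": 14}


def rfc(fc: int, n_informants: int) -> float:
    """Relative Frequency of Citation: RFC = FC / N, so 0 <= RFC <= 1."""
    if n_informants <= 0 or not 0 <= fc <= n_informants:
        raise ValueError("need N > 0 and 0 <= FC <= N")
    return fc / n_informants


def to_triples(counts: dict, n_informants: int) -> list:
    """Flatten the scores into (subject, predicate, object) triples.

    The predicate names are hypothetical, not the study's schema.
    """
    triples = []
    for plant, fc in counts.items():
        triples.append((plant, "citedBy", fc))
        triples.append((plant, "hasRFC", round(rfc(fc, n_informants), 2)))
    return triples


def lookup(triples: list, subject: str) -> list:
    """Return every (predicate, object) pair stored for one plant."""
    return [(p, o) for s, p, o in triples if s == subject]


triples = to_triples(citations, N_INFORMANTS)
print(lookup(triples, "Gundre"))  # [('citedBy', 14), ('hasRFC', 0.93)]
```

In a retrieval-augmented setup, facts returned by a lookup of this kind would be supplied to a language model as context, so that generated answers stay tied to what the healers actually reported.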