APPLICATION OF LARGE LANGUAGE MODELS FOR BUILDING A “FOREST OF TERM HIERARCHIES”

Authors

DOI:

https://doi.org/10.28925/2663-4023.2025.27.712

Keywords:

large language models, artificial intelligence, term hierarchy forest, term ontologies, data visualization, cybersecurity

Abstract

One of the ways to organize and systematize knowledge is to form terminology ontologies that allow you to structure information in specific subject areas, such as cybersecurity. With the revolutionary emergence of large language models (LLMs), new opportunities are emerging to automate the process of building a "forest of term hierarchies" (FTH). Building a FTH is essential for several key aspects of cybersecurity and knowledge management, such as unifying terminology, improving communication, optimizing information retrieval, systematizing knowledge, adapting to new challenges, and supporting research and innovation. The article discusses the role of LLM in building FTH in the context of modern challenges of the information environment. Thanks to revolutionary advances in artificial intelligence, LLMs automate and optimize the processes of processing, analyzing, and structuring large amounts of text data. The article describes the key stages of FTH implementation using LLM, including text data processing, determining the discriminant power of terms, establishing links between them, and visualizing the results. A methodology for determining the associative relationships between predefined terms for building FTH is proposed. Examples of the practical implementation of the proposed methodology based on the use of the information-analytical system "Cyber Aggregator" are given. An example of forming a prompt for building a FTH for the generative artificial intelligence system DeepSeek.com is demonstrated. The technology of FTH visualization is proposed by using the program for analyzing and visualizing graphs CSV2Graph. The use of the proposed technologies makes it possible to increase the efficiency and accuracy of building terminological ontologies, which is important for adapting to the rapidly growing information flows in the modern world.

Downloads

Download data is not yet available.

References

Babayeva, G., Maennel, K., & Maennel, O. M. (2022). Building an ontology for cyber defence exercises. 2022 IEEE European Symposium on Security and Privacy Workshops (EuroS&PW), 423–432. https://doi.org/10.1109/EuroSPW55150.2022.00050

Adach, M., Hänninen, K., & Lundqvist, K. (2021). Structured information retrieval of security ontologies. MDH, Tech. Rep. http://www.es.mdh.se/pdf_publications/6449.pdf

Rackevičienė, S., & Mockienė, L. (2020). Cyber Law Terminology as a New Lexical Field in Legal Discourse. International Journal for the Semiotics of Law - Revue internationale de Sémiotique juridique, 33(4), 673–687. https://doi.org/10.1007/s11196-020-09690-0

Canito, A., Aleid, K., Praça, I., Corchado, J., & Marreiros, G. (2020). An ontology to promote interoperability between cyber-physical security systems in critical infrastructures. 2020 IEEE 6th International Conference on Computer and Communications (ICCC), 553–560. https://doi.org/10.1109/ICCC51575.2020.9345163

Satyapanich, T., Ferraro, F., & Finin, T. (2020). CASIE: Extracting cybersecurity event information from text. Proceedings of the AAAI Conference on Artificial Intelligence, 34(5), 8749–8757. https://doi.org/10.1609/aaai.v34i05.6401

Sikos, L. F. (2023). Cybersecurity knowledge graphs. Knowledge and Information Systems, 65(12), 3511–3531. https://doi.org/10.1007/s10115-023-01860-3

Lande, D. (2014). Formation of networks of natural hierarchies of terms based on the analysis of text corpora on legal topics. Legal informatics, 1(41), 12–17.

Lande, D., Snarskii, A., Yagunova, E., & Pronoza, E. (2014). Network of Natural Terms Hierarchy as a Lightweight Ontology. Thirteenth Mexican International Conference on Artificial Intelligence (MICAI 2014), 16–23.

Lande, D. V., Snarskii, A. A., Yagunova, E. V., & Pronoza, E. V. (2013). The Use of Horizontal Visibility Graphs to Identify the Words that Define the Informational Structure of a Text. 12th Mexican International Conference on Artificial Intelligence, 209–215.

Lande, D., Subach, I., & Puchkov, A. (2020). System of Analysis of Big Data from Social Media. Information & Security: An International Journal, 47(1), 44–61. https://doi.org/10.11610/isij.4703

BigSearch Space. (n.d.). https://bigsearch.space/uli.html

DeepSeek. (n.d.). https://www.deepseek.com/

Downloads


Abstract views: 334

Published

2025-03-27

How to Cite

Puchkov, O., Lande, D., & Subach, I. (2025). APPLICATION OF LARGE LANGUAGE MODELS FOR BUILDING A “FOREST OF TERM HIERARCHIES”. Electronic Professional Scientific Journal «Cybersecurity: Education, Science, Technique», 3(27), 6–21. https://doi.org/10.28925/2663-4023.2025.27.712

Most read articles by the same author(s)