MATHEMATICAL MODEL FOR AUTOMATIC CREATION THE SEMANTIC THESAURUS FOR THE SCIENTIFIC TEXT

Authors

  • Oleg Volkovskiy
  • Egor Kovylin

DOI:

https://doi.org/10.34185/1562-9945-6-125-2019-08

Keywords:

Thesaurus, latent-semantic analysis, semantic document model

Abstract

The paper deals with the issues related to the use of the algorithm of constructing the semantic model of the document for the creation the thesaurus of the scientific text terms in the natural language. The purpose of the paper is to develop an approach to create matching between elements of scientific text that do not have a direct syntactic link, but are semantically related to the one field. The relevance of the research is that the system does not use linguistic or vocabulary knowledge during its work, which makes it a universal tool for forming semantic correspondence between terms in a scientific text. Obtained results show that the semantic labels of a document have the highest number of intersections with semantic contours when they contain the largest number of semantically significant stems in their composition, which allows to make assumptions about a direct semantic connection between terms corresponding to such stems.

References

N.M. Bogest. Hierarchical and associative relations in thesaurus on the example of the designer dictionary // Bulletin of the Samara State Aerospace University. - 2012. №2 (33). –p.-228-236.

Voloshin P., Svitla S. Automated creation of subject area thesaurus for local search engines // “Knowledge - Dialogue - Solution” International Book Series “information science & computing”, Number 15. - FOI ITHEA Sofia, Bulgaria. - 2009. - p. 24–31.

V. Trusov Construction of thesauruses, thematic classifications and rubricators for information retrieval in distributed information systems/ Bulletin of the Novosibirsk State University - 2015. №2 (13). - p. 86-101

O.S. Volkovsky, Y. R. Kovylin. Computer System of Building of the Semantic Model of the Document // 2018 IEEE Second International Conference on Data Stream Mining & Processing (DSMP) - p. 322-327 – Lviv, 2018. DOI: 10.1109/DSMP.2018.8478591.

O.S. Volkovsky, Y. R. Kovylin. Computer system of intellectual semantic search with the text generation using// Bulletin of the Kherson National University - 2018. №3 (66). -p. 238-245.

Downloads

Published

2019-12-27