With the help of modem technologies and big data technologies, it is also becoming difficult to implement large volumes of texts in the Uzbek language. Training language models from large corpora, extracting meaningful information from real-time chats, semantic indexing of texts and creating search engines are all now implemented through the integration of platforms such as Hadoop, Spark, NoSQL, Kafka and Elasticsearch.