Yousef Taheri Sojasi is passionate about machine learning and natural language processing. His internship, “Weak signals and text data” is supervised by Stephan Clémençon, professor at Télécom Paris and Matthieu Labeau, associate professor at Télécom Paris. The internship started on 30/03/2020 and will end on 31/08/2020. Its aim is the development of representation methods that  make it easier to detect weak signals in text data. Weak signal detection is a major challenge when it comes to applications.  The method takes its inspiration from methods and criteria based on extreme value theory, which extend the scope of supervised and unsupervised learning techniques.

Key words: automatic natural language processing, word representation, extreme value theory, supervised learning, unsupervised learning