The Association for Computational Linguistics (ACL) is the premier international scientific and professional society for people working on computational problems involving human language, a field often referred to as either computational linguistics or natural language processing (NLP).
Computational linguistics is the scientific study of language from a computational perspective. Computational linguists are interested in providing computational models of various kinds of linguistic phenomena. These models may be “knowledge-based” (“hand-crafted”) or “data-driven” (“statistical” or “empirical”).
Activities of the ACL include the holding of an annual meeting each summer and the sponsoring of the journal Computational Linguistics, published by MIT Press; this conference and journal are the leading publications of the field.
This year's 60th Annual Meeting took place May 22-27, 2022, as a hybrid event in Dublin and online. The DSAIDIS researchers presented four papers:
2022|The 60th Annual Meeting of the Association for Computational Linguistics
On November 22, the DSAIDIS Chair Meetup was held at Télécom Paris. The purpose of this meeting was to connect industrial partners and students of the DSAIDIS chair.
After a presentation session by the various industrial partners who support the chair in its activities, PhD students and postdocs had the opportunity to meet and discuss in small groups in dedicated rooms.
13h45 Welcome
14h00 Presentation of industrial partners
14h15 Installation of the teams on the 5th floor – partner / student meetings
15h30 Coffee break
15h45 Partner / student meetings
17h00 End
On Tuesday, November 15, the “Optimization and neural networks” workshop of the DSAIDIS chair was held. Permanent members and PhD students presented their research work.
Olivier Fercoq
I will present the ADAM algorithm, a well-known stochastic gradient method with an adaptive learning rate. It is based on exponential moving averages of the stochastic gradients and of their squares, which estimate the first and second moments.
Then I will explain the main ideas of its convergence proof in the case of a convex objective function. The challenges are the following: 1) the estimate of the first moment is biased; 2) the learning rate is a random variable. They are solved by finding terms that telescope almost surely and by using the fact that the learning rate is small when the gradient estimate is noisy.
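The update described in this abstract can be sketched as follows. This is a minimal illustration and not the speaker's code: the function names, the quadratic test objective, the noise level, the hyperparameter values and the decaying step size `lr / sqrt(t)` are all assumptions chosen to give a small, self-contained convex example.

```python
import math
import random


def adam(stochastic_grad, x0, steps=2000, lr=0.5,
         beta1=0.9, beta2=0.999, eps=1e-8):
    """Minimize a function from noisy gradients with Adam-style updates."""
    x = list(x0)
    m = [0.0] * len(x)  # moving average of gradients (first moment)
    v = [0.0] * len(x)  # moving average of squared gradients (second moment)
    for t in range(1, steps + 1):
        g = stochastic_grad(x)
        step = lr / math.sqrt(t)  # decaying step size (assumption of this sketch)
        for i in range(len(x)):
            m[i] = beta1 * m[i] + (1 - beta1) * g[i]
            v[i] = beta2 * v[i] + (1 - beta2) * g[i] ** 2
            # Both averages start at zero, so they are biased toward zero;
            # dividing by (1 - beta ** t) corrects this.
            m_hat = m[i] / (1 - beta1 ** t)
            v_hat = v[i] / (1 - beta2 ** t)
            # The effective learning rate shrinks when v_hat is large,
            # i.e. when recent gradients were large or noisy.
            x[i] -= step * m_hat / (math.sqrt(v_hat) + eps)
    return x


# Hypothetical convex objective: f(x) = 0.5 * ||x - TARGET||^2,
# observed only through gradients corrupted by Gaussian noise.
TARGET = [3.0, -2.0]
rng = random.Random(0)


def noisy_grad(x):
    return [xi - ti + rng.gauss(0.0, 0.1) for xi, ti in zip(x, TARGET)]


solution = adam(noisy_grad, [0.0, 0.0])
print(solution)
```

The per-coordinate division by the square root of the second-moment estimate is what makes the learning rate a random variable, which is the second difficulty mentioned in the abstract.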
Maxime Lieber
In this talk, we revisit the tuning of the spectrogram window length, making the window length a continuous parameter optimizable by gradient descent instead of an empirically tuned integer-valued hyperparameter.
We first define two differentiable versions of the STFT w.r.t. the window length: one for the case where the local bin centers are fixed and independent of the window length parameter, and one for the more difficult case where the window length affects the position and number of bins. We then present the smooth optimization of the window length with any standard loss function. We show that this optimization can be of interest not only for any neural network-based inference system, but also for any STFT-based signal processing algorithm. We also show that the window length can not only be fixed and learned offline, but also be adaptive and optimized on the fly. The contribution is mainly theoretical for the moment, but the approach is very general and is expected to have large-scale applications in several fields.
Enzo Tartaglione
Recent advances in deep learning optimization showed that, with some a-posteriori information on fully-trained models, it is possible to match their performance by simply training a subset of their parameters which, it is said, “had won at the lottery of initialization”.
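The idea of training only a subset of parameters can be sketched on a toy problem. This is an illustration under stated assumptions, not the method presented in the talk: the linear model, the synthetic data, the function `train_masked` and above all the mask are hypothetical. Here the mask is simply given, whereas in practice it would come from the a-posteriori information on a fully-trained model.

```python
import random


def train_masked(w_init, mask, data, lr=0.5, epochs=300):
    """Gradient descent on a linear least-squares model, updating only
    the parameters selected by `mask`; the others are fixed at zero."""
    w = [wi if keep else 0.0 for wi, keep in zip(w_init, mask)]
    n = len(data)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for x, y in data:
            err = sum(wi * xi for wi, xi in zip(w, x)) - y
            for i in range(len(w)):
                grad[i] += err * x[i] / n
        for i in range(len(w)):
            if mask[i]:  # masked-out parameters are never updated
                w[i] -= lr * grad[i]
    return w


rng = random.Random(0)
TRUE_W = [2.0, 0.0, -1.0, 0.0]  # hypothetical sparse ground truth
data = []
for _ in range(50):
    x = [rng.uniform(-1.0, 1.0) for _ in TRUE_W]
    y = sum(wi * xi for wi, xi in zip(TRUE_W, x))
    data.append((x, y))

w_init = [rng.uniform(-0.1, 0.1) for _ in TRUE_W]  # random initialization
mask = [1, 0, 1, 0]  # assumed known: which parameters to train

w = train_masked(w_init, mask, data)
print(w)
```

In this toy setting the two trained parameters are enough to fit the data, which mirrors the claim that a well-chosen subset can match the full model.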
Hicham Janati
The day ended with a discussion on big data and frugal AI.
The Neural Information Processing Systems Foundation is a non-profit corporation whose purpose is to foster the exchange of research on neural information processing systems in their biological, technological, mathematical, and theoretical aspects. Neural information processing is a field which benefits from a combined view of biological, physical, mathematical, and computational sciences.
The primary focus of the Foundation is the presentation of a continuing series of professional meetings known as the Neural Information Processing Systems Conference or NeurIPS, held over the years at various locations in the United States, Canada and Spain.
The 2022 edition will take place at the New Orleans Morial Convention Center, USA.
This year, 7 papers from the DSAIDIS chair were accepted:
ICML, the International Conference on Machine Learning, is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning.
ICML is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, or robotics.
ICML is one of the fastest growing artificial intelligence conferences in the world. Participants at ICML span a wide range of backgrounds, from academic and industrial researchers, to entrepreneurs and engineers, to graduate students and postdocs.
This year, it was held in Baltimore, Maryland, USA, from July 17 to 23. The DSAIDIS researchers presented three papers:
Functional Output Regression with Infimal Convolution: Exploring the Huber and ϵ-insensitive Losses [Arxiv]
Alex Lambert (KU Leuven) · Dimitri Bouche (Télécom Paris) · Zoltan Szabo (Ecole Polytechnique) · Florence d’Alché-Buc (Télécom Paris, Institut Polytechnique de Paris)
Learning to Predict Graphs with Fused Gromov-Wasserstein Barycenters [Arxiv]
Luc Brogat-Motte (Télécom Paris) · Rémi Flamary (École Polytechnique) · Celine Brouard (INRAE) · Juho Rousu (Aalto University) · Florence d’Alché-Buc (Télécom Paris, Institut Polytechnique de Paris)
Mitigating Gender Bias in Face Recognition using the von Mises-Fisher Mixture Model [Arxiv]
Jean-Rémy Conti (Télécom Paris, IDEMIA) · Nathan Noiry (Télécom Paris) · Vincent Despiegel (IDEMIA) · Stéphane Gentric (IDEMIA) · Stephan Clémençon (Télécom ParisTech)
2022|The Thirty-ninth International Conference on Machine Learning.
The DSAIDIS Annual Day 2022 took place on Wednesday 15 June. This annual event is an opportunity for the Télécom Paris team to meet the operational teams of the five partner companies: Airbus, ENGIE, IDEMIA, Safran and Valeo. On this occasion, members of the academic team, from professors to PhD students, presented their work on the four research axes of the Chair. There was also plenty of time for socializing and exchanging ideas.
9h – 9h30 – Welcome coffee and Introduction > Find the materials of the introduction presentation
9h30 – 10h50 – Axis 2: Exploiting large scale, heterogeneous, partially labeled data > Find the materials and the video replay of the presentations of the Axis 2
10h50 – 11h10 – Coffee break
11h10 – 12h35 – Axis 1: Building predictive analytics on time series and data streams > Find the materials and the video replay of the presentations of the Axis 1
12h35 – 14h – Lunch
14h – 15h20 – Axis 4: Learning through interactions with environment > Find the materials and the video replay of the presentations of the Axis 4
15h20 – 15h40 – Coffee break
15h40 – 17h – Axis 3: Machine Learning for trusted and robust decision > Find the materials and the video replay of the presentations of the Axis 3
17h – 17h15 – Discussion and conclusion
Since its inception in 1985, AISTATS has been an interdisciplinary gathering of researchers at the intersection of artificial intelligence, machine learning, statistics and related areas.
For this 25th edition, held online again this year from March 28 to 30, the academic team of the DSAIDIS chair presented two papers:
Pierre Colombo received the Outstanding Student Paper award at the 36th AAAI Conference on Artificial Intelligence for his publication “InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation”. This new category, added in 2022, recognizes his work as a post-doctoral fellow at the DSAIDIS Chair, under the supervision of Chloé Clavel.
P. Colombo, C. Clavel, and P. Piantanida, “InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation”, AAAI (2022).