Publications

See me also on Google Scholar and ORCID.org.

2025

ICML

Exploring Large Action Sets with Hyperspherical Embeddings using von Mises-Fisher Sampling

W. Bendada, G. Salha-Galvan, R. Hennequin, T. Bontempelli, T. Bouabça, T. Cazenave
42nd International Conference on Machine Learning (ICML 2025)

Abstract PDF Appendix Code

This paper introduces von Mises-Fisher exploration (vMF-exp), a scalable method for exploring large action sets in reinforcement learning problems where hyperspherical embedding vectors represent these actions. vMF-exp involves initially sampling a state embedding representation using a von Mises-Fisher distribution, then exploring this representation's nearest neighbors, which scales to virtually unlimited numbers of candidate actions. We show that, under theoretical assumptions, vMF-exp asymptotically maintains the same probability of exploring each action as Boltzmann Exploration (B-exp), a popular alternative that, nonetheless, suffers from scalability issues as it requires computing softmax values for each action. Consequently, vMF-exp serves as a scalable alternative to B-exp for exploring large action sets with hyperspherical embeddings. Experiments on simulated data, real-world public data, and the successful large-scale deployment of vMF-exp on the recommender system of a global music streaming service empirically validate the key properties of the proposed method.

RecSys

EARL: The 2nd Workshop on Evaluating and Applying Recommender Systems with Large Language Models

I. Li, R. Dong, G. Salha-Galvan, A. Lawlor, D. Liu, L. Li
19th ACM Conference on Recommender Systems (RecSys 2025)

Abstract PDF Website

This article presents our proposal to organize the 2nd Workshop on Evaluating and Applying Recommender Systems with Large Language Models (EARL), to be held in conjunction with the 19th ACM Conference on Recommender Systems (RecSys 2025) in Prague, Czech Republic, in September 2025. Building on the success of the first EARL edition at RecSys 2024, we will foster dynamic and interactive discussions and debates on the application and evaluation of large language models (LLMs) in recommender systems (RSs). The workshop will explore emerging techniques such as retrieval-augmented generation (RAG), multi-modal recommendation, reinforcement learning with human feedback (RLHF), scalable fine-tuning methods, dynamic prompting strategies, and personalized conversational agents. In addition to highlighting key innovations and showcasing applications across diverse sectors, we will emphasize critical challenges in LLM-driven personalization, including bias, fairness, and transparency, which are essential for ensuring trustworthy and responsible RSs. Ultimately, the workshop aims to inspire new research directions and foster collaboration in the rapidly evolving field of LLM-powered RSs.

ISMIR

MusicSem: A Dataset of Music Descriptions on Reddit Capturing Musical Semantics

R. Salganik, T. Tu, F.-Y. Chen, X. Liu, K. Lu, E. Luvisia, Z. Duan, G. Salha-Galvan, A. Kahng, Y. Ma, J. Kang
Late-Breaking Demo
26th Conference of the International Society for Music Information Retrieval (ISMIR 2025)

Abstract Website Code Dataset

We present MusicSem, a dataset of 35,977 language–audio music descriptions derived from organic discussions on Reddit. What sets MusicSem apart is its focus on capturing a broad spectrum of musical semantics, reflecting how listeners naturally describe music in nuanced, human-centered ways. To structure these expressions, we propose a taxonomy of five semantic categories: descriptive, atmospheric, situational, metadata-related, and contextual. Our motivation for releasing MusicSem stems from the observation that music representation learning models often lack sensitivity to these semantic dimensions, due to the limited expressiveness of existing training datasets. MusicSem addresses this gap by serving as a novel semantics-aware resource for training and evaluating models on tasks such as cross-modal music generation and retrieval.

ICLR

von Mises-Fisher Sampling of GloVe Vectors

W. Bendada, G. Salha-Galvan, R. Hennequin, T. Bontempelli, T. Bouabça, T. Cazenave
Workshop on Frontiers in Probabilistic Inference: Sampling Meets Learning
13th International Conference on Learning Representations (ICLR 2025)

Abstract PDF Code

A recent publication introduced von Mises-Fisher exploration (vMF-exp), a scalable sampling method for exploring large action sets in reinforcement learning problems where hyperspherical embedding vectors represent these actions. We present the first experimental validation of vMF-exp's key theoretical and scalability properties on a publicly available real-world dataset, confirming the potential of this method.

WWW

To Share or Not to Share: Investigating Weight Sharing in Variational Graph Autoencoders

G. Salha-Galvan, J. Xu
2025 ACM Web Conference (WWW 2025)

Abstract PDF Code Data

This paper investigates the understudied practice of weight sharing (WS) in variational graph autoencoders (VGAE). WS presents both benefits and drawbacks for VGAE model design and node embedding learning, leaving its overall relevance unclear and the question of whether it should be adopted unresolved. We rigorously analyze its implications and, through extensive experiments on a wide range of graphs and VGAE variants, demonstrate that the benefits of WS consistently outweigh its drawbacks. Based on our findings, we recommend WS as an effective approach to optimize, regularize, and simplify VGAE models without significant performance loss.

2024

RecSys

Transformers Meet ACT-R: Repeat-Aware and Sequential Listening Session Recommendation

V.A. Tran, G. Salha-Galvan, B. Sguerra, R. Hennequin
18th ACM Conference on Recommender Systems (RecSys 2024)
Best full paper candidate

Abstract PDF Code Data

Music streaming services often leverage sequential recommender systems to predict the best music to showcase to users based on past sequences of listening sessions. Nonetheless, most sequential recommendation methods ignore or insufficiently account for repetitive behaviors. This is a crucial limitation for music recommendation, as repeatedly listening to the same song over time is a common phenomenon that can even change the way users perceive this song. In this paper, we introduce PISA (Psychology-Informed Session embedding using ACT-R), a session-level sequential recommender system that overcomes this limitation. PISA employs a Transformer architecture learning embedding representations of listening sessions and users using attention mechanisms inspired by Anderson's ACT-R (Adaptive Control of Thought-Rational), a cognitive architecture modeling human information access and memory dynamics. This approach enables us to capture dynamic and repetitive patterns from user behaviors, allowing us to effectively predict the songs they will listen to in subsequent sessions, whether they are repeated or new ones. We demonstrate the empirical relevance of PISA using both publicly available listening data and proprietary data from Deezer, a global music streaming service, confirming the critical importance of repetition modeling for sequential listening session recommendation. Along with this paper, we publicly release our proprietary dataset to foster future research in this field, as well as the source code of PISA to facilitate its future use.

RecSys

Do Recommender Systems Promote Local Music? A Reproducibility Study Using Music Streaming Data

K. Matrosova, L. Marey, G. Salha-Galvan, T. Louail, O. Bodini, M. Moussallam
18th ACM Conference on Recommender Systems (RecSys 2024)

Abstract PDF Code Data

This paper examines the influence of recommender systems on local music representation, discussing prior findings from an empirical study on the LFM-2b public dataset. This prior study argued that different recommender systems exhibit algorithmic biases shifting music consumption either towards or against local content. However, LFM-2b users do not reflect the diverse audience of music streaming services. To assess the robustness of this study's conclusions, we conduct a comparative analysis using proprietary listening data from a global music streaming service, which we publicly release alongside this paper. We observe significant differences in local music consumption patterns between our dataset and LFM-2b, suggesting that caution should be exercised when drawing conclusions on local music based solely on LFM-2b. Moreover, we show that the algorithmic biases exhibited in the original work vary in our dataset, and that several unexplored model parameters can significantly influence these biases and affect the study's conclusion on both datasets. Finally, we discuss the complexity of accurately labeling local music, emphasizing the risk of misleading conclusions due to unreliable, biased, or incomplete labels. To encourage further research and ensure reproducibility, we have publicly shared our dataset and code.

RecSys

Let's Get It Started (Again!): Fostering the Discoverability of New Releases on Deezer

L. Briand, T. Bontempelli, W. Bendada, M. Morlon, F. Rigaud, B. Chapus, T. Bouabça, G. Salha-Galvan
Workshop on Music Recommender Systems
18th ACM Conference on Recommender Systems (RecSys 2024)
(Workshop presentation showcasing our ECIR 2024 work)

Abstract PDF Slides

This workshop paper presents an extended version of a work previously been accepted for oral presentation as an Industry Talk at the 46th European Conference on Information Retrieval (ECIR 2024), under the title "Let's Get It Started: Fostering the Discoverability of New Releases on Deezer". This conference version received the ECIR 2024 Best Industry Track Paper award. We present our recent initiatives to foster the discoverability of new releases on the music streaming service Deezer. After introducing our search and recommendation features dedicated to new releases, we outline our shift from editorial to personalized release suggestions using cold start embeddings and contextual bandits. Backed by online experiments, we discuss the advantages of this shift in terms of recommendation quality and exposure of new releases on the service.

ICML

vMF-exp: von Mises-Fisher Exploration of Large Action Sets with Hyperspherical Embeddings

W. Bendada, G. Salha-Galvan, R. Hennequin, T. Bontempelli, T. Bouabça, T. Cazenave
Workshop on Aligning Reinforcement Learning Experimentalists and Theorists
41st International Conference on Machine Learning (ICML 2024)

Abstract PDF Code

This workshop paper is under review for presentation at an international conference. We introduce von Mises-Fisher exploration (vMF-exp), a scalable method for exploring large action sets in reinforcement learning problems where hyperspherical embedding vectors represent actions. vMF-exp involves initially sampling a state embedding representation using a von Mises-Fisher hyperspherical distribution, then exploring this representation’s nearest neighbors, which scales to unlimited numbers of candidate actions. We show that, under theoretical assumptions, vMFexp asymptotically maintains the same probability of exploring each action as Boltzmann Exploration (B-exp), a popular alternative that, nonetheless, suffers from scalability issues as it requires computing softmax values for each action. Consequently, vMF-exp serves as a scalable alternative to B-exp for exploring large action sets with hyperspherical embeddings. We further validate the empirical relevance of vMF-exp by discussing its successful deployment at scale on a music streaming service to recommend playlists to millions of users.

ECIR

Let's Get It Started: Fostering the Discoverability of New Releases on Deezer

L. Briand, T. Bontempelli, W. Bendada, M. Morlon, F. Rigaud, B. Chapus, T. Bouabça, G. Salha-Galvan
46th European Conference on Information Retrieval (ECIR 2024)
Best industry track paper award

Abstract PDF Slides

This paper presents our recent initiatives to foster the discoverability of new releases on the music streaming service Deezer. After introducing our search and recommendation features dedicated to new releases, we outline our shift from editorial to personalized release suggestions using cold start embeddings and contextual bandits. Backed by online experiments, we discuss the advantages of this shift in terms of recommendation quality and exposure of new releases on the service.

2023

RecSys

Track Mix Generation on Music Streaming Services using Transformers

W. Bendada, T. Bontempelli, M. Morlon, B. Chapus, T. Cador, T. Bouabça, G. Salha-Galvan
17th ACM Conference on Recommender Systems (RecSys 2023)

Abstract PDF Blog Post

This paper introduces Track Mix, a personalized playlist generation system released in 2022 on the music streaming service Deezer. Track Mix automatically generates "mix" playlists inspired by initial music tracks, allowing users to discover music similar to their favorite content. To generate these mixes, we consider a Transformer model trained on millions of track sequences from user playlists. In light of the growing popularity of Transformers in recent years, we analyze the advantages, drawbacks, and technical challenges of using such a model for mix generation on the service, compared to a more traditional collaborative filtering approach. Since its release, Track Mix has been generating playlists for millions of users daily, enhancing their music discovery experience on Deezer.

RecSys

On the Consistency of Average Embeddings for Item Recommendation

W. Bendada, G. Salha-Galvan, R. Hennequin, T. Bouabça, T. Cazenave
17th ACM Conference on Recommender Systems (RecSys 2023)

Abstract PDF Code Poster

A prevalent practice in recommender systems consists in averaging item embeddings to represent users or higher-level concepts in the same embedding space. This paper investigates the relevance of such a practice. For this purpose, we propose an expected precision score, designed to measure the consistency of an average embedding relative to the items used for its construction. We subsequently analyze the mathematical expression of this score in a theoretical setting with specific assumptions, as well as its empirical behavior on real-world data from music streaming services. Our results emphasize that real-world averages are less consistent for recommendation, which paves the way for future research to better align real-world embeddings with assumptions from our theoretical setting.

SIGIR

Attention Mixtures for Time-Aware Sequential Recommendation

V.A. Tran, G. Salha-Galvan, B. Sguerra, R. Hennequin
46th International ACM SIGIR Conf. on Research and Development in Information Retrieval (SIGIR 2023)

Abstract PDF Code

Transformers emerged as powerful methods for sequential recommendation. However, existing architectures often overlook the complex dependencies between user preferences and the temporal context. In this short paper, we introduce MOJITO, an improved Transformer sequential recommender system that addresses this limitation. MOJITO leverages Gaussian mixtures of attention-based temporal context and item embedding representations for sequential modeling. Such an approach permits to accurately predict which items should be recommended next to users depending on past actions and the temporal context. We demonstrate the relevance of our approach, by empirically outperforming existing Transformers for sequential recommendation on several real-world datasets.

SIGIR

A Scalable Framework for Automatic Playlist Continuation on Music Streaming Services

W. Bendada, G. Salha-Galvan, T. Bouabça, T. Cazenave
46th International ACM SIGIR Conf. on Research and Development in Information Retrieval (SIGIR 2023)

Abstract PDF Code

Music streaming services often aim to recommend songs for users to extend the playlists they have created on these services. However, extending playlists while preserving their musical characteristics and matching user preferences remains a challenging task, commonly referred to as Automatic Playlist Continuation (APC). Besides, while these services often need to select the best songs to recommend in real-time and among large catalogs with millions of candidates, recent research on APC mainly focused on models with few scalability guarantees and evaluated on relatively small datasets. In this paper, we introduce a general framework to build scalable yet effective APC models for large-scale applications. Based on a represent-then-aggregate strategy, it ensures scalability by design while remaining flexible enough to incorporate a wide range of representation learning and sequence modeling techniques, e.g., based on Transformers. We demonstrate the relevance of this framework through in-depth experimental validation on Spotify's Million Playlist Dataset (MPD), the largest public dataset for APC. We also describe how, in 2022, we successfully leveraged this framework to improve APC in production on Deezer. We report results from a large-scale online A/B test on this service, emphasizing the practical impact of our approach in such a real-world application.

2022

NeurIPS

New Frontiers in Graph Autoencoders: Joint Community Detection and Link Prediction

G. Salha-Galvan, J.F. Lutzeyer, G. Dasoulas, R. Hennequin, M. Vazirgiannis
Workshop on New Frontiers in Graph Learning
36th Conference on Neural Information Processing Systems (NeurIPS 2022)

Abstract PDF Code

Graph autoencoders (GAE) and variational graph autoencoders (VGAE) emerged as powerful methods for link prediction (LP). Their performances are less impressive on community detection (CD), where they are often outperformed by simpler alternatives such as the Louvain method. It is still unclear to what extent one can improve CD with GAE and VGAE, especially in the absence of node features. It is moreover uncertain whether one could do so while simultaneously preserving good performances on LP in a multi-task setting. In this workshop paper, summarizing results from our journal publication (Salha-Galvan et al. 2022), we show that jointly addressing these two tasks with high accuracy is possible. For this purpose, we introduce a community-preserving message passing scheme, doping our GAE and VGAE encoders by considering both the initial graph and Louvain-based prior communities when computing embedding spaces. Inspired by modularity-based clustering, we further propose novel training and optimization strategies specifically designed for joint LP and CD. We demonstrate the empirical effectiveness of our approach, referred to as Modularity-Aware GAE and VGAE, on various real-world graphs.

RecSys

Flow Moods: Recommending Music by Moods on Deezer

T. Bontempelli, B. Chapus, F. Rigaud, M. Morlon, M. Lorant, G. Salha-Galvan
16th ACM Conference on Recommender Systems (RecSys 2022)

Abstract PDF Blog Post Poster Video

The music streaming service Deezer extensively relies on its Flow algorithm, which generates personalized radio-style playlists of songs, to help users discover musical content. Nonetheless, despite promising results over the past years, Flow used to ignore the moods of users when providing recommendations. In this paper, we present Flow Moods, an improved version of Flow that addresses this limitation. Flow Moods leverages collaborative filtering, audio content analysis, and mood annotations from professional music curators to generate personalized mood-specific playlists at scale. We detail the motivations, the development, and the deployment of this system on Deezer. Since its release in 2021, Flow Moods has been recommending music by moods to millions of users every day.

PhD

Contributions to Representation Learning with Graph Autoencoders and Applications to Music Recommendation

G. Salha-Galvan
Ph.D. thesis, École Polytechnique, Institut Polytechnique de Paris

Abstract PDF Slides from the defense

Graph autoencoders (GAE) and variational graph autoencoders (VGAE) have emerged as two powerful groups of unsupervised node embedding methods, with various applications to graph-based machine learning problems such as link prediction and community detection. Nonetheless, at the beginning of this Ph.D. project, GAE and VGAE models were also suffering from key limitations, preventing them from being adopted in the industry. In this thesis, we present several contributions to improve these models, with the general aim of facilitating their use to address industrial-level problems involving graph representations.Firstly, we propose two strategies to overcome the scalability issues of previous GAE and VGAE models, permitting to effectively train these models on large graphs with millions of nodes and edges. These strategies leverage graph degeneracy and stochastic subgraph decoding techniques, respectively. Besides, we introduce Gravity-Inspired GAE and VGAE, providing the first extensions of these models for directed graphs, that are ubiquitous in industrial applications. We also consider extensions of GAE and VGAE models for dynamic graphs. Furthermore, we argue that GAE and VGAE models are often unnecessarily complex, and we propose to simplify them by leveraging linear encoders. Lastly, we introduce Modularity-Aware GAE and VGAE to improve community detection on graphs, while jointly preserving good performances on link prediction.In the last part of this thesis, we evaluate our methods on several graphsextracted from the music streaming service Deezer. We put the emphasis on graph-based music recommendation problems. In particular, we show that our methods can improve the detection of communities of similar musical items to recommend to users, that they can effectively rank similar artists in a cold start setting, and that they permit modeling the music genre perception across cultures. At the end of this thesis, we present two additional models, recently deployed in production on the Deezer service to recommend music to millions of users. While being less directly linked to GAE and VGAE models, they provide a complementary perspective on music recommendation topics related to the ones we previously studied.

2021

NN

Modularity-Aware Graph Autoencoders for Joint Community Detection and Link Prediction

G. Salha-Galvan, J.F. Lutzeyer, G. Dasoulas, R. Hennequin, M. Vazirgiannis
Neural Networks 153, 474 - 495, Elsevier
(2021 impact factor: 9.657)

Abstract PDF Code

Graph autoencoders (GAE) and variational graph autoencoders (VGAE) emerged as powerful methods for link prediction. Their performances are less impressive on community detection problems where, according to recent and concurring experimental evaluations, they are often outperformed by simpler alternatives such as the Louvain method. It is currently still unclear to which extent one can improve community detection with GAE and VGAE, especially in the absence of node features. It is moreover uncertain whether one could do so while simultaneously preserving good performances on link prediction. In this paper, we show that jointly addressing these two tasks with high accuracy is possible. For this purpose, we introduce and theoretically study a community-preserving message passing scheme, doping our GAE and VGAE encoders by considering both the initial graph structure and modularity-based prior communities when computing embedding spaces. We also propose novel training and optimization strategies, including the introduction of a modularity-inspired regularizer complementing the existing reconstruction losses for joint link prediction and community detection. We demonstrate the empirical effectiveness of our approach, referred to as Modularity-Aware GAE and VGAE, through in-depth experimental validation on various real-world graphs.

RecSys

Cold Start Similar Artists Ranking with Gravity-Inspired Graph Autoencoders

G. Salha-Galvan, R. Hennequin, B. Chapus, V.A. Tran, M. Vazirgiannis
15th ACM Conference on Recommender Systems (RecSys 2021)
Best student paper honorable mention

Abstract PDF Code Data Video

On an artist's profile page, music streaming services frequently recommend a ranked list of "similar artists" that fans also liked. However, implementing such a feature is challenging for new artists, for which usage data on the service (e.g. streams or likes) is not yet available. In this paper, we model this cold start similar artists ranking problem as a link prediction task in a directed and attributed graph, connecting artists to their top-k most similar neighbors and incorporating side musical information. Then, we leverage a graph autoencoder architecture to learn node embedding representations from this graph, and to automatically rank the top-k most similar neighbors of new artists using a gravity-inspired mechanism. We empirically show the flexibility and the effectiveness of our framework, by addressing a real-world cold start similar artists ranking problem on a global music streaming service. Along with this paper, we also publicly release our source code as well as the industrial graph data from our experiments.

RecSys

Hierarchical Latent Relation Modeling for Collaborative Metric Learning

V.A. Tran, G. Salha-Galvan, R. Hennequin, M. Moussallam
15th ACM Conference on Recommender Systems (RecSys 2021)

Abstract PDF Code Video

Collaborative Metric Learning (CML) recently emerged as a powerful paradigm for recommendation based on implicit feedback collaborative filtering. However, standard CML methods learn fixed user and item representations, which fails to capture the complex interests of users. Existing extensions of CML also either ignore the heterogeneity of user-item relations, i.e. that a user can simultaneously like very different items, or the latent item-item relations, i.e. that a user's preference for an item depends, not only on its intrinsic characteristics, but also on items they previously interacted with. In this paper, we present a hierarchical CML model that jointly captures latent user-item and item-item relations from implicit data. Our approach is inspired by translation mechanisms from knowledge graph embedding and leverages memory-based attention networks. We empirically show the relevance of this joint relational modeling, by outperforming existing CML models on recommendation tasks on several real-world datasets. Our experiments also emphasize the limits of current CML relational models on very sparse datasets.

KDD

A Semi-Personalized System for User Cold Start Recommendation on Music Streaming Apps

L. Briand, G. Salha-Galvan, W. Bendada, M. Morlon, V.A. Tran
27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)

Abstract PDF Code Data Blog Post Video

Music streaming services heavily rely on recommender systems to improve their users' experience, by helping them navigate through a large musical catalog and discover new songs, albums or artists. However, recommending relevant and personalized content to new users, with few to no interactions with the catalog, is challenging. This is commonly referred to as the user cold start problem. In this applied paper, we present the system recently deployed on the music streaming service Deezer to address this problem. The solution leverages a semi-personalized recommendation strategy, based on a deep neural network architecture and on a clustering of users from heterogeneous sources of information. We extensively show the practical impact of this system and its effectiveness at predicting the future musical preferences of cold start users on Deezer, through both offline and online large-scale experiments. Besides, we publicly release our code as well as anonymized usage data from our experiments. We hope that this release of industrial resources will benefit future research on user cold start recommendation.

TALN

Modéliser la Perception des Genres Musicaux à travers Différentes Cultures à partir de Ressources Linguistiques

E.V. Epure, G. Salha-Galvan, M. Moussallam, R. Hennequin
28ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN 2021)
(Summary paper presenting our EMNLP 2020 work)

Abstract PDF Code

We summarize our research work, presented at the EMNLP 2020 conference, on modeling the music genre perception across cultures using language-specific semantic representations.

2020

NN

FastGAE: Scalable Graph Autoencoders with Stochastic Subgraph Decoding

G. Salha-Galvan, R. Hennequin, J.B. Remy, M. Moussallam, M. Vazirgiannis
Neural Networks 142, 1 - 19, Elsevier
(2020 impact factor: 8.05)

Abstract PDF Code

Graph autoencoders (AE) and variational autoencoders (VAE) are powerful node embedding methods, but suffer from scalability issues. In this paper, we introduce FastGAE, a general framework to scale graph AE and VAE to large graphs with millions of nodes and edges. Our strategy, based on an effective stochastic subgraph decoding scheme, significantly speeds up the training of graph AE and VAE while preserving or even improving performances. We demonstrate the effectiveness of FastGAE on various real-world graphs, outperforming the few existing approaches to scale graph AE and VAE by a wide margin.

EMNLP

Modeling the Music Genre Perception across Language-Bound Cultures

E.V. Epure, G. Salha-Galvan, M. Moussallam, R. Hennequin
2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)

Abstract PDF Code Data

The music genre perception expressed through human annotations of artists or albums varies significantly across language-bound cultures. These variations cannot be modeled as mere translations since we also need to account for cultural differences in the music genre perception. In this work, we study the feasibility of obtaining relevant cross-lingual, culture-specific music genre annotations based only on language-specific semantic representations, namely distributed concept embeddings and ontologies. Our study, focused on six languages, shows that unsupervised cross-lingual music genre annotation is feasible with high accuracy, especially when combining both types of representations. This approach of studying music genres is the most extensive to date and has many implications in musicology and music information retrieval. Besides, we introduce a new, domain-dependent cross-lingual corpus to benchmark state of the art multilingual pre-trained embedding models.

RecSys

Carousel Personalization in Music Streaming Apps with Contextual Bandits

W. Bendada, G. Salha-Galvan, T. Bontempelli
14th ACM Conference on Recommender Systems (RecSys 2020)
Best short paper honorable mention

Abstract PDF Code Data Video

Media services providers, such as music streaming platforms, frequently leverage swipeable carousels to recommend personalized content to their users. However, selecting the most relevant items (albums, artists, playlists...) to display in these carousels is a challenging task, as items are numerous and as users have different preferences. In this paper, we model carousel personalization as a contextual multi-armed bandit problem with multiple plays, cascade-based updates and delayed batch feedback. We empirically show the effectiveness of our framework at capturing characteristics of real-world carousels by addressing a large-scale playlist recommendation task on a global music streaming mobile app. Along with this paper, we publicly release industrial data from our experiments, as well as an open-source environment to simulate comparable carousel personalization learning problems.

ECML

Simple and Effective Graph Autoencoders with One-Hop Linear Models

G. Salha-Galvan, R. Hennequin, M. Vazirgiannis
2020 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2020)

Abstract PDF Code

Over the last few years, graph autoencoders (AE) and variational autoencoders (VAE) emerged as powerful node embedding methods, with promising performances on challenging tasks such as link prediction and node clustering. Graph AE, VAE and most of their extensions rely on multi-layer graph convolutional networks (GCN) encoders to learn vector space representations of nodes. In this paper, we show that GCN encoders are actually unnecessarily complex for many applications. We propose to replace them by significantly simpler and more interpretable linear models w.r.t. the direct neighborhood (one-hop) adjacency matrix of the graph, involving fewer operations, fewer parameters and no activation function. For the two aforementioned tasks, we show that this simpler approach consistently reaches competitive performances w.r.t. GCN-based graph AE and VAE for numerous real-world graphs, including all benchmark datasets commonly used to evaluate graph AE and VAE. Based on these results, we also question the relevance of repeatedly using these datasets to compare complex graph AE and VAE.

ISMIR

Multilingual Music Genre Embeddings for Effective Cross-Lingual Music Item Annotation

E.V. Epure, G. Salha-Galvan, R. Hennequin
21st International Society for Music Information Retrieval Conference (ISMIR 2020)

Abstract PDF Code Data

Annotating music items with music genres is crucial for music recommendation and information retrieval, yet challenging given that music genres are subjective concepts. Recently, in order to explicitly consider this subjectivity, the annotation of music items was modeled as a translation task: predict for a music item its music genres within a target vocabulary or taxonomy (tag system) from a set of music genre tags originating from other tag systems. However, without a parallel corpus, previous solutions could not handle tag systems in other languages, being limited to the English-language only. Here, by learning multilingual music genre embeddings, we enable cross-lingual music genre translation without relying on a parallel corpus. First, we apply compositionality functions on pre-trained word embeddings to represent multi-word this http URL, we adapt the tag representations to the music domain by leveraging multilingual music genres graphs with a modified retrofitting algorithm. Experiments show that our method: 1) is effective in translating music genres across tag systems in multiple languages (English, French and Spanish); 2) outperforms the previous baseline in an English-language multi-source translation task. We publicly release the new multilingual data and code.

TALN

Muzeeglot : Annotation Multilingue et Multi-Sources d’Entités Musicales à partir de Représentations de Genres Musicaux

E.V. Epure, G. Salha-Galvan, F. Voituret, M. Baranes, R. Hennequin
27ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN 2020)

Abstract PDF (in French) Slides (in English) Code

In this demonstration, we present Muzeeglot, a web interface providing a visualization of multisources and multilingual music genres embedding spaces. We demonstrate the ability of our system to automatically infer the genres annotations of a music entity (track, artist, album...) according to some source or language, based on annotations from different sources or languages.

2019

NeurIPS

Keep It Simple: Graph Autoencoders Without Graph Convolutional Networks

G. Salha-Galvan, R. Hennequin, M. Vazirgiannis
Workshop on Graph Representation Learning
33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

Abstract PDF Code

Graph autoencoders (AE) and variational autoencoders (VAE) recently emerged as powerful node embedding methods, with promising performances on challenging tasks such as link prediction and node clustering. Graph AE, VAE and most of their extensions rely on graph convolutional networks (GCN) to learn vector space representations of nodes. In this paper, we propose to replace the GCN encoder by a simple linear model w.r.t. the adjacency matrix of the graph. For the two aforementioned tasks, we empirically show that this approach consistently reaches competitive performances w.r.t. GCN-based models for numerous real-world graphs, including the widely used Cora, Citeseer and Pubmed citation networks that became the de facto benchmark datasets for evaluating graph AE and VAE. This result questions the relevance of repeatedly using these three datasets to compare complex graph AE and VAE models. It also emphasizes the effectiveness of simple node encoding schemes for many real-world applications.

CIKM

Gravity-Inspired Graph Autoencoders for Directed Link Prediction

G. Salha-Galvan, S. Limnios, R. Hennequin, V.A. Tran, M. Vazirgiannis
28th ACM International Conference on Information and Knowledge Management (CIKM 2019)

Abstract PDF Code

Graph autoencoders (AE) and variational autoencoders (VAE) recently emerged as powerful node embedding methods. In particular, graph AE and VAE were successfully leveraged to tackle the challenging link prediction problem, aiming to figure out whether some pairs of nodes from a graph are connected by unobserved edges. However, these models focus on undirected graphs and therefore ignore the potential direction of the link, which is limiting for numerous real-life applications. In this paper, we extend the graph AE and VAE frameworks to address link prediction in directed graphs. We present a new gravity-inspired decoder scheme that can effectively reconstruct directed graphs from a node embedding. We empirically evaluate our method on three different directed link prediction tasks, for which standard graph AE and VAE perform poorly. We achieve competitive results on three real-world graphs, outperforming several popular baselines.

IJCAI

A Degeneracy Framework for Scalable Graph Autoencoders

G. Salha-Galvan, R. Hennequin, V.A. Tran, M. Vazirgiannis
28th International Joint Conference on Artificial Intelligence (IJCAI 2019)

Abstract PDF Code

In this paper, we present a general framework to scale graph autoencoders (AE) and graph variational autoencoders (VAE). This framework leverages graph degeneracy concepts to train models only from a dense subset of nodes instead of using the entire graph. Together with a simple yet effective propagation mechanism, our approach significantly improves scalability and training speed while preserving performance. We evaluate and discuss our method on several variants of existing graph AE and VAE, providing the first application of these models to large graphs with up to millions of nodes and edges. We achieve empirically competitive results w.r.t. several popular scalable node embedding methods, which emphasizes the relevance of pursuing further research towards more scalable graph AE and VAE.

2018

ASONAM

Adaptive Submodular Influence Maximization with Myopic Feedback

G. Salha-Galvan, N. Tziortziotis, M. Vazirgiannis
2018 IEEE/ACM International Conf. on Advances in Social Networks Analysis and Mining (ASONAM 2018)

Abstract PDF

This paper examines the problem of adaptive influence maximization in social networks. As adaptive decision making is a time-critical task, a realistic feedback model has been considered, called myopic. In this direction, we propose the myopic adaptive greedy policy that is guaranteed to provide a (1 - 1/e)-approximation of the optimal policy under a variant of the independent cascade diffusion model. This strategy maximizes an alternative utility function that has been proven to be adaptive monotone and adaptive submodular. The proposed utility function considers the cumulative number of active nodes through the time, instead of the total number of the active nodes at the end of the diffusion. Our empirical analysis on real-world social networks reveals the benefits of the proposed myopic strategy, validating our theoretical results.

Publications

See me also on Google Scholar and ORCID.org.

2025

Exploring Large Action Sets with Hyperspherical Embeddings using von Mises-Fisher Sampling

EARL: The 2nd Workshop on Evaluating and Applying Recommender Systems with Large Language Models

MusicSem: A Dataset of Music Descriptions on Reddit Capturing Musical Semantics

von Mises-Fisher Sampling of GloVe Vectors

To Share or Not to Share: Investigating Weight Sharing in Variational Graph Autoencoders

2024

Transformers Meet ACT-R: Repeat-Aware and Sequential Listening Session Recommendation

Do Recommender Systems Promote Local Music? A Reproducibility Study Using Music Streaming Data

Let's Get It Started (Again!): Fostering the Discoverability of New Releases on Deezer

vMF-exp: von Mises-Fisher Exploration of Large Action Sets with Hyperspherical Embeddings

Let's Get It Started: Fostering the Discoverability of New Releases on Deezer

2023

Track Mix Generation on Music Streaming Services using Transformers

On the Consistency of Average Embeddings for Item Recommendation

Attention Mixtures for Time-Aware Sequential Recommendation

A Scalable Framework for Automatic Playlist Continuation on Music Streaming Services

2022

New Frontiers in Graph Autoencoders: Joint Community Detection and Link Prediction

Flow Moods: Recommending Music by Moods on Deezer

Contributions to Representation Learning with Graph Autoencoders and Applications to Music Recommendation

2021

Modularity-Aware Graph Autoencoders for Joint Community Detection and Link Prediction

Cold Start Similar Artists Ranking with Gravity-Inspired Graph Autoencoders

Hierarchical Latent Relation Modeling for Collaborative Metric Learning

A Semi-Personalized System for User Cold Start Recommendation on Music Streaming Apps

Modéliser la Perception des Genres Musicaux à travers Différentes Cultures à partir de Ressources Linguistiques

2020

FastGAE: Scalable Graph Autoencoders with Stochastic Subgraph Decoding

Modeling the Music Genre Perception across Language-Bound Cultures

Carousel Personalization in Music Streaming Apps with Contextual Bandits

Simple and Effective Graph Autoencoders with One-Hop Linear Models

Multilingual Music Genre Embeddings for Effective Cross-Lingual Music Item Annotation

Muzeeglot : Annotation Multilingue et Multi-Sources d’Entités Musicales à partir de Représentations de Genres Musicaux

2019

Keep It Simple: Graph Autoencoders Without Graph Convolutional Networks

Gravity-Inspired Graph Autoencoders for Directed Link Prediction

A Degeneracy Framework for Scalable Graph Autoencoders

2018

Adaptive Submodular Influence Maximization with Myopic Feedback