Publications
For a complete list please refer to my Google Scholar profile
Multimodal Learning
[NEW] Learning to highlight audio by watching movies
C. Huang, R. Gao, J. Tsang, J. Kurcius, C. Bilen, C. Xu, A. Kumar, S. Parekh
CVPR 2025 (accepted)[NEW] Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment
J. Hong, S. Parekh, H. Chen, J. Donley, K. Tan, B. Xu, A. Kumar
ArXiv 2025Weakly supervised representation learning for audio-visual scene analysis
S. Parekh, S. Essid, A. Ozerov, N. Duong, P. Pérez, G. Richard
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2019Identify, locate and separate: Audio-visual object extraction in large video collections using weak supervision
S. Parekh, A. Ozerov, S. Essid, N. Duong, P. Pérez, G. Richard
WASPAA 2019Multiview approaches to event detection and scene analysis
S. Essid, S. Parekh, N. Duong, R. Serizel, A. Ozerov, F. Antonacci, A. Sarti
Book chapter in Computational Analysis of Sound Scenes and Events, Springer, 2018Weakly supervised representation learning for unsynchronized audio-visual events
S. Parekh, S. Essid, A. Ozerov, N. Duong, P. Pérez, G. Richard
CVPR Workshop on Sight and Sound 2018
videoGuiding audio source separation by video object information
S. Parekh, S. Essid, A. Ozerov, N. Duong, P. Pérez, G. Richard
WASPAA 2017
videoMotion informed audio source separation
S. Parekh, S. Essid, A. Ozerov, N. Duong, P. Pérez, G. Richard
ICASSP 2017
ML for Audio/Image
Tackling Interpretability in Audio Classification Networks with Non-negative Matrix Factorization
J. Parekh, S. Parekh, P. Mozharovskyi, G. Richard, F. d’Alché-Buc
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2023Listen to Interpret: Post-hoc Interpretability for Audio Networks with NMF
J. Parekh, S. Parekh, P. Mozharovskyi, F. d’Alché-Buc, G. Richard
NeurIPS 2022Emotion Transfer Using Vector-Valued Infinite Task Learning
A. Lambert*, S. Parekh*, Z. Szabó, F. d’Alché-Buc
CtrlGen Workshop, NeurIPS 2021
codeDeep pairwise classification and ranking for predicting media interestingness
J. Parekh, H. Tibrewal, S. Parekh
ACM ICMR 2018
code
Signal Processing
Improving audio retrieval through loudness profile categorization
S. Parekh, F. Font, X. Serra
IEEE ISM 2016Nyquist filter design using POCS methods: Including constraints in design
S. Parekh, P. Shah
IEEE ISSPIT 2014