Wasserstein wormhole: Scalable optimal transport distance with transformer Conference Paper


Authors: Haviv, D.; Kunes, R. Z.; Dougherty, T.; Burdziak, C.; Nawy, T.; Gilbert, A.; Pe'Er, D.
Title: Wasserstein wormhole: Scalable optimal transport distance with transformer
Conference Title: International Conference on Machine Learning (ICML) 2024
Abstract: Optimal transport (OT) and the related Wasserstein metric (W) are powerful and ubiquitous tools for comparing distributions. However, computing pairwise Wasserstein distances rapidly becomes intractable as cohort size grows. An attractive alternative would be to find an embedding space in which pairwise Euclidean distances map to OT distances, akin to standard multidimensional scaling (MDS). We present Wasserstein Wormhole, a transformer-based autoencoder that embeds empirical distributions into a latent space wherein Euclidean distances approximate OT distances. Extending MDS theory, we show that our objective function implies a bound on the error incurred when embedding non-Euclidean distances. Empirically, distances between Wormhole embeddings closely match Wasserstein distances, enabling linear time computation of OT distances. Along with an encoder that maps distributions to embeddings, Wasserstein Wormhole includes a decoder that maps embeddings back to distributions, allowing for operations in the embedding space to generalize to OT spaces, such as Wasserstein barycenter estimation and OT interpolation. By lending scalability and interpretability to OT approaches, Wasserstein Wormhole unlocks new avenues for data analysis in the fields of computational geometry and single-cell biology. Software is available at http://wassersteinwormhole.readthedocs.io/en/latest/. Copyright 2024 by the author(s)
Keywords: cell culture; multi-dimensional scaling; convergence of numerical methods; computational geometry; distance map; euclidean distance; empirical distributions; higher order statistics; wasserstein distance; optimal transport; embeddings; auto encoders; wasserstein metric; digital arithmetic; transport distances
Journal Title Proceedings of Machine Learning Research
Volume: 235
Conference Dates: 2024 Jul 21-27
Conference Location: Vienna, Austria
ISBN: 2640-3498
Publisher: Journal Machine Learning Research  
Date Published: 2024-01-01
Start Page: 17697
End Page: 17718
Language: English
PROVIDER: scopus
DOI/URL:
Notes: Conference paper -- Source: Scopus