lmcinnes, The landscape of the Machine Learning section of ArXiv.
This was the result of a side-project to build tools to automate the generation of such plots, from label placement, to palette and aesthetics.
Dataset was from https://huggingface.co/datasets/CShorten/ML-ArXiv-Papers encoded with sentence-transformers and mapped with UMAP. Clustering by fast_hdbscan.