Joint Hierarchical Representation Learning of Samples and Features via Informed Tree-Wasserstein Distance