Aggregating multimodal cancer data across unaligned embedding spaces maintains tumor of origin signal
Aggregating multimodal cancer data across unaligned embedding spaces maintains tumor of origin signal
Kirchgaessner, R.; Keutler, K.; Sivakumar, L.; Song, X.; Ellrott, K.
AbstractAI based embeddings offer the possibilities of encoding complex biological data into low dimensional spaces, called embedding spaces, that maintain the relationships between entities. There is an open question about the compatibility of embedding spaces that are created without any coordination. It has been assumed that signals in these unaligned embedding spaces would be destroyed if vectors were aggregated into summed values. We trained embedding models across different data modalities and tested aggregating the values together to test this assumption. Our research shows that signal from unaligned embedded values is conserved and able to still be used for learning tasks, such as data modality and tumor of origin recognition.