Notes

Untitled

Graph Representation

Screenshot 2023-05-20 at 15.32.25.png

Screenshot 2023-05-20 at 15.32.33.png

Denoising Network

Screenshot 2023-05-20 at 15.34.54.png

Text-Conditioned

  1. employ a pretrained BERT encoder to extract word embedings
  2. inject the language guidance into denoising network using cross attention layers