Skip to content
View anonymsubicml24's full-sized avatar
Block or Report

Block or report anonymsubicml24

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
anonymsubicml24/README.md

DIRAC ICML '24

Here we will provide the source code and supplementary material for DIRAC: Diffusion-Based Representation Learning for Modality-Agnostic Compositionality.

Please, refer to our website https://anonymsubicml24.github.io/anonymsubicml24/ for listening to the audio results.

Abstract

In this paper, we target the extrapolation and out-of-distribution generation problem in generative models by introducing a generic compositional inductive bias. Leveraging state-of-the-art generative models in an encoder-decoder scheme, our approach focuses on compositional representation learning without any form of supervision. We perform experiments on image and audio data, demonstrating the adaptability of our model to different modalities and representations. Our Diffusion-based Representation Learning for Modality-Agnostic Compositionality (DIRAC), builds upon diffusion models and shows promising results in separating meaningful entities in both images and music, serving as a powerful baseline for future investigations around compositional generation and representation learning.

Images experiments

Audio experiments

Code

We will provide the code for the DIRAC model in this repository. The code will be available soon.

Popular repositories

  1. anonymsubicml24 anonymsubicml24 Public

    Source code and supplementary material for DIRAC: Diffusion-Based Representation Learning for Modality-Agnostic Compositionality