Why do you run the ablation studies only on a subset of the SPair-71k test set? Is it due to compute? In my experiments, running SC on the full SPair-71k test split takes less than 10 min on a single 3090 🤔