Processing math: 100%

Gradual Domain Adaptation via Manifold-Constrained Distributionally Robust Optimization

Part of Advances in Neural Information Processing Systems 37 (NeurIPS 2024) Main Conference Track

Bibtex Paper Supplemental

Authors

Amirhossein Saberi, Amir Najafi, Amin Behjati, Ala Emrani, Yasaman Zolfit, Mahdi Shadrooy, Abolfazl Motahari, Babak H. Khalaj

Abstract

The aim of this paper is to address the challenge of gradual domain adaptation within a class of manifold-constrained data distributions. In particular, we consider a sequence of T2 data distributions P1,,PT undergoing a gradual shift, where each pair of consecutive measures Pi,Pi+1 are close to each other in Wasserstein distance. We have a supervised dataset of size n sampled from P0, while for the subsequent distributions in the sequence, only unlabeled i.i.d. samples are available. Moreover, we assume that all distributions exhibit a known favorable attribute, such as (but not limited to) having intra-class soft/hard margins. In this context, we propose a methodology rooted in Distributionally Robust Optimization (DRO) with an adaptive Wasserstein radius. We theoretically show that this method guarantees the classification error across all Pis can be suitably bounded. Our bounds rely on a newly introduced {\it {compatibility}} measure, which fully characterizes the error propagation dynamics along the sequence. Specifically, for inadequately constrained distributions, the error can exponentially escalate as we progress through the gradual shifts. Conversely, for appropriately constrained distributions, the error can be demonstrated to be linear or even entirely eradicated. We have substantiated our theoretical findings through several experimental results.