Constrained Data Fusion sits behind many of the research industry's top measurement products. Its ability to preserve the distribution of two datasets, while also merging the datasets in a systematic way, is a compelling proposition, guaranteeing the face validity of fused dataset.
The aim of data integration tools like these is to ensure the best possible matches for all donors and recipients. Constrained techniques will typically include all the cases from both datasets regardless of how similar the datasets are. This objectives are however, not necessarily compatible...
Preserving unweighted or weighted distributions of both datasets is, however, not desirable when those distributions are very different from each other. To this end, Constrained Data Fusion will often fail if the datasets are too dissimilar. Such Constrained Data Fusion failures demonstrate a hard truth - one of the pre-requisites for Data Fusion is that the datasets must be like-for-like (or nearly) to begin with.
Ensuring that the datasets you wish to fuse are sampled the same, the data collected the same way, at the same time is and under the same conditions is clearly not realistic. While the datasets don't have to match identically, it is 'very strongly preferred' that they are comprised from the identical population of individuals, even if the data collection methodologies differ. If this is not the case, an unconstrained approaches will prove safer.