I seem to recall that in Sonar, at least, there was a way to take two tracks and render them to a third track, omitting all information that wasn't present in both takes. So, say, you could do this with two stereo channels, and you could sometimes remove the vocals this way.
Not sure if this would really help in this situation, but it can be a cool thing anyway. I'm a little lost as to what exactly the situation is here with the two tracks and subtracting.
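
For what it's worth, here is a rough sketch of the channel-subtraction trick I have in mind. This is not Sonar's actual feature, just the common "invert one channel and mix" approach, which cancels whatever is identical in both channels (usually a center-panned vocal) and keeps the rest. The function name `remove_center` and the toy sample values are made up for illustration, and real audio would need to be loaded from a file first.

```python
def remove_center(left, right):
    """Subtract the right channel from the left, sample by sample.

    This is the same as phase-inverting one channel and summing the two.
    Anything identical in both channels cancels out.
    """
    if len(left) != len(right):
        raise ValueError("channels must be the same length")
    return [l - r for l, r in zip(left, right)]


# Hypothetical toy data: a "vocal" panned dead center and a "guitar"
# that only exists in the left channel.
vocal = [0.5, -0.25, 0.75, 0.0]
guitar = [0.25, 0.5, -0.5, 0.125]

left = [v + g for v, g in zip(vocal, guitar)]  # vocal + guitar
right = list(vocal)                            # vocal only

# The vocal is identical in both channels, so it cancels and only
# the guitar is left over.
side = remove_center(left, right)
print(side)
```

It only works "sometimes" because anything else panned to the center (bass, kick, and so on) cancels along with the vocal, and stereo reverb on the vocal usually survives.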