2 ms seems a little short to me. Acoustic guitars, for example, can easily have much longer-lasting transients.
I would prefer the marker at the transient and a highlighted (protected) area for the next x milliseconds after it.
OR - I'm dreaming here - what if you could continuously adjust the "slope" of how the audio is time-stretched between two markers, just like the interpolation shape between two envelope points? That way mostly the "rear" part of each section would be stretched - or the front part - or everything evenly in a linear fashion - just as you need. Set a default behaviour and bingo: transients saved, sustain stretched! Or change to taste.
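To make the "slope" idea a bit more concrete, here is a rough sketch of how such a curve could map a position in the stretched output back to a position in the source audio between two markers. This is only an illustration: the names `warp`, `output_to_source` and the `bias` parameter are made up for this post, and the power curve is just one possible shape, not how any existing stretch engine works.

```python
def warp(u: float, bias: float = 0.0) -> float:
    """Map a normalized output position u (0..1) to a normalized source position (0..1).

    bias = 0  -> linear, the whole section is stretched evenly
    bias > 0  -> the front is read faster, so mostly the rear gets stretched
    bias < 0  -> the rear is read faster, so mostly the front gets stretched
    (hypothetical parameter, chosen for illustration only)
    """
    u = min(max(u, 0.0), 1.0)  # clamp to the section between the two markers
    if bias > 0:
        return 1.0 - (1.0 - u) ** (1.0 + bias)
    if bias < 0:
        return u ** (1.0 - bias)
    return u


def output_to_source(t_out: float,
                     out_start: float, out_end: float,
                     src_start: float, src_end: float,
                     bias: float = 0.0) -> float:
    """Return the source time that should be heard at output time t_out."""
    u = (t_out - out_start) / (out_end - out_start)
    return src_start + warp(u, bias) * (src_end - src_start)
```

With this particular curve, a section stretched to twice its length and `bias = 1` starts out reading the source at original speed right at the marker and slows down progressively towards the next one, which is roughly the "transients saved, sustain stretched" behaviour. More generally, `bias = (stretched length / original length) - 1` gives original speed at the marker.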