Ok so sorry about the late reply, it's been a long day at work..... First of all i want to say thank you for writing such a detailed response on your observation. It is very much appreciated I must say this example you have given me is EXACTLY what i was going for. If i'm looking at this right, this worked because the deep blue background and the hearts are both 3D layers and therefore are connected to each other. The hearts being set to constraint also played a huge difference, if I'm seeing this right, and that's what you meant by it being a "non uniform scale"? Rather than actually zooming into the hearts with my previous example the size of the hearts are changing.
Also the concept i displayed is a little misleading, yes the hearts will be in sync with the beat however the video will not focus on them as heavily as i've displayed and will be more 2D than 3D. This video is primarily going to be focused on kinetic typography with movie clips to express the song.