The only answer is to have an interface device of some sort (I didn't say the R word)
Something like a juiced link or Beachtek will give you XLR audio input and headphone monitoring at the mic stage.
I use a beachtek on my cameras, do a test record, play back to make sure cam is getting clean feed. Line up mic using VU's. And thats it.
I know its not the answer you want to hear, but I'm afraid that other than 'they aren't', or mentioning the R word, it's the only answer available to me.