One can achieve audio signal mixing with a miniature transformer and no other components.
The commercial intercoms thathave "crew music" input implement a mute function so when ATC speaks the music is muted, which seems pretty well essential (for the pilot).