While I will not get into the discussion at all, or reveal the airline/which group....
From a very reliable source, in said airline, per person, one sex on average produced around 10 times more (relevant) flight data monitoring flags than the other sex. So yes, in this airline at least, there appears to be a certain sex that performs better than the other at this simplistic level. It would be interesting to perform a statistical test on this to balance out the difference in sample numbers (Chi-squared? - long time since stats).
This has nothing to do directly with the initial question, but addresses the subsequent answers.
Tom.