It's been the subject of a lot of SFTE and SETP papers in recent years.
The general approach seems to be of testing primarily fitness for purpose, with a small margin above the flight envelope structurally - rather smaller reserve factors than would be normal for any manned aircraft, and flying qualities are very much only tested in the middle of the narrow operational envelope.
Not been involved myself, but the papers are easy to look up.
G