Having worked in an ATCC environment where we ended up supporting our antediluvian version of the operating system we were using, I have sympathy. It is most probable that this failure in old software was caused by a timing fault caused by preemption, which will be close to impossible to replicate, rather than a normal program 'bug'. Interesting days