Unless and until you can deploy some software that can carry out an intelligent trawl and recategorise the whole thing, you've done what you can.
Thanks, good points... From analysing the nature of the failed searches, I think I can get as close as I need to by brushing up on pattern matching and regular expressions and testing a few options on a local server. (The MySQL manual is helpful here.)
The data stays 'as is' though...
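For anyone wanting to try the same idea away from the live site, here is a rough sketch in Python of the kind of tolerant matching I mean. It leaves the stored text untouched and instead loosens the pattern built from the search term, so that punctuation and spacing differences are ignored. The record values and the function names (`tolerant_pattern`, `search`) are made up for illustration, and Python's `re` syntax is not identical to MySQL's REGEXP, so any pattern would need checking against the MySQL manual before use there.

```python
import re

def tolerant_pattern(query):
    """Build a case-insensitive regex from a search term that ignores
    punctuation and spacing differences. Returns None for an empty term."""
    # Keep only the letters and digits the user typed.
    chars = re.findall(r"[A-Za-z0-9]", query)
    if not chars:
        return None
    # Allow any run of non-alphanumeric characters between each character,
    # so "tshirts" can still match "T-Shirts".
    body = r"[^A-Za-z0-9]*".join(re.escape(c) for c in chars)
    return re.compile(body, re.IGNORECASE)

def search(query, rows):
    """Return the rows that match the loosened pattern; rows are not modified."""
    pattern = tolerant_pattern(query)
    if pattern is None:
        return []
    return [row for row in rows if pattern.search(row)]

# Hypothetical records, stored 'as is'.
records = ["T-Shirts (Mens)", "Tea Pots", "Hand-made soap"]

if __name__ == "__main__":
    for term in ["tshirts", "t shirts", "handmade soap", "teapot"]:
        print(term, "->", search(term, records))
```

The trade-off is that a pattern this loose can over-match on short search terms, so it is probably best run against a list of real failed searches before deciding how far to relax it.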