I'd like to set up a local mirror of certain large databases like the nt BLAST database, interpro etc.
The biomirror project looks like a good candidate, but they seem to advocate using GridFTP, and have even deprecated rsync. I would have thought a simpler solution would be something hacked together with cron and rsync, or am I missing something?
So, my question is: What solutions have you used for mirroring large biological databases, and what mistakes should I avoid making?