Modify your root directory's .htaccess file to leverage browser cache.
I'm currently using Nutch 1.7 to crawl my domain. My issue is specific to URLs being indexed as www vs. non-www.
Behind the scenes with the Marketing Devs at Bridgepoint Education.