fess: Default crawler does not run all Web Crawlers

I have around 5K web crawlers configured with max_access = 32. When I start the default crawler, it only seems to process a portion of them, maybe 100, and then stops. The only suspicious thing I see in the logs is "Future got interrupted". Otherwise everything looks fine, but it doesn't even touch most of the sites.

About this issue

  • State: closed
  • Created 5 years ago
  • Comments: 58 (18 by maintainers)

Most upvoted comments

Change the following value in fess_config.properties:

page.web.config.max.fetch.size=100
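
The value of 100 shown above lines up with the roughly 100 crawlers that actually ran, which suggests this property limits how many web crawl configurations are picked up per run. A minimal sketch of the change, assuming the limit should sit above the number of configured web crawlers (about 5K in this report); the figure 10000 is only an illustrative choice, not a value given in the thread:

```properties
# fess_config.properties
# Assumption: this setting limits how many web crawl configs are loaded per run.
# 10000 is an illustrative value chosen to exceed the ~5K configured crawlers.
page.web.config.max.fetch.size=10000
```

Fess will most likely need a restart before the new value takes effect.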