Tallan's Technology Blog

Tallan's Top Technologists Share Their Thoughts on Today's Technology Challenges

Forcing FAST Web Crawler to Re-Crawl Data Immediately

Overview

While attempting to set up the FAST Web Crawler for a client, I was unable to find a way to force FAST to re-crawl data immediately.  The web crawler uses an internal scheduler to schedule fetch operations.

There are a large number of options for the crawleradmin.exe command line tool that looked like they might work:

(from crawleradmin.exe reference on technet)

image

but none of them forced an immediate crawl.

Solution

I found that using

crawleradmin –d <content collection name>

to delete the data in the FAST Web Crawler content collection, then using

crawleradmin –f <crawler config .xml file>

forced an immediate recrawl.

 

Note: Deleting the data from the FAST Web Crawler content collection DOES NOT delete all of the data from the FAST content collection with the same name.  Any data crawled through your FAST Content SSA will still be intact after running crawleradmin –d.

Related Articles

No comments

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

\\\