This can’t be done with most of the ‘live crawl’ or url fetching tools since those mostly fetch precisely one page to examine, and won’t necessarily be able to scan that whole page if it is lengthy or complex.
Refering to the following discussion thread explains many of the same concerns and limitations that will apply to your usage: