lists.arthurdejong.org

webcheck: IndexError: string index out of range


Hi,

I have not been able to work around a rogue string. I tried adding -y= to
blacklist it, but perhaps I am not doing it right.

./webcheck.py -u http:/sitename.com -c --ignore-robots -y=https:///

The string that seems to trigger the error is 'https:///':

webcheck:   https:///
Traceback (most recent call last):
 File "./webcheck.py", line 268, in <module>
   main(site)
 File "./webcheck.py", line 230, in main
   site.crawl(fp) # this will take a while
 File "/webcheck-1.10.4/crawler.py", line 222, in crawl
   link.fetch()
 File "/webcheck-1.10.4/crawler.py", line 535, in fetch
   content = schememodule.fetch(self, parsers.get_mimetypes())
 File "/webcheck/schemes/http.py", line 58, in fetch
   if netloc[0] == '.':
IndexError: string index out of range
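
The failure itself can be reproduced outside webcheck. The following is only a
minimal sketch in Python 3, assuming the crash comes from indexing an empty
network location: 'https:///' has a scheme and a path but no host, so netloc is
the empty string and netloc[0] raises IndexError. The helper names
netloc_starts_with_dot and has_host are hypothetical and not part of webcheck.

```python
from urllib.parse import urlsplit


def netloc_starts_with_dot(url):
    """Return True if the host part of url begins with a dot.

    Hypothetical helper, not part of webcheck.
    """
    netloc = urlsplit(url).netloc
    # str.startswith() is safe on an empty string, unlike netloc[0]
    return netloc.startswith('.')


def has_host(url):
    """Return True if url has a non-empty network location (hypothetical helper)."""
    return bool(urlsplit(url).netloc)


if __name__ == '__main__':
    for url in ('https:///', 'http://sitename.com/', 'http://.example.com/'):
        print(url, has_host(url), netloc_starts_with_dot(url))
```

A check along these lines before the netloc[0] test would let a link with no
host be skipped or reported instead of aborting the crawl, though whether that
fits webcheck's own code is an assumption on my part.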


Any ideas on how I can fix this?

Thanks
--
To unsubscribe send an email to
webcheck-users-unsubscribe@lists.arthurdejong.org or see
http://lists.arthurdejong.org/webcheck-users