Protocol used by nutch -
can please tell me protocol used nutch fetching pages. wanted check kind of request nutch makes ?
i used charles proxy see request information sadly nothing obtained there. missing charles proxy or nutch ??
i have tried wireshark there cam many packets , not identify 1 of nutch ?
please help..
nutch web crawler, guess using http protocol. http get fetch pages.
if need more information (e. g. user agend of nutch) consider setting apache web server on machine , crawl test pages. have @ apache access log.
Comments
Post a Comment