jeudi 5 février 2015

How to copy a website content with Apache Nutch?

I want to copy the data of the URL's which are listed by a web crawler.My web crawler searches for a single word, i want to search for a string and the crawler should search for the full string and it should search for similar contents (more over like a search engine). Is it possible to do the same with apache nuthc and solr??





Aucun commentaire:

Enregistrer un commentaire