Python Script 2 : Crawling all emails from a website

This is the second article in the series of Python scripts. In this article, we will see how to crawl all the pages of a website and collect every email address found on them.

Important: Please note that some sites may not want you to crawl them. Please honour their robots.txt file; ignoring it may, in some cases, lead to legal action.
This article is for educational purposes only. Readers are requested not to misuse it.
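As a minimal sketch of how a script could honour those rules, the snippet below uses Python's standard `urllib.robotparser` module to ask whether a URL may be fetched. The helper name `is_allowed`, the user agent string `email-crawler`, and the sample rules are assumptions made up for illustration; they are not taken from the original script.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt, user_agent, url):
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() works on the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical rules, used only for illustration.
EXAMPLE_RULES = """\
User-agent: *
Disallow: /private/
"""

print(is_allowed(EXAMPLE_RULES, "email-crawler", "https://example.com/contact"))
print(is_allowed(EXAMPLE_RULES, "email-crawler", "https://example.com/private/staff"))
```

In a real crawler you would download the site's own robots.txt first (for example with `RobotFileParser.set_url()` followed by `read()`) and skip every URL for which the check returns False.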

Instead of explaining the code separately, I have embedded comments in the source code and tried to explain each part wherever I felt it was needed. Please leave a comment if you have any questions.

You might need to install some packages, such as requests and BeautifulSoup (published on PyPI as beautifulsoup4), for this script to work. It is recommended that you create a virtual environment and install the packages in it.
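To give an idea of how these two packages fit together, here is a minimal sketch of such a crawler. It is one possible approach under stated assumptions, not necessarily the exact script this article describes: the function names, the `max_pages` limit, the same-host restriction, and the simple email pattern are all my own choices for illustration.

```python
import re
from collections import deque
from urllib.parse import urldefrag, urljoin, urlparse

import requests
from bs4 import BeautifulSoup

# Deliberately simple pattern (an assumption): it will miss some valid
# addresses and may match a few strings that are not real addresses.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def fetch_html(url):
    """Download a page and return its HTML, or '' on any request error."""
    try:
        response = requests.get(url, timeout=10)
        response.raise_for_status()
    except requests.RequestException:
        return ""
    return response.text


def extract_emails(html):
    """Return the set of email-like strings found in the raw HTML."""
    return set(EMAIL_RE.findall(html))


def extract_links(html, base_url):
    """Return absolute, fragment-free URLs on the same host as base_url."""
    host = urlparse(base_url).netloc
    links = set()
    for anchor in BeautifulSoup(html, "html.parser").find_all("a", href=True):
        # Resolve relative links and drop any '#fragment' part.
        absolute = urldefrag(urljoin(base_url, anchor["href"])).url
        parsed = urlparse(absolute)
        if parsed.scheme in ("http", "https") and parsed.netloc == host:
            links.add(absolute)
    return links


def crawl_emails(start_url, max_pages=50, fetch=fetch_html):
    """Breadth-first crawl from start_url, collecting emails from each page."""
    queue = deque([start_url])
    seen = {start_url}
    emails = set()
    pages_fetched = 0
    # Stop when there is nothing left to visit or the page limit is reached.
    while queue and pages_fetched < max_pages:
        url = queue.popleft()
        html = fetch(url)
        pages_fetched += 1
        emails |= extract_emails(html)
        for link in extract_links(html, url):
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return emails
```

Calling `crawl_emails("https://example.com/")` would return a set of the addresses found on up to 50 pages of that host. The `fetch` parameter exists so the download step can be swapped out, for instance to add a robots.txt check or a delay between requests.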



