Python Script 2 : Crawling all emails from a website

emails crawl python

This is the second article in the series of python scripts. In this article we will see how to crawl all pages of a website and fetch all the emails.

Important: Please note that some sites may not want you to crawl their site. Please honour their robot.txt file. In some cases it may lead to legal action. 
This article is only for educational purpose. Readers are requested not to misuse it. 

Instead of explaining the code separately, I have embedded the comments over the source code lines. I have tried to explain the code wherever I felt the requirement. Please comment in case of any query.

You might need to install some packages like requests  and BeautifulSoup  for this script to work. It is recommended that you create a virtual environment and install packages in it.

 

Constructive feedback is always welcomed.

Like our facebook page.

Python script to convert ebooks to kindle format.

 

(Visited 662 times, 1 visits today)

You must read this :

2 thoughts on “Python Script 2 : Crawling all emails from a website”

Leave a Reply

Your email address will not be published. Required fields are marked *