Software that checks if websites are up and running

Adivertising

Regular Member
Joined
Aug 25, 2019
Messages
216
Reaction score
165
I have about 3 million business data with their website in a column and want to filter out the domains that are offline, and preferably if they haven't been updated in x amount of years. Any ides how to achieve that?
 
Python is the answer

Use pandas lib to deal with data
Use requests to see if the website is returning 200 response

Then you are good
 
I have about 3 million business data with their website in a column and want to filter out the domains that are offline, and preferably if they haven't been updated in x amount of years. Any ides how to achieve that?
Make http calls, and see the last modification header, if returned (catch the exceptions with a retry count and add them to offline domains after a few tries). This is the most straightforward way I can think of, but there could be more nuances.
 

courses_retailer , tazarbm , BlurryBit


Thank you for your replies. I ended up doing this using Screaming Frog. Plain and simple response codes and export filtered list. Broken links are a gold mine for getting people who already started a business and failed, to ignite their will of starting a new business and buying my services (web dev)
 
Back
Top