一个很简单的版本,以后会做进一步的修改:多线程,从文件中读取,跟据Head头判断等等.
需要提供一个网站和不存在页面的错误提示
CODE:
#!/usr/bin/env python # -*- coding:utf-8 -*- import urllib import urlparse import HTMLParser class CheckHTML(HTMLParser.HTMLParser): available = True def check_data(self,data,ErrorSym): error = ErrorSym if error in data: self.available = False target = raw_input("Please enter your target:") errorsyb = raw_input("Enter the errorsyb:(eg:404 Error)") check_list = ['admin','inc','data','include','count'] for li in check_list: new_urls = urlparse.urljoin("http://"+target+"/",li) d = urllib.urlopen(new_urls) data = d.read() check = CheckHTML() check.check_data(data,errorsyb) if check.available: print "%s------------->Found!"%new_urls else: print "%s------------->Not Found!"%new_urls
--------------------------------------------------------------------------------------------
The Result maybe like this :
Please enter your target:www.fuck.com Enter the errorsyb:(eg:404 Error)HTTP 错误 404 http://www.fuck.com/admin------------->Found! http://www.fuck.com/inc------------->Not Found! http://www.fuck.com/data------------->Found! http://www.fuck.com/include------------->Found! http://www.fuck.com/count------------->Found!