import re
url = '<p>Hello World</p><a href="http://example.com">More Examples</a><a href="http://example2.com">Even More Examples</a>'
urls = re.findall('https?://(?:[-w.]|(?:%[da-fA-F]{2}))+', url)
>>> print urls
['http://example.com', 'http://example2.com']
与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…