Revision: 59790
Initial Code
Initial URL
Initial Description
Initial Title
Initial Tags
Initial Language
at October 1, 2012 21:45 by zhyar
Initial Code
import urllib, re
url = 'http://www.viedemerde.fr/aleatoire'
page = urllib.urlopen(url).read()
parse = re.findall("\<div class=\"post article\" id=\"(.+?)\">(.+?)</div", page)
for article in parse:
parse1 = re.findall("\<a href=\"(.+?)" + article[0] + "\" class=\"fmllink\">(.+?)</a>", article[1])
vdm = ''
for test in parse1:
vdm += test[1]
print("http://viedemerde.fr/"+article[0]+" : "+vdm)
Initial URL
Initial Description
Simple web parser using urllib and re libs.
Initial Title
Example of web parser
Initial Tags
python, web
Initial Language
Python