Automatically extracting a page's main text with Python

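The script uses the BeautifulSoup library to collect the p tags in a page, gives each p a weighted score based on the number of commas it contains (full-width "," and ASCII ","), and removes the nodes that are not useful. It works under BeautifulSoup 3 but has some problems under BeautifulSoup 4. [[attachment:readability.py]]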
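The comma-scoring idea can be sketched roughly as follows. This is not the attached readability.py: it is a minimal illustration that swaps BeautifulSoup for the standard-library `html.parser` so that it runs without extra dependencies. The names `ParagraphCollector`, `comma_score`, `extract_main_text` and the `min_score` threshold are hypothetical, and dropping every paragraph that scores below the threshold is an assumed simplification of "removing useless nodes".

```python
from html.parser import HTMLParser


class ParagraphCollector(HTMLParser):
    """Collect the text content of every <p> element (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.paragraphs = []   # finished paragraph texts
        self._buffer = None    # text pieces of the <p> currently open, if any

    def _flush(self):
        # Store the paragraph being built, skipping empty ones.
        if self._buffer is not None:
            text = "".join(self._buffer).strip()
            if text:
                self.paragraphs.append(text)
            self._buffer = None

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._flush()      # an unclosed <p> ends where the next one starts
            self._buffer = []

    def handle_endtag(self, tag):
        if tag == "p":
            self._flush()

    def handle_data(self, data):
        if self._buffer is not None:
            self._buffer.append(data)

    def close(self):
        super().close()
        self._flush()


def comma_score(text):
    """Score a paragraph by its count of ASCII and full-width commas."""
    return text.count(",") + text.count("\uff0c")


def extract_main_text(html, min_score=1):
    """Keep only paragraphs whose comma score reaches min_score (assumed rule)."""
    parser = ParagraphCollector()
    parser.feed(html)
    parser.close()
    kept = [p for p in parser.paragraphs if comma_score(p) >= min_score]
    return "\n".join(kept)
```

Calling `extract_main_text(page_html)` on a page's HTML string returns the surviving paragraphs joined by newlines. Short navigation or footer paragraphs usually contain no commas, so they are the ones this heuristic tends to drop; raising `min_score` makes the filter stricter.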
