一: module pools-加速开发技术
简化日期计算模块
dateutil
pip install python-dateutil==1.5
图像处理模块
PIL
JPEG;PNG;GIF;BMP
sudo pip install python-imaging
数据的加密处理模块
pycrypto
pip install pycrypto
调用twitter的API
tweepy
pip install tweepy
sina weibo API client
Envelopes发送邮件和附件
smtplib发送邮件
threadpool线程池
汉子转拼音库
python 打包软件
https://github.com/pyinstaller/pyinstaller
python timezone
from datetime import datetime
import pytz
tz = pytz.timezone('Asia/Shanghai')
t = datetime.now(tz)
cst_time = tz.fromutc(datetime.utcfromtimestamp(time.time())).strftime('%Y-%m-%d-%H-%M')
commands
commands.getstatusooutput('ls')
ConfigParser
config.ini
[mydb]
host =
port =
user =
password =
二: 爬虫相关
mechanize
与web服务器交互复杂,如get,post等,使用 mechanize(模拟登陆);BeautifulSoup提取数据
urllib_proxy
- proxy tools
- httplib
- 低成本的获取IP池,这是一个难点
- foreigin proxy pools
- proxy pools
- global pools
- proxy spider
- code见evernote
- 高匿proxy
cookie handle
br = mechanize.Browser()
cj = cookielib.LWPCookieJar()
br.set_cookiejar(cj)