Redis keeps its data in memory.
MongoDB keeps its data on disk.
pip install scrapy-redis
easy_install scrapy-redis
Alternatively, download the package and install it manually.
To use Redis from Scrapy, configure the connection in the project's settings.py file.
Redis listens on port 6379 by default.
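As a rough illustration of that configuration, a settings.py fragment might look like the sketch below. The setting names are the ones scrapy-redis documents; the host value and the decision to persist the queue are assumptions for a single local Redis instance, not something the original post specifies.

```python
# settings.py (sketch; the host value assumes a Redis server on the same machine)

# Schedule requests through Redis so several spider processes can share one queue.
SCHEDULER = "scrapy_redis.scheduler.Scheduler"

# Deduplicate requests using a fingerprint set stored in Redis.
DUPEFILTER_CLASS = "scrapy_redis.dupefilter.RFPDupeFilter"

# Keep the queue and fingerprint set in Redis when the spider closes,
# so a crawl can be paused and resumed (assumed preference).
SCHEDULER_PERSIST = True

# Connection details; 6379 is the default Redis port.
REDIS_HOST = "localhost"
REDIS_PORT = 6379
```

If Redis runs on another machine, only REDIS_HOST (and possibly REDIS_PORT) should need to change.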
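# -*- coding: utf-8 -*-
import re

from scrapy.http import Request
from scrapy.selector import Selector
from scrapy_redis.spiders import RedisSpider

from novelspider.items import NovelspiderItem


class novSpider(RedisSpider):
    name = "novspider"
    # Redis key the spider reads its start URLs from
    # (the original read 'nvospider', apparently a typo for the spider name).
    redis_key = 'novspider:start_urls'
    start_urls = [
        'http://www.daomubiji.com/',
        # 'http://www.daomubiji.com/qi-xing-lu-wang-01.html',
    ]

    def parse(self, response):
        # Wrap the response so it can be queried with XPath.
        selector = Selector(response)
        # Select every <table> element on the page.
        table = selector.xpath('//table')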
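A RedisSpider stays idle until URLs appear under its redis_key, so something has to put a start URL into Redis. The sketch below shows one way to do that with the redis-py client. The helper names `make_client` and `seed_start_urls` are hypothetical, the host and port are assumptions for a local server, and the key passed in must be exactly the string the spider's redis_key holds.

```python
def make_client(host="localhost", port=6379):
    """Create a redis-py client (third-party package: pip install redis)."""
    import redis  # imported lazily so the helper below has no hard dependency

    return redis.Redis(host=host, port=port)


def seed_start_urls(client, redis_key, urls):
    """Push each URL onto the Redis list that the spider reads from.

    `client` can be any object with an lpush(key, value) method,
    such as the client returned by make_client().
    Returns the number of URLs pushed.
    """
    count = 0
    for url in urls:
        client.lpush(redis_key, url)
        count += 1
    return count
```

With a Redis server running, calling `seed_start_urls(make_client(), key, ['http://www.daomubiji.com/'])`, where `key` is the spider's redis_key string, should hand the waiting spider its first page. The same effect can be had from the command line with redis-cli's lpush command.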
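Copyright notice: This is an original article by Homewm, licensed under CC 4.0 BY-SA. When reposting, please include a link to the original source and this notice.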