AБ
Size: a a a
AБ
A
class WaybackMiddleware(object):но на таком после пуска зависает скрейпи
def process_request(self, request, spider):
if 'web.archive.org' not in request.url:
new_url = 'http://archive.org/wayback/available?url=' + request.url
request = request.replace(url=new_url)
return request
2020-10-19 12:05:43 [scrapy.middleware] INFO: Enabled item pipelines:2020-10-19 12:05:43 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min)
['crawlers.pipelines.SourceDownloaderPipeline',
'crawlers.pipelines.SourceExampleProfilesPipeline']
2020-10-19 12:05:43 [scrapy.core.engine] INFO: Spider opened
AR
A
AR
A
AR
AR
AR
AR
AR
AR
A
AR
AR
A
A
def process_request(self, request, spider):
new_url = 'http://archive.org/wayback/available?url=' + request.url
request = request.replace(url=new_url)
return request