2020 March 13

Andrey Rahmatullin in Scrapy
do you want to only rescrape failed ones?

Andrey Rahmatullin in Scrapy
or are there new 200000 URLs each day?

Anil Kumar in Scrapy
New urls each day

Anil Kumar in Scrapy
Andrey Rahmatullin
or are there new 200000 URLs each day?
Reply please

Andrey Rahmatullin in Scrapy
and do you want to run several job attempts each day?

Anil Kumar in Scrapy
No, I don't want it like that. If I run the scraper once, it should scrape all the URLs and then check the count of scraped URLs; if the count is lower than expected, it should scrape the missing URLs.
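A minimal sketch of the check described here, assuming the requested URLs and the URLs that actually produced output are both available as plain lists (for example, read from the input file and from the exported feed). The helper name `find_missing_urls` and the example URLs are hypothetical and not part of Scrapy. The comparison is by exact string, so a URL that was redirected would show up as missing.

```python
def find_missing_urls(requested, scraped):
    """Return requested URLs absent from the scraped output, in input order."""
    scraped_set = set(scraped)
    seen = set()
    missing = []
    for url in requested:
        # Skip URLs that were scraped, and report each missing URL only once.
        if url not in scraped_set and url not in seen:
            seen.add(url)
            missing.append(url)
    return missing


# Made-up data standing in for the daily URL list and the scraped results.
requested = [
    "https://example.com/a",
    "https://example.com/b",
    "https://example.com/c",
]
scraped = ["https://example.com/a", "https://example.com/c"]

missing = find_missing_urls(requested, scraped)
print(len(requested) - len(missing), "of", len(requested), "scraped; missing:", missing)
```

The resulting list could be fed to a second run, but it only says which URLs are missing, not why.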

Andrey Rahmatullin in Scrapy
that's not different from just increasing RETRY_TIMES
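For reference, a sketch of what raising the retry limit might look like in a project's `settings.py`. The setting names are standard Scrapy settings; the values are illustrative assumptions, not recommendations made in this chat.

```python
# settings.py (fragment); all values below are illustrative assumptions.
RETRY_ENABLED = True
RETRY_TIMES = 5  # retries per request on top of the first attempt (Scrapy's default is 2)
RETRY_HTTP_CODES = [500, 502, 503, 504, 408, 429]  # response statuses treated as retryable
DOWNLOAD_DELAY = 1.0  # seconds to wait between requests to the same site
CONCURRENT_REQUESTS = 16  # upper bound on requests in flight
```

Retries only help with transient failures; a request that fails the same way every time fails the same way on the last retry.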

🎱 in Scrapy
Andrey Rahmatullin
you can save the response to a file in the spider and open it in a browser
That approach helped, thanks :)
I opened it in the shell with view(response)
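The "save the response to a file and open it in a browser" approach can be sketched with the standard library alone. In the Scrapy shell, `view(response)` does this in one step, and `scrapy.utils.response.open_in_browser` is the helper usually used from spider code. The function names `save_body` and `open_saved` and the sample body below are hypothetical stand-ins; inside a spider callback the bytes would come from `response.body`.

```python
import tempfile
import webbrowser
from pathlib import Path


def save_body(body: bytes, directory: str) -> Path:
    """Write raw response bytes to an .html file and return its path."""
    path = Path(directory) / "response.html"
    path.write_bytes(body)
    return path


def open_saved(path: Path) -> bool:
    """Open a saved file in the default browser (call this manually)."""
    return webbrowser.open(path.resolve().as_uri())


# Stand-in for response.body inside a spider callback.
sample_body = b"<html><body><h1>What the spider received</h1></body></html>"
saved_path = save_body(sample_body, tempfile.mkdtemp())
print("saved to", saved_path)
```

Viewing the saved file shows the page as the spider received it, which may differ from what a browser loads after running JavaScript.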

Anil Kumar in Scrapy
Andrey Rahmatullin
that's not different from just increasing RETRY_TIMES
I don't know whether it is getting error or not

Andrey Rahmatullin in Scrapy
then the spider doesn't either

Anil Kumar in Scrapy
I don't know whether it is skipping URLs because of an error or for some other reason

Anil Kumar in Scrapy
I need the reason for missing urls

Andrey Rahmatullin in Scrapy
check the logs
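One way to check the logs for the reason behind each missing URL is to scan them for the messages the retry and HTTP-error middlewares write. This is a rough sketch: the patterns are modeled on typical Scrapy log lines, whose exact wording and log level can vary between versions, and the sample lines and the `failure_reasons` helper are made up for illustration.

```python
import re
from collections import Counter

# Patterns modeled on typical Scrapy log messages; the exact wording is an assumption.
GAVE_UP = re.compile(
    r"Gave up retrying <\w+ (?P<url>[^>\s]+)> \(failed \d+ times\): (?P<reason>.+)"
)
IGNORED = re.compile(r"Ignoring response <(?P<status>\d{3}) (?P<url>[^>\s]+)>")


def failure_reasons(log_lines):
    """Map each failed URL to a short reason found in the log."""
    reasons = {}
    for line in log_lines:
        match = GAVE_UP.search(line)
        if match:
            reasons[match["url"]] = match["reason"].strip()
            continue
        match = IGNORED.search(line)
        if match:
            reasons[match["url"]] = "HTTP " + match["status"]
    return reasons


# Made-up log lines for illustration only.
sample_log = [
    "2020-03-13 10:00:01 [scrapy.downloadermiddlewares.retry] DEBUG: "
    "Retrying <GET https://example.com/c> (failed 1 times): 503 Service Unavailable",
    "2020-03-13 10:00:09 [scrapy.downloadermiddlewares.retry] ERROR: "
    "Gave up retrying <GET https://example.com/c> (failed 3 times): 503 Service Unavailable",
    "2020-03-13 10:00:10 [scrapy.spidermiddlewares.httperror] INFO: "
    "Ignoring response <404 https://example.com/b>: HTTP status code is not handled or not allowed",
]

reasons = failure_reasons(sample_log)
print(Counter(reasons.values()))
```

URLs that are missing from the output but have no matching log line were probably dropped for another reason, such as duplicate filtering or a callback that yielded nothing.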

Anil Kumar in Scrapy
Ok

Anil Kumar in Scrapy
Andrey Rahmatullin
check the logs
Thank you

ildar in Scrapy
Anil Kumar
I have given retry times as 70 and download delay. Concurrent requests = 50

ildar in Scrapy
with "retry times as 70", maybe the problem is not about retries at all...

Andrey Rahmatullin in Scrapy
with "I don't know whether it is getting error or not", definitely

ildar in Scrapy
this sounds like "hm, I'm getting error on 2-3-5 retries, maybe if I set 70 the server will give up eventually and load what I need"

ildar in Scrapy
this reminds me of Masyanya for some reason: "the expected response from the server was not received, we'll have to try again"