Description
I encountered an issue where the --limit/-k parameter does not enforce the specified maximum number of papers to download.
Steps to Reproduce
- Run the command:
pygetpapers -q "zeolite" --startdate "2010-01-01" --enddate "2026-01-01" -k 1000 -o pygetpapers -p --pdf -s --supp --loglevel info
- The output shows:
Total Hits are 23264
519it [00:00, 125938.32it/s]
- Question:
How to crawl all 23k+ papers without duplication
Description
I encountered an issue where the
--limit/-kparameter does not enforce the specified maximum number of papers to download.Steps to Reproduce
How to crawl all 23k+ papers without duplication