pygetpapers | Automated Retrieval of Open Access Scientific Literature

Release Category: Production

Developed By: Ayush Garg

pygetpapers is a tool for downloading papers and metadata from open-access repositories. It sends requests to open-access scientific text repositories, analyses the hits, and systematically downloads the articles without further interaction.

It was developed by Ayush Garg under the guidance of the OpenVirus community, Peter Murray-Rust, and Rik Smith-Unna, funded by ContentMine.

It comes with the pygetpapers package and download tools, which provide functions to download, process, and save research papers and their metadata.

We use pygetpapers for querying current and past scholarly literature in bulk.
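As a rough illustration of a bulk query, the sketch below assembles a pygetpapers command line from Python. The helper name build_pygetpapers_command, the example query, and the output directory are hypothetical. The flags used (-q for the query, -k for the hit limit, -o for the output directory, -x for XML) are assumed from the tool's help text and should be checked against pygetpapers --help for your installed version.

```python
import shlex


def build_pygetpapers_command(query, limit, output_dir, want_xml=True):
    """Build an argv list for a pygetpapers run (hypothetical helper).

    The flags are assumptions; confirm them with `pygetpapers --help`.
    """
    argv = ["pygetpapers", "-q", query, "-k", str(limit), "-o", output_dir]
    if want_xml:
        # Also request the full-text XML of each hit.
        argv.append("-x")
    return argv


# Example: up to 10 hits for an illustrative query, saved to a local folder.
command = build_pygetpapers_command("invasive plant species", 10, "invasive_plants")

# Print a shell-safe version of the command; to actually run it,
# pass `command` to subprocess.run(command, check=True).
print(shlex.join(command))
```

Building the arguments as a list rather than a single string avoids quoting problems when the query contains spaces.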

Primary functionality:

Primary inputs:

Primary outputs:

Main file types for transfer: .xml, .pdf, .html, .json, .csv.

Installation

Verify the installation by running the command: pygetpapers --help. You should see a help message.
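The same check can be scripted. The following is a minimal sketch that assumes pygetpapers has already been installed (for example from PyPI with pip, which is an assumption here) and is on the PATH. The helper name cli_responds is hypothetical.

```python
import shutil
import subprocess


def cli_responds(executable, args=("--help",)):
    """Return True if `executable` is on PATH and exits with code 0 (hypothetical helper)."""
    path = shutil.which(executable)
    if path is None:
        # The command is not installed or not on PATH.
        return False
    result = subprocess.run([path, *args], capture_output=True, text=True)
    return result.returncode == 0


if cli_responds("pygetpapers"):
    print("pygetpapers is installed and responds to --help")
else:
    print("pygetpapers was not found on PATH, or --help exited with an error")
```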

Tutorials (Jupyter Notebook / Colab Notebook and video demo)

hackathon
