Commit 1c7334e2 authored by Theo LE CALVAR
add basic readme

parent 67bf4743
# dblp-downloader
This bash script batch downloads all scientific papers from a conference or a journal. It queries [dblp](https://dblp.uni-trier.de/) to retrieve DOIs or direct download links to articles.
It then uses [Sci-Hub](https://sci-hub.se/) to fetch paywalled articles, and downloads open access papers directly.
PDFs are stored in a folder named `pdfs` in the current directory.
A `pages` directory is also created and contains intermediate HTML pages downloaded from Sci-Hub. This folder can safely be removed.
## Usage
Get all articles from a journal:
```
./get_articles.sh --journal journal_short_name
```
Get all articles from a conference:
```
./get_articles.sh --conf conf_short_name
```
You can find the short name of the journal/conference by going to the page dedicated to the venue on dblp and looking at the URL.
For instance, the Software and Systems Modeling journal has the URL <https://dblp.uni-trier.de/db/journals/sosym/index.html>, so its short name is `sosym`.
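The lookup described above can also be done mechanically. The following is a minimal sketch, not part of the script: `venue_short_name` is a hypothetical helper, and it assumes the venue URL follows the pattern `.../db/journals/<short_name>/...` or `.../db/conf/<short_name>/...`.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of get_articles.sh): extract the venue
# short name from a dblp venue URL.
# Assumption: the URL contains /db/journals/<short_name>/ or /db/conf/<short_name>/.
venue_short_name() {
  local url="$1"
  # Keep only the path segment that follows /db/journals/ or /db/conf/.
  # If the URL does not match the pattern, sed prints it unchanged.
  printf '%s\n' "$url" | sed -E 's#^.*/db/(journals|conf)/([^/]+).*$#\2#'
}

venue_short_name "https://dblp.uni-trier.de/db/journals/sosym/index.html"
# prints: sosym
```

The printed value is what you would pass to `--journal` (or `--conf` for a conference URL).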
### Environment variables
- `SCIHUB_BASE_URL`: Sci-Hub base URL to use (default "https://sci-hub.se")
- `MIN_WAIT`: minimum time in seconds to wait between downloads from Sci-Hub (default 5)
- `MAX_WAIT`: maximum time in seconds to wait between downloads from Sci-Hub (default 15)
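To illustrate how `MIN_WAIT` and `MAX_WAIT` could bound the delay between downloads, here is a minimal sketch. It is an assumption about the behaviour, not the script's actual code: `random_wait` is a hypothetical function, and it assumes both variables are whole numbers with `MAX_WAIT` greater than or equal to `MIN_WAIT`.

```bash
#!/usr/bin/env bash
# Sketch only: the real delay logic of get_articles.sh is not shown in this README.
# Fall back to the documented defaults when the variables are unset.
MIN_WAIT="${MIN_WAIT:-5}"
MAX_WAIT="${MAX_WAIT:-15}"

# Hypothetical helper: pick a random whole number of seconds
# in the inclusive range [MIN_WAIT, MAX_WAIT].
random_wait() {
  local span=$(( MAX_WAIT - MIN_WAIT + 1 ))
  echo $(( MIN_WAIT + RANDOM % span ))
}

delay="$(random_wait)"
echo "would sleep ${delay}s before the next Sci-Hub request"
```

The variables are set in the environment of the command, for example `MIN_WAIT=10 MAX_WAIT=30 ./get_articles.sh --journal sosym`.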
## Manual install
Download the `get_articles.sh` script and run it directly.
### Dependencies
- bash
- curl
- sed
- [jq](https://stedolan.github.io/jq/)
- [xidel](https://github.com/benibela/xidel)
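Before a manual run, it can help to confirm that the tools listed above are on your `PATH`. The following is a minimal sketch; `missing_deps` is a hypothetical helper and is not provided by the project.

```bash
#!/usr/bin/env bash
# Hypothetical helper: print each command from the argument list
# that cannot be found on PATH, one per line.
missing_deps() {
  local dep
  for dep in "$@"; do
    command -v "$dep" >/dev/null 2>&1 || printf '%s\n' "$dep"
  done
}

missing="$(missing_deps bash curl sed jq xidel)"
if [ -n "$missing" ]; then
  # $missing is left unquoted on purpose so the names print on one line.
  echo "Missing dependencies:" $missing
else
  echo "All dependencies found."
fi
```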
## Run it using docker
A pre-built Docker image is available so you can use the script without installing its dependencies.
A `docker-compose.yml` file is provided to launch the script.
You can modify the `command` according to your needs.