Online library

What is it?

Online and also offline library with parser, settings and the site. The script can download books(files), their descriptions, create your own library with the website. We will do it with with resources the great free online library tululu.org.

Let's start:

Prerequisites

Please be sure that Python3 is already installed.

Installing

Clone the repository:

git clone https://github.com/MiraNizam/Online-Library.git

Create a new virtual environment env in the directory

python -m virtualenv env

Activate the new environment

source env/bin/activate

Use pip (or pip3, if there is a conflict with Python2) to install dependencies in new environment:

pip install -r requirements.txt

How to run code (Part I):

You can parse: range of books, pages or full category.

You should run the script from the folder "Online-Library"

Script has command-line interface for comfortable using. Interface includes the following commands:

for main.py:

--start_id the start position in range for parsing, default: 1
--end_id the end position in range for parsing, default: 10

for parse_tululu_category.py:

--start_page the start position in range for parsing, default: 1
--end_page the end position in range for parsing, default: the last page in category
--dest_folder path to the catalogue with parse result: images and books, as default: folders named images and books
--skip_imgs Don't download images, change to True. Default: False.
--skip_txtDon't download txt, change to True. Default: False.
--json_path path to JSON file, as default: media
--help use it if you forget the information above

for render_website.py:

--json_path path to JSON file, as default: media

Examples:

Parse book in range from 1 to 10

python main.py

Parse book in range from 11 to 15

python main.py --start_id=11 --end_id=15

Parse full category

python parse_tululu_category.py

Parse page 600, save images and books in folder page_600, save .json file in folder json_file and skip images

python parse_tululu_category.py --start_page 600 --end_page 601 --dest_folder page_600 --json_path json_file --skip_imgs True

How to run code (Part II):

Now we will create a site for our books that we received earlier:

Input:

python render_website.py

Output:

Now you can go to folder pages and see some html files named index. So, you can open any of them, find book and start reading. It will be fully offline version of site. Just enjoy!

Publish site

Additionally, you can publish the site on GitHub Pages. Detailed instructions can be found in this article or on GitHub Pages.

Example site on GitHub Pages:

First page of book catalogue

Project Goals

This code was written for educational purposes as part of an online course for web developers at dvmn.org.

Name		Name	Last commit message	Last commit date
Latest commit History 101 Commits
README_files		README_files
media		media
pages		pages
static		static
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
check_for_redirect.py		check_for_redirect.py
downloader.py		downloader.py
main.py		main.py
parse_tululu_category.py		parse_tululu_category.py
parser.py		parser.py
render_website.py		render_website.py
requirements.txt		requirements.txt
template.html		template.html

License

MiraNizam/Online-Library

Folders and files

Latest commit

History

Repository files navigation

Online library

Let's start:

Prerequisites

Installing

How to run code (Part I):

How to run code (Part II):

Publish site

Project Goals

About

Topics

Resources

License

Stars

Watchers

Forks

Languages