Automatically extract relevant data from invoices by processing their .pdf/.xml files.
-
MySQL
-
Python 3
-
Install the Python and MySQL development headers and libraries-
sudo apt-get install python-dev libmysqlclient-dev
and
sudo apt-get install python3-dev
-
Install mysqlclient-
pip3 install mysqlclient
-
Clone the repository.
-
Create a new MySQL database and a user. (Make sure to update your credentials in the handler.py file)
-
Create a new table -
Name - inputdata
Attributes -
id int, primary key, auto_increment top int height int width int left_mar int tag_data varchar extra varchar fname varchar
-
Run the handler.py script from the command line. eg -
python3 /path/to/handler.py
The script asks the user to input the path for the XML file which is to be processed and shows the result.