Native python tool for text extraction from doc binary file.

ali e217d98ee0 Updated readme 3 years ago
data_creation 9625e5e123 initial release 3 years ago
doc2python 9625e5e123 initial release 3 years ago
.gitignore 9625e5e123 initial release 3 years ago
MANIFEST.in 9625e5e123 initial release 3 years ago
README.md e217d98ee0 Updated readme 3 years ago
requirements.txt 9625e5e123 initial release 3 years ago
setup.py 9625e5e123 initial release 3 years ago

README.md

doc2python

Ali BELLAMINE - contact@alibellamine.me Last version : 1.0 - 07/02/2021

Main repository : https://gogs.alibellamine.me/alibell/doc2python/

Tool that extract text data from doc file.

How to install it ?

Clone the current repository :

    git clone https://gogs.alibellamine.me/alibell/doc2python

Install dependencies with pip.

    pip install -r requirements.txt

Then install the library :

    pip install -e .

How to use it

    from doc2python import process

    text = process(path_to_doc)