Native Python tool for extracting text from binary .doc files.
Ali BELLAMINE - contact@alibellamine.me
Last version: 1.0 - 07/02/2021
Main repository: https://gogs.alibellamine.me/alibell/doc2python/
A tool that extracts text data from .doc files.
Clone the repository:
git clone https://gogs.alibellamine.me/alibell/doc2python
Install the dependencies with pip:
pip install -r requirements.txt
Then install the library in editable mode:
pip install -e .
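To extract text, import process and pass it the path to a .doc file:
from doc2python import process

# path_to_doc is the path to the .doc file to read
text = process(path_to_doc)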
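To apply the same call to several files, a small helper can loop over a folder. The sketch below is not part of doc2python: the name extract_folder and its parameters are hypothetical, and it assumes that the extractor takes a file path as a string and returns the extracted text, as process appears to do in the usage above. The extractor is passed in as an argument so the helper can be tried without the library installed.

```python
from pathlib import Path
from typing import Callable, Dict


def extract_folder(folder: str, extractor: Callable[[str], str]) -> Dict[str, str]:
    """Run `extractor` on every .doc file directly inside `folder`.

    `extractor` is assumed to behave like doc2python's `process`:
    it receives a file path and returns the extracted text.
    Returns a dict mapping each file name to its extracted text.
    """
    results: Dict[str, str] = {}
    for path in sorted(Path(folder).iterdir()):
        # Compare the extension case-insensitively so .DOC files are included.
        if path.is_file() and path.suffix.lower() == ".doc":
            results[path.name] = extractor(str(path))
    return results
```

With the library installed, this would be called as extract_folder("my_docs", process) after from doc2python import process, where "my_docs" is a placeholder folder name. Subfolders are not visited; only files directly inside the given folder are read.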