Skip to content

Flagro/TextTableScoop

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

71 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

📝 TextTableScoop

Welcome to TextTableScoop 🌟, a versatile tool designed for extracting text from files and CSV tables, particularly focusing on Office files like Excel, PowerPoint, etc. This project is part of a 'ProjectText' suite that includes ProjectTextAgent and ProjectDataBaseQnA.

🚀 Features

  • Specializes in extracting text from various file formats, including Office files.
  • Designed to work in both Windows with COM and Linux with LibreOffice + PyUNO.
  • Current implementation supports Linux + LibreOffice + PyUNO.
  • Windows support with COM environment is planned for robust file handling.

📥 Installation

To install TextTableScoop, use the following pip command:

pip3 install git+https://github.com/Flagro/TextTableScoop.git

🛠️ Usage

Run texttablescoop from the bin folder with these arguments:

  1. path: Path to the file or directory to process.
  2. -t or --temp: (Optional) Path to a custom temporary folder.
  3. -p or --project: (Optional) Path to the project folder the file belongs to.
  4. --ignore: (Optional) Comma-separated list of patterns to ignore.

🖥️ Example Command

texttablescoop 'path/to/file' --temp 'path/to/temp' --project 'path/to/project' --ignore 'pattern1,pattern2'

🤝 Collaboration & Issues

Open for collaboration; check the issues page for discussions.

Here's how you can contribute:

  1. Fork the Project.
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature).
  3. Commit your Changes (git commit -m 'Add some AmazingFeature').
  4. Push to the Branch (git push origin feature/AmazingFeature).
  5. Open a Pull Request.