Skip to content

This Python script extracts text content from EPUB files located within a specified folder.

License

Notifications You must be signed in to change notification settings

akumathedyn123/python-epub-extractor-epub2txt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

EPUB Text Extractor

This Python script extracts text content from EPUB files located within a specified folder.

Features

  • Iterates through all EPUB files in a directory.
  • Extracts text content from each book's document sections.
  • Saves the extracted text as separate TXT files in the same folder.

Requirements

  • Python 3 (tested with 3.x)
  • ebooklib library:
  • Install using
  • pip install ebooklib

Usage

  1. Clone the repository:

    git clone https://github.com/akumathedyn123/python-epub_extractor-epub2txt.git
  2. Install dependencies:

    cd epub-text-extractor
    pip install -r requirements.txt
  3. Update folder path:

    Open the script (epub_extractor.py) and edit the epub_folder variable to point to the directory containing your EPUB files.

  4. Run the script:

    python epub_extractor-epub2txt.py

Contributing

Feel free to submit pull requests for improvements or bug fixes.

License

This project is licensed under the MIT License (see LICENSE file for details).

Warning

This script is intended for educational purposes only. It is designed to help you learn about scripting concepts and explore programming possibilities. It is not intended for production use or any situation where unintended consequences could have a negative impact or any legal issues. The author is not responsible for any damages or hazards caused by its use. And, know about local laws.