A PyPi package used for finding, generating, and setting alt-text for images in HTML files. The Alt-Text project is developed for the Free Ebook Foundation as a Senior Design Project at Stevens Institute of Technology.
Go to file
XxMistaCruzxX cf4968f80e
Merge pull request #18 from EbookFoundation/dev-david
Updated version pyproject.toml. Added openai to requirements.txt.
2024-03-07 17:33:51 -05:00
.github/workflows Reconfigure for python-publish Github Action. 2023-10-17 13:24:18 -04:00
src/alttext Added OpenAIAPI LangEngine. 2024-03-07 17:30:09 -05:00
tests Added OpenAIAPI LangEngine. 2024-03-07 17:30:09 -05:00
.gitignore Fixed bug in multi-threaded association creation. Removed test engines from ocrengine and descengine. Condensed Replicate DescEngines into one. 2023-11-28 21:17:35 -05:00
LICENSE Begin setting up for PyPi packaging. Added pyproject.toml, LICENSE, and README.md. Restructured files into '/src' and '/tests' folders. Edited 'idep.sh' and '.gitignore'. 2023-10-16 16:26:01 -04:00
README.md Merge pull request #6 from EbookFoundation/dev-david 2023-11-06 17:10:54 -05:00
pyproject.toml Updated version pyproject.toml. Added openai to requirements.txt. 2024-03-07 17:33:25 -05:00
requirements.txt Updated version pyproject.toml. Added openai to requirements.txt. 2024-03-07 17:33:25 -05:00

README.md

Alt-Text

A PyPi package used for finding, generating, and setting alt-text for images in HTML and EPUB files.

Getting Started

Installation

You can find the PyPi package here. To install the package via, you can execute the following in a terminal for your respective system...

Windows
py -m pip install alt-text

Unix/MacOS
python3 -m pip install alt-text

Developer Dependencies

All developer dependencies can be found here. You will only need to install these individually when working directly with the source code.

Engine Dependencies

As of the moment, the image analyzation tools that Alt-Text uses are not fully bundled with the package itself. Hence, depending on the type of engines you are using (for Description Generation and/or Character Recognition), you will need to install various applications/get API keys for the respective functionalities.

Description Engines

Description Engines are used to generate descriptions of an image. If you are to use one of these, you will need to fulfill that specific Engine's dependencies before use.

ReplicateMiniGPT4API

ReplicateMiniGPT4API Engine uses the Replicate API, hence you will need to get an API key via Logging in with Github on the Replicate website.

GoogleVertexAPI

GoogleVertexAPI Engine uses the Vertex AI API, hence you will need to get access from the Google API Marketplace. Additionally, Alt-Text uses Service Account Keys to get authenticated with Google Cloud, hence you will need to Create a Service Account Key with permission for the Vertex AI API and have its according JSON.

OCR Engines

Optical Character Recognition Engines are used to find text within images. If you are to use one of these, you will need to fulfill that specific Engine's dependencies before use.

Tesseract

The Tesseract Engine uses Tesseract, hence you will need to install the Tesseract OCR.

Quickstart & Usage

To be added...

Our Mission

The Alt-Text project is developed for the Free Ebook Foundation as a Senior Design Project at Stevens Institute of Technology.

As Ebooks become a more prominant way to consume written materials, it only becomes more important for them to be accessible to all people. Alternative text (aka alt-text) in Ebooks are used as a way for people to understand images in Ebooks if they are unable to use images as intended (e.g. a visual impaired person using a screen reader to read an Ebook).

While this feature exists, it is still not fully utilized and many Ebooks lack alt-text in some, or even all their images. To illustrate this, the Gutenberg Project, the creator of the Ebook and now a distributor of Public Domain Ebooks, have over 70,000 Ebooks in their collection and of those, there are about 470,000 images without alt-text.

The Alt-Text project's goal is to use the power of AI, Automation, and the Internet to craft a solution capable of automatically generating descriptions for images lacking alt-text in Ebooks, closing the accessibility gap and improving collections, such as the Gutenberg Project.

Contact Information

The emails and relevant information of those involved in the Alt-Text project can be found below.

The Deveolpers

The Client

Advisor

APIs, Tools, & Libraries Used

Alt-Text is developed using an assortment of modern Python tools...

Development Tools

Alt-Text is developed using...

APIs and Supplementary Tools

Packaging/Distribution Tools

Alt-Text is distributed using...