Skip to content

phanxuanquang/XCan-AI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

© 2024 Phan Xuan Quang / XCan AI


XCan AI is a powerful tool designed to simplify the extraction of text from images. It can handle various types of content, including tables, code, mathematical formulas, and more. Additionally, it can translate the extracted text into Vietnamese while preserving the original format.

The users just input the image by pasting (Ctrl + V) or select the image from the local device, then the AI handles the rest to provide the text in the image if any. In addition, the tool can help to translate the text in the image into Vietnamese without changing the text format.

Features

Image

  • Text Extraction: Extracts text from images with high fidelity, maintaining the original layout and formatting.
  • Translation: Translates extracted text into Vietnamese without altering the format.
  • Support for Various Formats: Handles tables, code snippets, mathematical formulas, and more.
  • User-Friendly Interface: Allows users to input images by pasting, selecting from the local device, or taking a photo.

Getting Started

Prerequisites

Installation

  1. Clone the repository:

    git clone https://github.com/phanxuanquang/XCan-AI.git
    cd XCan-AI
  2. Backend Setup:

  • In Visual Studio Code:
    • Navigate to the XCan.Api directory:
      cd XCan.Api
    • Restore the .NET dependencies:
      dotnet restore
    • Build the project:
      dotnet build
  • In Visual Studio:
    • Build the project XCan.Api.
  1. Frontend Setup:
    • Navigate to the xcan.web directory:
      cd ../xcan.web
    • Install the Node.js dependencies:
      npm install

Running the Application

  1. Start the Backend:

    • Navigate to the XCan.Api directory:
      cd ../XCan.Api
    • Run the application:
      dotnet run
  2. Start the Frontend:

    • Navigate to the xcan.web directory:
      cd ../xcan.web
    • Run the application:
      npm start
    • Open http://localhost:3000 to view it in your browser.

Usage

  1. Gemini API Key: Obtain a Gemini API key from Google AI Studio and enter it in the application.
  2. Upload Image: Paste, drag & drop, or select an image from your local device.
  3. Extract Text: Click the "Scan" button to extract text from the image.
  4. Translate Text: Click the "Translate" button to translate the extracted text into Vietnamese.

Contributing

We welcome contributions! Please follow these steps:

  1. Fork the repository.
  2. Create a new branch:
    git checkout -b feature/your-feature-name
  3. Make your changes.
  4. Commit your changes:
    git commit -m "Add your commit message"
  5. Push to the branch:
    git push origin feature/your-feature-name
  6. Create a Pull Request.

Contact

For any questions or feedback, please open an Issue or Discussion in this repository.

About

Simplify the way we extract text from image using AI

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published