Welcome to Athena's Oracle, the groundbreaking AI companion designed to sail through the vast ocean of documents with the wisdom and insight of the ancient goddess of wisdom herself, Athena. 🌟 Drawing upon the mighty advancements in artificial intelligence and natural language processing, Athena's Oracle empowers users to dive deep into academic papers, reports, and any form of textual content, uncovering hidden insights with ease and precision like never before.
The following work is based on Athena's Oracle Optimization Development! PaperHelper
- : RAG Tutorial: Start from Athenas-Oracle [1]
- : RAG Tutorial: Start from Athenas-Oracle [2]
- : RAG Tutorial: Start from Athenas-Oracle [3]
- [New Article is out now!!!] : RAG Tutorial: Start from Athenas-Oracle [4]
- Athena's Oracle: 掌握 RAG 技术的理想跳板
- RAG 教程: 从Athenas-Oracle开始 [1]
- RAG 教程: 从Athenas-Oracle开始 [2]
- [New Article is out now!!!] RAG 教程: 从Athenas-Oracle开始 [3]
- [New Article is out now!!!] RAG 教程: 从Athenas-Oracle开始 [4]
We are thrilled to announce the launch of a brand-new feature designed to enhance your document analysis and knowledge acquisition experience: 🚀 Batch Downloading from Awesome Paper Lists 🚀.
📚 Batch Downloading: Automatically extract arXiv links from awesome paper lists on GitHub and download papers in bulk to your local system.
🔧 RAG Knowledge Base Construction: With the batch-downloaded papers, effortlessly build or expand your RAG (Retrieval-Augmented Generation) knowledge base, providing solid support for in-depth research and machine learning models.
- Intuitive Query Handling: Whisper your questions into the ear of Athena's Oracle, and be guided to the wisdom you seek, leveraging the depths of your entire document repository. 📖
- Multi-Document Intelligence: Capable of parsing and understanding layers of information across multiple documents, Athena's Oracle weaves answers that are not just accurate but breathtakingly comprehensive. 🌐
- Reciprocal Rank Fusion: Through a sophisticated dance of algorithms, the most relevant documents emerge, combining forces from various sources to deliver the ultimate insights. 🔍
- Streamlit Integration: Step into the temple of Athena's Oracle via a user-friendly interface, making the art of complex document analysis accessible to all mortals. 💻
Athena's Oracle conjures its magic by intertwining several state-of-the-art technologies and methodologies:
- Document Retrieval: Harnesses the power of advanced vector search technologies to unearth the most relevant documents based on your heart's queries. 🗂️
- Question Answering: Employs the wisdom of the latest language models to generate precise answers to your inquiries, drawing knowledge from the identified documents. 💡
- Reciprocal Rank Fusion: Gathers the search results from multiple documents, ensuring the knowledge bestowed upon you is the most pertinent and rich. 📊
To embark on your journey with Athena's Oracle, follow these mystic steps:
- Clone the Repository: Summon the latest version of Athena's Oracle from our GitHub sanctuary.
git clone https://github.com/H-Freax/Athenas-Oracle.git
- Install Dependencies: Enter the sacred project directory and conjure the required Python elixirs.
cd Athenas-Oracle pip install -r requirements.txt
After you've got all the requirements set up, there's one more crucial step before you can unleash the full power of Athena's Oracle: adding your OpenAI API key. This key is like the secret password that grants you access to the vast intelligence Athena's Oracle taps into.
You have two options for managing the OpenAI API key:
-
Add to Streamlit Secrets: You can securely store your API key by adding it to the
.streamlit/secrets.toml
file. This way, it's automatically picked up by the app, and you don't have to worry about it again. Here's how you format it in the file:OPENAI_API_KEY = "sk-yourapikeyhere"
Make sure to replace
"sk-yourapikeyhere"
with your actual OpenAI API key. -
User Input in Sidebar: If you prefer, you can also set up the app to ask users to input their OpenAI API key in the sidebar each time they visit the page. This method is a bit more flexible and great for when you're sharing your creation with others but don't want to share your API key directly.
With your API key securely in place, you're ready to bring Athena's Oracle to life. Open your terminal or command prompt, navigate to the folder where you've stored Athena's Oracle, and run the following spell:
streamlit run app.py
Note: Set allow_dangerous_deserialization: bool = True
first, where you can find in faiss.py
.
You may also embed your pdf, or you may raise error:
Once Athena's Oracle has awakened, you stand at the threshold of untold knowledge. Follow these steps to navigate through the wisdom of the ages:
Entering the Sacred ArXiv Link 🔗 Begin your journey by entering the sacred arXiv link into the designated field. This link is your key to unlocking the vast halls of academic knowledge stored within the arXiv's libraries. Fear not, for Athena's Oracle is designed to guide you through this process with ease.
The Downloading Ritual 📥 Upon submitting your arXiv link, Athena's Oracle will commence the downloading ritual. Watch as the document is summoned from the digital ethers into the realm of the Oracle. A message of "Download Successful" shall herald the completion of this step, marking the document's readiness for the next phase of enlightenment.
Generating the Sacred Embeddings 🔮 With the document now within Athena's grasp, the Oracle shall begin the arcane process of generating embeddings. These embeddings are the essence of the document's knowledge, transformed into a form that Athena's Oracle can comprehend and utilize in its wisdom-giving endeavors.
Choosing the Parchment 📜 With the embeddings crafted, you must now choose the file — the specific parchment of knowledge you wish to query. This step is crucial, for it determines the source of the wisdom Athena's Oracle will provide. Select wisely, and let your intuition guide you.
Commencing the Dialogue with Athena 🗣️ At last, you stand ready to engage in a dialogue with Athena's Oracle. Enter your queries, your questions, and your ponderings into the Oracle's interface. Athena's Oracle, now fully attuned to the knowledge of the chosen document, will respond with insights, answers, and guidance drawn from the depths of the text.
Embark on this adventure equipped with these magical artifacts:
- LangChain: For the seamless weaving of language models into the fabric of applications.
- FAISS: For the swift and efficient search and clustering of dense vectors, akin to finding needles in cosmic haystacks.
- OpenAI: For invoking the powerful oracles housed within AI models.
- arxiv-downloader: A revered tool for summoning documents from the halls of arXiv, showcasing the boundless realms of document retrieval.
- LLMStreamlitDemoBasic: A beacon of inspiration on how to craft interactive demos with Streamlit and the oracles of language models, enhancing the mortal's experience within Athena's Oracle.
Athena's Oracle flourishes with the contributions and insights from the community. Whether it's refining the algorithm, enhancing the divine interface, or expanding the document corpus, your wisdom is invaluable to us. Feel free to fork the repository, infuse it with your enhancements, and submit a pull request.
- arXiv-downloader: Heartfelt thanks to arXiv-downloader for providing easy access to the scrolls of academic papers, aiding in the continuous growth of Athena's Oracle's knowledge base.
- LLMStreamlitDemoBasic: Our gratitude extends to the sages at LLMStreamlitDemoBasic for laying down the mystical foundations for integrating great language models with Streamlit, thus enhancing the interactive pilgrimage through Athena's Oracle.
Athena's Oracle is bestowed upon the world under the MIT License. For more details, consult the LICENSE scroll.