# RAG Knowledge Base

A knowledge management system built on Retrieval-Augmented Generation (RAG) that helps organizations efficiently organize, search, and retrieve information.
## Features

- 🔍 Semantic search powered by the Pinecone vector database
- 🤖 Advanced question answering using Google's Gemini Pro
- 📚 Document ingestion and processing
- 💬 Interactive chat interface
- 🔄 Context-aware conversations
- 🎯 Precise information retrieval
## Prerequisites

- Python 3.8 or higher
- Pinecone API key (sign up at Pinecone)
- Google API key for Gemini Pro (get one from Google AI Studio)
## Installation

1. Clone the repository:

   ```bash
   git clone https://github.com/asvpappula/RAGapplication.git
   cd RAGApplication
   ```

2. Set up the environment:

   ```bash
   # For Unix/macOS
   chmod +x install.sh
   ./install.sh
   ```

   ```bash
   # For Windows, run these commands manually:
   python -m venv venv
   .\venv\Scripts\activate
   pip install -r requirements.txt
   ```

3. Configure environment variables:

   - Copy `.env.example` to `.env`
   - Add your API keys and configuration:

   ```
   GOOGLE_API_KEY=your_google_api_key_here
   PINECONE_API_KEY=your_pinecone_api_key_here
   PINECONE_ENVIRONMENT=your_pinecone_environment
   ```
4. Run the application:

   ```bash
   python app.py
   ```

5. Open your browser and navigate to:

   ```
   http://localhost:5000
   ```
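The API keys configured above are read from the process environment at startup. As a minimal sketch of defensive key loading (the `require_env` helper below is hypothetical, not the repo's actual code; `app.py` may load `.env` differently, e.g. via `python-dotenv`):

```python
import os

# Hypothetical helper: fetch a required key and fail fast with a clear
# message instead of a confusing downstream API error.
def require_env(name: str) -> str:
    value = os.environ.get(name)
    if not value:
        raise RuntimeError(f"Missing required environment variable: {name}")
    return value

# Example usage (values come from your .env / shell environment):
# google_api_key = require_env("GOOGLE_API_KEY")
# pinecone_api_key = require_env("PINECONE_API_KEY")
```

Failing fast at startup makes a missing or misspelled key obvious before any Pinecone or Gemini call is attempted.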
## Adding Documents

1. Create a `data/raw_text` directory
2. Add your PDF or text documents to this directory
3. Run the indexing script:

   ```bash
   python backend/pinecone/extract_and_chunk.py
   ```
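Conceptually, the first stage of indexing is just reading the raw files into memory before chunking and upserting to Pinecone. A simplified sketch of that stage (the function name and signature are illustrative, not the script's actual API, and the real script also handles PDFs):

```python
from pathlib import Path
from typing import Dict

def load_raw_texts(raw_dir: str = "data/raw_text") -> Dict[str, str]:
    """Read every .txt file under raw_dir into a {filename: text} map.

    Illustrative only: the real extract_and_chunk.py also extracts text
    from PDFs, then chunks and upserts the results into Pinecone.
    """
    docs = {}
    for path in sorted(Path(raw_dir).glob("*.txt")):
        docs[path.name] = path.read_text(encoding="utf-8")
    return docs
```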
## Customizing the System Prompt

The system prompt defines how the AI processes and responds to questions. You can customize it in `backend/llm/response_generation.py`:

1. Locate the `get_prompt_template` method
2. Modify the template string to:
   - Change the AI's persona
   - Adjust response formatting
   - Add domain-specific instructions
   - Customize the source citation format
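As a concrete example, a customized template might look like the sketch below. The method name `get_prompt_template` comes from the file above, but the template text, the persona, and the `{context}`/`{question}` placeholder names are illustrative assumptions, not the repo's actual template:

```python
def get_prompt_template() -> str:
    # Illustrative template: {context} and {question} are filled in at
    # query time with the retrieved chunks and the user's question.
    return (
        "You are a helpful support engineer for ACME Corp.\n"      # persona
        "Answer using only the context below.\n"
        "Cite each fact as [source_filename].\n\n"                 # citation format
        "Context:\n{context}\n\n"
        "Question: {question}\n"
        "Answer:"
    )

# Filling the template with retrieved context and a user question:
prompt = get_prompt_template().format(
    context="Widgets ship in 2 days. [shipping.pdf]",
    question="How long does shipping take?",
)
```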
## Customizing Document Processing

Customize how documents are processed in `backend/pinecone/extract_and_chunk.py`:

- Modify chunk size and overlap
- Adjust text extraction rules
- Customize metadata extraction
- Change document type handling
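Chunk size and overlap control retrieval granularity: smaller chunks give more precise matches, while overlap keeps sentences cut at a boundary intact in a neighboring chunk. A minimal sketch of sliding-window chunking (parameter names are illustrative and may not match those used in `extract_and_chunk.py`):

```python
from typing import List

def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> List[str]:
    """Split text into fixed-size character windows that overlap,
    so content cut at a chunk boundary also appears in the next chunk."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[start:start + chunk_size] for start in range(0, len(text), step)]
```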
## Project Structure

```
rag-knowledge-base/
├── backend/
│   ├── llm/              # LLM integration
│   └── pinecone/         # Vector database operations
├── data/
│   └── raw_text/         # Place your documents here
├── static/               # Frontend assets
├── templates/            # HTML templates
├── app.py                # Main Flask application
└── requirements.txt      # Python dependencies
```
## Key Files

- `app.py`: Main Flask application
- `backend/llm/response_generation.py`: RAG implementation
- `backend/pinecone/extract_and_chunk.py`: Document processing
- `templates/index.html`: Chat interface
- `static/js/chat.js`: Frontend logic
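The RAG implementation in `response_generation.py` follows the standard retrieve-then-generate loop: embed the question, query Pinecone for similar chunks, and ask Gemini to answer grounded in those chunks. A dependency-injected sketch of that loop (all names here are illustrative, not the repo's actual API; the callables stand in for the Gemini embedding, Pinecone query, and Gemini Pro generation calls):

```python
from typing import Callable, List

def answer_question(
    question: str,
    embed: Callable[[str], List[float]],        # e.g. a Gemini embedding call
    search: Callable[[List[float]], List[str]],  # e.g. a Pinecone index query
    generate: Callable[[str], str],              # e.g. a Gemini Pro completion
) -> str:
    """Standard RAG loop: embed the question, retrieve similar chunks,
    then ask the LLM to answer using only those chunks as context."""
    vector = embed(question)
    chunks = search(vector)
    prompt = (
        "Context:\n" + "\n".join(chunks) +
        f"\n\nQuestion: {question}\nAnswer:"
    )
    return generate(prompt)
```

Passing the three calls in as parameters keeps the loop testable with stubs, independent of live Pinecone and Gemini credentials.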
## Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
## License

Apache License Version 2.0 - see the LICENSE file for details.