📚 Enhanced Multimodal RAG with Hugging Face

Get HF Token: Visit Hugging Face Settings to get your token
Upload PDF: Click "Choose File" and select your PDF document
Process Document: Click "Process PDF" and wait for confirmation
Ask Questions: Type questions or use example prompts

Upload a PDF document and ask questions about its content, including images and tables!

Now with improved PDF processing and multiple extraction methods

📄 Multiple Text Extraction Methods: PyPDF2, PyMuPDF, OCR, and Unstructured
🖼️ Advanced Image Processing: Direct PDF image extraction + vision models
🔍 Robust PDF Handling: Works with scanned PDFs, complex layouts, and image-heavy documents
💬 Interactive Chat: Conversation history with multimodal understanding
⚡ Error Recovery: Graceful fallbacks when one extraction method fails
📊 Processing Statistics: Detailed feedback on what was extracted

Text + Images: Can answer questions about both text content and visual elements
Image Understanding: Describes charts, diagrams, photos in your PDFs
OCR Integration: Extracts text from images within PDFs
Context Awareness: Combines text and visual information for comprehensive answers
Fallback Strategy: Uses multiple methods to ensure successful text extraction