
📚 Antigravity Notebook

A NotebookLM clone powered by Apple's CLaRa-7B-Instruct for infinite context reasoning

Antigravity Notebook enables you to create "Notebooks" where you can upload multiple disparate sources (PDFs, URLs, Text) and have an AI reason across all of them simultaneously using CLaRa's latent compression technology.

🌟 Key Features

The "Infinite Context" Strategy

  • 16x Compression: CLaRa compresses text into latent representations, reducing context usage by ~16x
  • Whole-Notebook Reasoning: When all sources fit in context (32k tokens), the AI reads EVERYTHING
  • Smart Retrieval: For larger notebooks, intelligently selects the most relevant sources
  • Multi-Modal Ingestion: Support for PDFs, URLs, and plain text

NotebookLM-Style Interface

  • Notebook Organization: Group related sources into project notebooks
  • Source Management: Easy upload, URL scraping, and text input
  • Memory Usage Meter: Visual gauge showing context utilization
  • Citation Tracking: See which sources were used for each response

πŸ—οΈ Architecture

┌─────────────────────────────────────────────────────────┐
│                    Streamlit UI                         │
│  (NotebookLM-style interface with sidebar + chat)       │
└────────────────────┬────────────────────────────────────┘
                     │
                     ↓
┌─────────────────────────────────────────────────────────┐
│                   FastAPI Backend                       │
│                                                         │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐   │
│  │  Notebooks   │  │   Sources    │  │     Chat     │   │
│  │   Router     │  │   Router     │  │   Router     │   │
│  └──────────────┘  └──────────────┘  └──────────────┘   │
└────────────────────┬────────────────────────────────────┘
                     │
        ┌────────────┴─────────────┬──────────────┐
        ↓                          ↓              ↓
┌───────────────┐      ┌──────────────────┐   ┌──────────────┐
│   CLaRa-7B    │      │  ContextManager  │   │   Storage    │
│  (Compress &  │      │  (Whole-Context  │   │   Service    │
│   Generate)   │      │    Strategy)     │   │  (Tensors)   │
└───────┬───────┘      └─────────┬────────┘   └──────┬───────┘
        ↓                        ↓                   ↓
┌─────────────────────────────────────────────────────────┐
│                     PostgreSQL                          │
│  (Notebooks → Sources → LatentTensors → ChatMessages)   │
└─────────────────────────────────────────────────────────┘

🚀 Quick Start

Prerequisites

  • Python 3.9+
  • Docker & Docker Compose (for PostgreSQL)
  • CUDA-capable GPU (recommended, 16GB+ VRAM for CLaRa-7B)

Installation

  1. Clone the repository
     git clone <your-repo-url>
     cd antigravity-notebook
  2. Install dependencies
     pip install -r requirements.txt
  3. Set up the environment
     cp .env.example .env
     # Edit .env with your configuration
  4. Start PostgreSQL
     docker-compose up -d
  5. Initialize the database
     python -m backend.database
  6. Start the backend
     python -m backend.main
  7. Start the frontend (in a new terminal)
     streamlit run frontend/app_notebook.py
  8. Open your browser (Streamlit typically serves the UI at http://localhost:8501)

📖 Usage

Creating a Notebook

  1. Open the Streamlit UI
  2. Click "Create New Notebook" in the sidebar
  3. Enter a name and description
  4. Click "Create Notebook"

Adding Sources

Upload PDF:

  1. Select your notebook
  2. Go to "Add Source" → "PDF" tab
  3. Upload your PDF file
  4. Wait for processing (CLaRa compression)

Add URL:

  1. Select your notebook
  2. Go to "Add Source" → "URL" tab
  3. Paste the URL
  4. Optionally add a custom title
  5. Click "Add URL"

Add Text:

  1. Select your notebook
  2. Go to "Add Source" → "Text" tab
  3. Enter a title and paste your text
  4. Click "Add Text"

Querying Your Notebook

  1. Select a notebook with sources
  2. Type your question in the chat input
  3. The AI will reason across ALL your sources
  4. View the response and see which sources were cited

🧠 How It Works

Latent Compression

When you add a source:

  1. Text is extracted (PDF/URL/Text)
  2. Split into 2048-token chunks
  3. Each chunk is compressed by CLaRa into a latent tensor (~128 tokens)
  4. Latent tensors are saved to disk
  5. Metadata is stored in PostgreSQL
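
The steps above can be sketched roughly as follows. This is an illustrative outline, not the actual backend code: `chunk_tokens`, `compress`, and `save_tensor` are assumed names, and whitespace splitting stands in for a real tokenizer.

```python
def chunk_tokens(tokens, chunk_size=2048):
    """Split a token list into fixed-size segments (the last may be shorter)."""
    return [tokens[i:i + chunk_size] for i in range(0, len(tokens), chunk_size)]

def ingest_source(text, compress, save_tensor):
    """Chunk extracted text, compress each chunk to a latent tensor, persist it.

    compress(chunk)      -> latent tensor (~2048 tokens in, ~128 latent tokens out)
    save_tensor(t, idx)  -> path of the tensor saved to disk
    """
    tokens = text.split()  # stand-in for the real tokenizer
    paths = []
    for idx, chunk in enumerate(chunk_tokens(tokens)):
        latent = compress(chunk)
        paths.append(save_tensor(latent, idx))
    return paths
```
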

Context Management

When you query a notebook:

  1. ContextManager fetches ALL latent tensors for the notebook
  2. Calculates total token count
  3. If ≤ 32k tokens: Stacks ALL tensors → Whole-Notebook Reasoning
  4. If > 32k tokens: Ranks tensors by relevance, selects top-N → Selective Retrieval
  5. Generates response using CLaRa with the selected context
  6. Returns answer with source citations
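
The selection logic can be sketched like this (assumed names and a greedy ranking; the real ContextManager may differ):

```python
MAX_CONTEXT_TOKENS = 32_768  # latent-space budget from the configuration

def select_tensors(tensors, relevance, budget=MAX_CONTEXT_TOKENS):
    """tensors: list of (tensor, token_count) pairs.
    relevance: scores a tensor against the query (higher = more relevant)."""
    total = sum(count for _, count in tensors)
    if total <= budget:
        return tensors  # everything fits: whole-notebook reasoning
    # Selective retrieval: greedily take the highest-ranked tensors that fit.
    ranked = sorted(tensors, key=lambda pair: relevance(pair[0]), reverse=True)
    selected, used = [], 0
    for tensor, count in ranked:
        if used + count <= budget:
            selected.append((tensor, count))
            used += count
    return selected
```
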

πŸ› οΈ API Endpoints

Notebooks

  • POST /notebooks/ - Create notebook
  • GET /notebooks/ - List notebooks
  • GET /notebooks/{id} - Get notebook details
  • GET /notebooks/{id}/stats - Get context usage stats
  • PATCH /notebooks/{id} - Update notebook
  • DELETE /notebooks/{id} - Delete notebook

Sources

  • POST /sources/notebooks/{id}/sources/upload - Upload PDF
  • POST /sources/notebooks/{id}/sources/url - Add URL
  • POST /sources/notebooks/{id}/sources/text - Add text
  • GET /sources/notebooks/{id}/sources - List sources
  • DELETE /sources/{id} - Delete source

Chat

  • POST /chat/notebooks/{id}/chat - Query notebook
  • GET /chat/notebooks/{id}/messages - Get chat history
  • DELETE /chat/notebooks/{id}/messages - Clear chat history

📊 Database Schema

notebooks
├── id (UUID)
├── name
├── description
├── created_at
└── updated_at

sources
├── id (UUID)
├── notebook_id (FK)
├── source_type (pdf|url|text)
├── filename
├── url
├── content_hash
└── metadata (JSONB)

latent_tensors
├── id (UUID)
├── source_id (FK)
├── tensor_path
├── segment_index
├── token_count
└── metadata (JSONB)

chat_messages
├── id (UUID)
├── notebook_id (FK)
├── role (user|assistant)
├── content
└── sources_used (JSONB)
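
Declared with SQLAlchemy, the first two tables might look roughly like this (a sketch inferred from the listing above; the actual models, and the timestamp columns omitted here, may differ):

```python
import uuid
from sqlalchemy import Column, ForeignKey, String, Text
from sqlalchemy.dialects.postgresql import JSONB, UUID
from sqlalchemy.orm import declarative_base

Base = declarative_base()

class Notebook(Base):
    __tablename__ = "notebooks"
    id = Column(UUID(as_uuid=True), primary_key=True, default=uuid.uuid4)
    name = Column(String, nullable=False)
    description = Column(Text)

class Source(Base):
    __tablename__ = "sources"
    id = Column(UUID(as_uuid=True), primary_key=True, default=uuid.uuid4)
    notebook_id = Column(UUID(as_uuid=True), ForeignKey("notebooks.id"))
    source_type = Column(String)      # pdf | url | text
    content_hash = Column(String)
    # Attribute is `meta` because `metadata` is reserved on declarative models;
    # the database column is still named "metadata".
    meta = Column("metadata", JSONB)
```

The remaining tables (latent_tensors, chat_messages) follow the same pattern.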

βš™οΈ Configuration

Edit .env to configure:

# Database
POSTGRES_USER=antigravity
POSTGRES_PASSWORD=antigravity123
POSTGRES_DB=antigravity_db

# CLaRa Model
MODEL_NAME=apple/CLaRa-7B-Instruct
DEVICE=cuda  # or cpu
MAX_CONTEXT_TOKENS=32768
COMPRESSION_RATIO=16

# Storage
LATENT_TENSOR_DIR=./data/latent_tensors

# API
API_PORT=8000
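
The backend presumably reads these variables from the environment along these lines (a sketch using `os.environ` with the defaults shown above; the actual settings loader may differ):

```python
import os

def get_settings():
    """Read the .env-backed configuration, falling back to the defaults above."""
    return {
        "model_name": os.environ.get("MODEL_NAME", "apple/CLaRa-7B-Instruct"),
        "device": os.environ.get("DEVICE", "cuda"),
        "max_context_tokens": int(os.environ.get("MAX_CONTEXT_TOKENS", "32768")),
        "compression_ratio": int(os.environ.get("COMPRESSION_RATIO", "16")),
        "latent_tensor_dir": os.environ.get("LATENT_TENSOR_DIR", "./data/latent_tensors"),
        "api_port": int(os.environ.get("API_PORT", "8000")),
    }
```
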

🎯 Performance

  • Ingestion: ~30s for 50-page PDF
  • Query Response: ~10s for full notebook
  • Capacity: 10-20 average-sized books per notebook

🔬 Technical Details

Why CLaRa?

CLaRa (Compressing Long-range Attention) uses latent compression to represent text in a much smaller space, enabling:

  • 16x compression ratio
  • Preservation of semantic information
  • Cross-document reasoning

Context Budget

  • Standard: 32,768 tokens (latent space)
  • Equivalent to: ~500k original text tokens (with 16x compression)
  • Example: Can fit 10-20 full books simultaneously

🤝 Contributing

Contributions welcome! Please open an issue or PR.

πŸ“ License

MIT License - see LICENSE file

πŸ™ Acknowledgments

  • Apple for CLaRa-7B-Instruct
  • Google for NotebookLM inspiration
  • HuggingFace for model hosting

Built with ❤️ by the Antigravity Team