
🚀 Prompt Compression Gateway

A production-ready API gateway that compresses LLM prompts and enforces token policies before execution.

🎯 Why This Exists

LLM prompts are getting longer and more expensive. This gateway helps by:

  • 💰 Cost Reduction - Smart compression reduces token usage
  • 🛡️ Policy Enforcement - Token limits before API calls
  • ⚡ Fast Processing - Efficient compression with LLMLingua-2

✨ Features

  • Intelligent prompt compression
  • Configurable token limits
  • REST API with FastAPI
  • Docker support
  • Comprehensive tests
  • Clear documentation

🚀 Quick Start

Installation

# Clone the repo
git clone https://github.com/kelpejol/prompt-compression-gateway.git
cd prompt-compression-gateway

# Install dependencies
pip install -r requirements.txt

# Run the server
uvicorn gateway.main:app --reload

Docker

docker-compose up -d

First API Call

curl -X POST http://localhost:8000/compress \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "You are an AI assistant helping with code review.",
    "max_tokens": 512,
    "compression_ratio": 0.5
  }'
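The same call can be made from Python using only the standard library. This is a minimal sketch: the payload fields mirror the curl example above, and the actual network call is kept behind a `__main__` guard because it needs the Quick Start server running.

```python
import json
import urllib.request

# Payload mirrors the curl example above.
payload = {
    "prompt": "You are an AI assistant helping with code review.",
    "max_tokens": 512,
    "compression_ratio": 0.5,
}

def compress(body: dict, url: str = "http://localhost:8000/compress") -> dict:
    """POST the body to the gateway and return the decoded JSON response."""
    req = urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())

if __name__ == "__main__":
    # Requires the server from the Quick Start to be running.
    print(compress(payload))
```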

📖 API Documentation

POST /compress

Compress a prompt with policy enforcement.

Request:

{
  "prompt": "string",
  "max_tokens": 2048,
  "compression_ratio": 0.5
}

Response:

{
  "original_tokens": 150,
  "compressed_tokens": 75,
  "compressed_prompt": "compressed text here"
}
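The request and response shapes above can be sketched as typed models. Field names and defaults are taken from the JSON examples; plain dataclasses stand in here for the Pydantic models a FastAPI app would actually declare, and the class names `CompressRequest` / `CompressResponse` are illustrative, not necessarily those in the codebase.

```python
from dataclasses import dataclass

@dataclass
class CompressRequest:
    """Request body for POST /compress (fields from the JSON above)."""
    prompt: str
    max_tokens: int = 2048          # token ceiling enforced before compression
    compression_ratio: float = 0.5  # target fraction of tokens to keep

@dataclass
class CompressResponse:
    """Response body returned by the gateway."""
    original_tokens: int
    compressed_tokens: int
    compressed_prompt: str

req = CompressRequest(prompt="You are an AI assistant helping with code review.")
resp = CompressResponse(original_tokens=150, compressed_tokens=75,
                        compressed_prompt="compressed text here")
```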

Interactive Docs: Visit http://localhost:8000/docs after starting the server.

🏗️ Architecture

Client Request
     ↓
Token Policy Check
     ↓
LLMLingua Compression
     ↓
Compressed Output
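The pipeline above can be sketched end to end. This is a toy illustration of the ordering (policy check first, compression second): whitespace token counting and prefix truncation are placeholders for the real tokenizer and the LLMLingua-2 step, and `TokenLimitError` is a hypothetical name.

```python
class TokenLimitError(ValueError):
    """Raised when a prompt exceeds the configured token ceiling."""

def count_tokens(text: str) -> int:
    # Placeholder: the real gateway would use the model's tokenizer.
    return len(text.split())

def enforce_policy(prompt: str, max_tokens: int) -> None:
    # Step 1: the token policy check runs before any compression work.
    n = count_tokens(prompt)
    if n > max_tokens:
        raise TokenLimitError(f"prompt has {n} tokens, limit is {max_tokens}")

def compress(prompt: str, ratio: float) -> str:
    # Step 2: stand-in for LLMLingua compression; keeps the first `ratio` of tokens.
    tokens = prompt.split()
    keep = max(1, int(len(tokens) * ratio))
    return " ".join(tokens[:keep])

def gateway(prompt: str, max_tokens: int = 2048, ratio: float = 0.5) -> dict:
    enforce_policy(prompt, max_tokens)
    compressed = compress(prompt, ratio)
    return {
        "original_tokens": count_tokens(prompt),
        "compressed_tokens": count_tokens(compressed),
        "compressed_prompt": compressed,
    }
```

Keeping the policy check ahead of compression means over-limit prompts are rejected before any model work is spent on them.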

🧪 Testing

# Run tests
pytest

# With coverage
pytest --cov=gateway

# Specific test file
pytest tests/test_api.py

🛠️ Development

# Install dev dependencies
pip install -r requirements-dev.txt

# Format code
black gateway/ tests/

# Run linter
ruff check gateway/

📦 Deployment

Using Docker

docker build -t prompt-gateway .
docker run -p 8000:8000 prompt-gateway

Environment Variables

HOST=0.0.0.0
PORT=8000
MAX_TOKENS_DEFAULT=2048
COMPRESSION_RATIO_DEFAULT=0.5

See .env.example for all options.
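One way the variables above could be read at startup, sketched with `os.environ` and the documented defaults. The `Settings` name is hypothetical; see the gateway code and `.env.example` for the actual mechanism.

```python
import os
from dataclasses import dataclass, field

@dataclass
class Settings:
    """Gateway settings, read from the environment with the documented defaults."""
    host: str = field(default_factory=lambda: os.environ.get("HOST", "0.0.0.0"))
    port: int = field(default_factory=lambda: int(os.environ.get("PORT", "8000")))
    max_tokens_default: int = field(
        default_factory=lambda: int(os.environ.get("MAX_TOKENS_DEFAULT", "2048")))
    compression_ratio_default: float = field(
        default_factory=lambda: float(os.environ.get("COMPRESSION_RATIO_DEFAULT", "0.5")))

settings = Settings()
```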

🤝 Contributing

Contributions are welcome! Please check out CONTRIBUTING.md for guidelines.

  1. Fork the repo
  2. Create a feature branch (git checkout -b feature/amazing)
  3. Commit your changes (git commit -m 'Add amazing feature')
  4. Push to the branch (git push origin feature/amazing)
  5. Open a Pull Request

📄 License

MIT License - see LICENSE file for details.
