Full-Stack AI Engineer with expertise in building production-ready AI
applications
from model development to deployment. I specialize in RAG
systems,
LLM optimization, and scalable backend architecture.
PYTORCH |
TENSORFLOW |
SKLEARN |
HUGGINGFACE |
LANGCHAIN |
OPENAI |
PINECONE |
CHROMADB |
MLFLOW |
W&B |
ANTHROPIC |
LLAMAINDEX |
FAISS |
WEAVIATE |
N8N |
ZAPIER |
REACT |
NEXT.JS |
TYPESCRIPT |
NODE.JS |
EXPRESS |
ELECTRON |
FLASK |
FASTAPI |
.NET |
GRAPHQL |
PYTHON |
JAVASCRIPT |
POSTGRESQL |
MYSQL |
MONGODB |
FIREBASE |
DYNAMODB |
REDIS |
SQLITE |
WEAVIATE |
SUPABASE |
AWS |
DOCKER |
KUBERNETES |
ACTIONS |
GITLAB CI |
PROMETHEUS |
GRAFANA |
GIT |
POSTMAN |
JEST |
RABBITMQ |
AUTH0 |
OKTA |
CELERY |
PYTEST |
VS CODE |
🔮 recallmSemantic LLM Cache Semantic cache layer for LLM APIs — embeds prompts locally, finds near-matches, and skips redundant LLM calls to slash costs and latency. |
⚡ SnagOpen-Source Webhook Inspector Lightweight webhook inspector with a CLI tunnel, real-time web console, request replay, configurable forwarding rules, and native MCP support for AI agents. |
🤖 DocuChatCross-Platform AI PDF Assistant Desktop app for intelligent Q&A over PDFs with RAG architecture, multiple LLM support (OpenAI, Claude, local models), and real-time chat with source citations. |
📋 FolioOffline-First Todo App Beautiful, fast, offline-capable task manager with optional Web Push notifications, service worker caching, and a responsive TypeScript frontend. |
🤖 AI/ML RAG Systems • LLM Fine-tuning (LoRA/QLoRA) • Vector DBs (FAISS, Weaviate) • Prompt Engineering • n8n/Zapier Automation
🌐 Full-Stack React • Next.js • Node.js • .NET • Electron • FastAPI • WebSockets • gRPC
☁️ Cloud AWS (EC2, S3, Lambda, SageMaker, ECS) • Docker • Kubernetes • CI/CD
📊 Data PostgreSQL • MongoDB • Redis • SQLite • Pinecone • ChromaDB • Weaviate
I'm always open to collaborating on AI-powered applications and full-stack projects!