I’m a hands-on ML Engineer focused on production Generative AI: agentic systems, enterprise RAG, multimodal pipelines (vision/speech/OCR), and multi-GPU / multi-node inference on Kubernetes/Docker.
Since years, I’ve been building AI systems that go from prototype → hardened services → monitored deployments — with a strong bias for clean architecture, reproducibility, and real-world impact.
- Agentic AI + RAG platforms: secure retrieval, tool use, multi-step workflows, evaluation + tracing
- Generative & multimodal pipelines: vision, speech, OCR, diffusion, embeddings + reranking
- GPU infrastructure & inference: vLLM / Hugging Face TGI, Kubernetes/Docker (bare metal + cloud)
- Engineering discipline: refactor research code into modular libraries, APIs, tests, configs (Hydra-style), observability
- Email: gianpaolo.santopaolo@gmail.com
- LinkedIn: https://www.linkedin.com/in/gianpaolosantopaolo/
- Blog: https://genmind.ch/


