🇩🇪

Mohamed HANNANI, LLM Engineer

PROFILE

AI Engineer with 4+ years building production systems — from RAG pipelines and LLM-powered agents to real-time voice AI. Solo-built a multi-service SaaS (FastAPI, LangGraph, PostgreSQL, Next.js) handling live customer conversations across 5 channels. Strong in Python, async architectures, and shipping AI from prototype to production.

EXPERIENCE

AI/ML Consultant

Nov 2024 - Feb 2026, Siegen-Wittgenstein, Germany

Healthcare Manufaktur GmbH

  • Architect and deploy production RAG pipelines using LangChain, Qdrant vector database, and Claude/OpenAI APIs for healthcare document retrieval and intelligent Q&A systems, improving information access efficiency by 40%.
  • Engineer healthcare chatbot with Elasticsearch-powered retrieval, sentence-transformers embeddings, and context-aware generation, processing 1000+ daily user queries with 95% user satisfaction.
  • Build automated web scraping pipelines with Playwright and BeautifulSoup4 for data extraction and vector database ingestion, reducing manual data entry by 85%.
  • Develop geospatial mapping applications using Leaflet, Deck.gl, and PostGIS for hospital network visualization and pharmaceutical territory analysis across 200+ healthcare facilities.
  • Design interactive dashboards with D3.js, React Flow, and custom components for real-time healthcare KPI monitoring and executive decision support.
  • Manage AWS EC2 infrastructure orchestrating 50+ Docker containers with CI/CD pipelines, health monitoring, and zero-downtime deployments.
RAGLangChainQdrantClaude APIElasticsearchVector DatabasesD3.jsReact FlowAWS EC2DockerFastAPI

Data Scientist & AI Researcher

Nov 2023 - Oct 2024, Siegen, Germany

University of Siegen

  • Developed in-context machine translation systems using LLMs (GPT-4, Claude Opus, LLaMA 2) for generic text and subtitle translation, achieving 25% BLEU score improvement over baseline transformer models.
LLMsMachine TranslationBERTFine-tuningLangChainPrompt EngineeringSentiment AnalysisFastAPI

ML Engineer & Data Scientist

Mar 2022 - July 2023, Casablanca, Morocco

Indatacore

  • Led 3-person ML team to deploy state-of-the-art OCR and information extraction models for Sky Onboarding™ platform, reducing document processing time by 30% and improving accuracy by 15% across 156+ document types.
  • Deployed scalable ML models for Sky ID™ identity verification using Docker and Kubernetes on AWS, implementing A/B testing framework to optimize model performance and achieving 99.2% uptime.
  • Engineered signature detection and extraction system for Sky Signature™ using object detection models (YOLO, Faster R-CNN), processing bank checks and contracts with 97% precision across 62+ check formats.
  • Built production OCR pipeline with Flask API and Tesseract/PaddleOCR integration for bank check validation, achieving 95% accuracy and eliminating manual verification for 10,000+ monthly transactions.
  • Developed automated information extraction system using transformer models and custom NER pipelines for bill processing, achieving 98% accuracy and improving data processing efficiency by 70%.
MLOpsAWSDockerKubernetesPyTorchTensorFlowTransformersOCRFlaskFastAPIETLData PipelineA/B Testing

PROJECTS

Empfio — AI Appointment Booking SaaS

Production AI SaaS handling real customer conversations and booking appointments 24/7 across WhatsApp, SMS, web chat, and voice — solo-built from zero to launch. Architected 6 interconnected services: FastAPI backend (15+ DDD domains, async PostgreSQL, Redis, Celery), LangGraph AI agent with multi-LLM support (GPT-4o, Claude), real-time voice pipeline with ~500ms response time, Next.js 14 dashboard (8 languages), and Stripe billing with API-driven feature registry.

PythonFastAPILangGraphPostgreSQLRedisDockerNext.jsTypeScriptPydanticStripeReal-time Voice AIMulti-LLM

ECO Analyzer – AI-Powered Healthcare Analytics Platform

Designed and built a scalable healthcare analytics platform for German ASV (Ambulant Specialist Care) data, enabling data-driven insights for healthcare administrators and researchers. Developed a microservices architecture using FastAPI, Node.js, PostgreSQL + PostGIS, and Redis to analyze 18,000+ doctors and 200+ specialist care teams across Germany. Implemented geospatial analytics, healthcare network collaboration mapping, and AI-powered insights (GPT-4 / Claude) through interactive dashboards built with Next.js, D3.js, and Leaflet. Focused on performance, security, and enterprise-grade scalability in a regulated healthcare environment.

Healthcare AnalyticsMicroservicesFastAPINode.jsPostgreSQLPostGISNext.jsAI IntegrationGeospatial IntelligenceEnterprise Architecture

Deutsch Tutor - AI-Powered German Vocabulary Learning Bot

Built an AI-powered WhatsApp learning system applying structured morpheme-based word decomposition (Wortzerlegung) for vocabulary acquisition. Developed an async FastAPI backend with PostgreSQL (SQLAlchemy 2.0 async), OpenClaw gateway integration, and Claude AI for real-time linguistic analysis. Implemented SM-2 spaced repetition, quiz workflows, personalized vocabulary tracking, and webhook-based message routing. Containerized with Docker and managed schema evolution via Alembic, ensuring clean architecture and scalable async processing.

PythonFastAPIPostgreSQLAsyncIOOpenClawClaude APIWebhooksDockerSpaced RepetitionLLM Integration

EDUCATION

Master of Data Science

The University of Cadi Ayad

2020 – 2022Marrakech, Morocco

Bachelor of Computer Science

The University of Cadi Ayad

2017 – 2020Marrakech, Morocco

SKILLS

  • Large Language Models (LLMs): GPT, Claude, BERT, LLaMA, Fine-tuning, Prompt Engineering
  • Retrieval Augmented Generation (RAG): LangChain, LlamaIndex, Vector Databases (Qdrant, ChromaDB, Pinecone)
  • Agentic AI & Multi-Agent Systems: OpenClaw, Agent Orchestration, WebSocket Protocols, Tool Integration
  • Knowledge Graphs: Neo4j, Graph Databases, Semantic Relationships, Knowledge Representation
  • Machine Learning & Deep Learning: PyTorch, TensorFlow, Scikit-learn, Keras, Transformers
  • MLOps & Deployment: Docker, Kubernetes, AWS (EC2, S3, Lambda), FastAPI, Model Monitoring, CI/CD
  • Programming: Python (Expert), SQL, Bash, REST APIs, Async/Await, SQLAlchemy
  • Data Engineering: Apache Spark, Kafka, ETL Pipelines, PostgreSQL, MongoDB, Elasticsearch
  • NLP & Computer Vision: Sentence Transformers, OCR, Sentiment Analysis, Machine Translation
  • Web Technologies: React, D3.js, Playwright, BeautifulSoup4, Leaflet, Deck.gl, PostGIS

CERTIFICATES

Generative AI with Large Language Models - Coursera (Aug 2023 - July 2023)
Natural Language Processing with Attention Models - Coursera (June 2021 - Aug 2021)
Machine Learning - Coursera (Apr 2021 - June 2021)
Apply Generative Adversarial Networks - Coursera (May 2021 - July 2021)

LANGUAGES

German: B1
English: C1
French: C1