Pakistan · Open to remote, globally

Talha Jaleel

RAG POC & LLM Integration Engineer · Senior Python Developer for Hire

I build scalable, cloud-native full-stack systems and production AI — RAG pipelines, LLM apps, and MLOps that ship. Available for project-based contracts. 6+ years across Python, TypeScript, and AWS.

01 — About

Engineer who ships AI to production

I'm a senior engineer focused on the messy middle of AI products — where research meets reliability. I've led full-stack development on consumer apps with 100K+ users, scaled backend services to 50K+ daily requests on AWS, and built RAG and MLOps pipelines that improved production accuracy by 35% and cut deployment time by 40%.

  • 6+ years experience
  • 50K+ daily requests handled
  • 100K+ users shipped to
  • 99.5% deploy success rate

02 — Stack

Tools I reach for

AI / ML & MLOps

  • LLMs
  • RAG
  • LangChain
  • GPT-4
  • Azure OpenAI
  • Prompt Engineering
  • TensorFlow
  • PyTorch
  • Computer Vision
  • NLP
  • Embeddings
  • Pinecone

Languages

  • Python
  • TypeScript
  • JavaScript
  • SQL

Frontend

  • React
  • Next.js
  • Redux
  • Tailwind
  • GraphQL

Backend

  • Django
  • FastAPI
  • Flask
  • Node.js
  • Express
  • Microservices

Cloud & DevOps

  • AWS
  • Azure
  • Docker
  • Kubernetes
  • GitHub Actions
  • Linux

Databases

  • PostgreSQL
  • MongoDB
  • Redis
  • Pinecone

03 — Selected work

Projects with measurable outcomes

04 — Experience

Where I’ve worked

  1. Software Engineer · Nuclieos

    Apr 2024 — Present

    • Production LLM chatbots with RAG, +35% response accuracy
    • MLOps pipelines on Docker/K8s, -40% deployment time
    • AWS backend scaled to 50K+ daily requests
  2. Machine Learning Engineer · ESS

    Apr 2023 — Mar 2024

    • Sales AI Voice Bot MVP using LLaMA + Twilio + Pinecone
    • Real-time CV system with LIDAR + camera at 30fps
    • -45% API latency via Node/Django + Redis caching
  3. Full Stack Developer · Ilsa Interactive

    Nov 2020 — Apr 2023

    • Led development of Fayvo (100K+ users) on React/Redux/TypeScript + Django REST Framework
    • ML recommendation system with neural networks, +28% engagement
    • 99.9% uptime on AWS Docker microservices

05 — AI demo

Chat with my resume

This is a small RAG app: questions are answered by an LLM grounded in my resume content. Same pattern I ship to production — just inlined for the demo.

Note: the demo requires GOOGLE_GENERATIVE_AI_API_KEY to be set in the deployment environment.
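
For the curious, here is a minimal sketch of the retrieve-then-prompt pattern described above. It is not the code behind this demo. The resume snippets are taken from this page. The function names (tokenize, score, retrieve, build_prompt) and the word-overlap scoring are illustrative assumptions standing in for embeddings and a vector store. The LLM call itself is left out: the sketch stops at the grounded prompt that would be sent to the model.

```python
import re
from collections import Counter

# Resume snippets taken from this page; a real app would chunk the full resume.
CHUNKS = [
    "Software Engineer at Nuclieos, Apr 2024 to Present: production LLM "
    "chatbots with RAG, MLOps pipelines on Docker and Kubernetes.",
    "Machine Learning Engineer at ESS, Apr 2023 to Mar 2024: sales AI voice "
    "bot MVP using LLaMA, Twilio and Pinecone.",
    "Full Stack Developer at Ilsa Interactive, Nov 2020 to Apr 2023: led "
    "Fayvo with 100K+ users on React, Redux, TypeScript and DRF.",
]


def tokenize(text):
    """Lowercase the text and count its alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def score(question, chunk):
    """Word-overlap score (assumption: stands in for embedding similarity)."""
    q, c = tokenize(question), tokenize(chunk)
    return sum(min(q[w], c[w]) for w in q)


def retrieve(question, chunks, k=1):
    """Return the k chunks that share the most words with the question."""
    ranked = sorted(chunks, key=lambda ch: score(question, ch), reverse=True)
    return ranked[:k]


def build_prompt(question, chunks, k=1):
    """Build a prompt that restricts the model to the retrieved context."""
    context = "\n".join(f"- {c}" for c in retrieve(question, chunks, k))
    return (
        "Answer using only the resume context below. "
        "If the answer is not there, say so.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )


if __name__ == "__main__":
    # The printed prompt is what would be passed to the LLM.
    print(build_prompt("Which project used Twilio?", CHUNKS))
```

Swapping score for cosine similarity over embeddings changes the ranking quality but not the overall shape: retrieve first, then constrain the model to what was retrieved.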

06 — Contact

Let’s build something

I'm currently open to senior full-stack and AI/ML roles, remote worldwide. The fastest way to reach me is email.