
Open to select projects

Building intelligent systems that actually ship.

I'm Riantama Putra, a software & AI engineer based in Jakarta, Indonesia. I design and build intelligent systems end to end, from LLM pipelines to the products around them.

5+ years shipping
20+ systems in prod
3M+ daily AI requests served

02 / Experience

Where I've built

  1. 2024 — Present

    Senior Software Engineer, AI Platform · Aurora Labs

    Leading the team that turns model research into reliable product features.

    • Built the LLM gateway serving 3M+ requests/day with cost- and quality-aware routing
    • Cut inference spend 38% via semantic caching and model tiering
  2. 2022 — 2024

    Machine Learning Engineer · Kanaya Tech

    Owned computer-vision systems end to end — data pipelines to edge deployment.

    • Shipped defect-detection models to 12 factory sites across Southeast Asia
    • Reduced false-positive rate 4× with active-learning retraining loops
  3. 2020 — 2022

    Full-stack Engineer · Studio Delta

    Built and scaled web products for early-stage startups as engineer #3.

    • Took two products from prototype to 100k monthly active users
    • Introduced typed API contracts that halved integration bugs
  4. 2016 — 2020

    B.Sc. Computer Science · Universitas Indonesia

    Focus on machine learning and distributed systems. Graduated cum laude.

03 / About

Engineer first, then the models

I'm a software engineer who went deep on machine learning — not the other way around. That order matters: I care as much about the deploy pipeline, the p95 latency and the on-call story as I do about model quality.

For the last five years I've been building AI-powered products across fintech, manufacturing and developer tools — usually owning the path from ambiguous idea to something running in production. I like small teams, sharp problems, and systems that are boring to operate.

Ship the whole system

A model is 10% of the work. The product, data loops and guardrails around it are the job.

Measure before magic

Evals, tracing and honest baselines before reaching for a bigger model.

Fast is a feature

Latency budgets are product decisions. I design for them from the first commit.

Toolbox

Languages

  • Python
  • TypeScript
  • Go
  • SQL
  • Rust

AI / ML

  • PyTorch
  • LLM fine-tuning
  • RAG systems
  • vLLM
  • LangGraph
  • Evaluation & observability

Web & APIs

  • React / Next.js
  • Node.js
  • FastAPI
  • gRPC
  • Tailwind CSS

Infrastructure

  • Docker
  • Kubernetes
  • AWS / GCP
  • PostgreSQL
  • Kafka
  • Terraform

04 / Contact

Let's build something intelligent.

I take on a small number of projects and roles where the problem is sharp and the bar is high. If that sounds like yours, my inbox is open.

riantamaputra751@gmail.com

Jakarta, Indonesia (UTC+7)