|
I build backend systems that handle real load — not toy demos. The work I'm proudest of usually looks boring from the outside: a sharded write path that doesn't fall over at 1B rows, a workflow engine that uses Kahn's algorithm because cycles are bugs not features, a 300M parameter SLM model I trained from scratch on my laptop because I wanted to actually understand transformers — not import them. I don't care about being right. I care about systems that stay up at 3am. |
role: platform engineer
company: Steps AI
shipping: agent infra
training: 300M param SLM
mood: write more, talk less |
▸ ─────────────────────────────────────────────────────────────── ◂
▸ ─────────────────────────────────────────────────────────────── ◂
parameters: 300M
architecture: decoder-only · custom hybrid
positional: RoPE
attention: multi-head causal · KV-cache
ffn: SwiGLU
norm: Pre-RMSNorm
tokenizer: custom BPE (from scratch)
training: bf16 · MPS · Apple M5
|
status: in production
stack: go · postgres · clickhouse
role: led product team end-to-end
adopted by: The Chatterjee Group
(45-country conglomerate)
Cricut (US consumer tech)
|
▸ ─────────────────────────────────────────────────────────────── ◂
▸ ─────────────────────────────────────────────────────────────── ◂
|
FLAGSHIP Quper used by TCG + Cricut |
COMPETITIVE ICPC '23 regionalist · rank 73 |
ALGORITHMS LeetCode top 7% globally |
SCALE 10K+ users asksenior backend |
HACKATHON Hack-O-Octo winner · blockchain |
▸ ─────────────────────────────────────────────────────────────── ◂
|
Refactored core modules to NestJS, built repo analytics tooling for the API testing platform. |
100+ tests added, full PostgreSQL upgrade, comprehensive API documentation rewrite. |
▸ ─────────────────────────────────────────────────────────────── ◂




