AI agent engineering

AI agents, engineered like infrastructure.

Reliable under load, fully observable, and security-first — built for production, not demos.

agents/shipping BUILDING ✓

The Lab

What we're building

Open, benchmarked, production-minded agent systems. Each one ships with a writeup and the numbers behind it.

IN BUILD Project 01

Go LLM Gateway

Multi-provider routing, cost attribution, OpenTelemetry tracing, and a streaming proxy — engineered for predictable tail latency under high concurrency. Benchmarked head-to-head against LiteLLM.

GoOTelp95 latencystreaming

Read the build →

IN BUILD Project 02

kube-sage

A security-first autonomous Kubernetes remediation agent. Human-in-the-loop approval gates, a strict action allowlist, and prompt-injection defense baked in. A Go operator paired with a planning agent over gRPC.

KubernetesgRPCHITLsecurity

Read the build →

Principle

Predictable

Tail latency, backpressure, and graceful degradation treated as first-class requirements — not afterthoughts.

Principle

Observable

Every request traced and attributed. If you can't see it, you can't run it in production.

Principle

Safe by default

Allowlists, approval gates, and injection defenses so an agent can act without acting recklessly.

About

Built by someone who runs real systems.

AgenticCore Labs is led by Harpreet Singh — a software engineer with a decade spent building and operating distributed systems at scale.

10+

Years building production systems

150M+

Transactions handled per day

~1,700

Transactions per second

<250ms

p95 latency at that scale

Track record

Production-grade, not proof-of-concept

Now

Banking-scale infrastructure

Currently leading a team building scalable, secure Go and Java microservices on GCP for banking-scale payment infrastructure — distributed systems that process 150M+ transactions a day at ~1,700 TPS, with p95 latency held under 250ms.

Before

5 years at NetApp

Built distributed Go microservices, a Kubernetes monitoring operator, and an mTLS security framework for enterprise products.

National scale

Notifications for UMANG

Built a distributed, event-driven notification system — email, SMS, and push — on Apache Kafka for UMANG, India's government super-app. It powers citizen alerts across a platform that today serves 80M+ users and 2,000+ government services.

Apache Kafka distributed scalable

Certified

Cloud & Kubernetes

The platforms agents actually run on — proven, not just claimed.

CKAD AWS Developer Associate GCP ACE

The lab

Why AgenticCore Labs

Most AI agents are demos. The hard part is making them production-grade: distributed, scalable, and secure enough to run on real systems — predictable under load, fully observable, and safe to let act. That's the same engineering discipline behind everything I've built, now applied to AI agents, and what I bring to client work.

Writing

Notes from the build.

Deep dives on agent engineering — benchmarks, architecture, and the trade-offs behind each decision.

Articles

Latest posts

UPCOMING Benchmark

Tail latency: a faster LLM gateway vs. LiteLLM

What actually happens to p95/p99 under concurrent load — and the engineering choices that keep it flat.

Coming soon →

UPCOMING Security

Designing a security-first agent framework

Allowlists, human-in-the-loop checkpoints, and defending an autonomous agent against prompt injection.

Coming soon →

UPCOMING Benchmark

Go vs. Rust: which builds the faster agent?

Same gateway, two languages. A head-to-head on throughput and tail latency once both projects ship.

Coming soon →

Work with me

Hire me for agent development.

Building an AI agent, hiring for a role, or have any other question about the work? Send a message — I read every one.

Email
harpreet.singh@agenticcorelabs.com Phone
+91 98787 41775 LinkedIn
dev-harpreet-singh GitHub
eng-harpreet-singh