Leading the R&D team at A10 Networks to innovate AI-driven solutions for developing and deploying LLM guardrails that safeguard applications against cyber threats. It's been an exhilarating journey, but my passion for artificial intelligence extends far beyond the confines of any single role.
Previously, I was a Research Engineer at DiDi Labs where I architected & implemented a Graph-Based Neural Network to forecast the heading direction of pedestrians within a scene to aid our car's decision-making process. Boosted the model's efficiency from 65% to 72%.
AI/ML
70%
Cloud
50%
System Designing
60%
what i do.
Machine / Deep Learning
Learn & Build LLM Powered Applications
Exploring and researching emerging LLM architectures and advancements.
Fine-tuning LLMs using state-of-the-art techniques like SFT, QLoRA, and RL.
Deploying LLMs efficiently with vLLM, Docker, Helm, and Kubernetes for scalable production use.
recent work.
Here's some of my recent work
all
llms
GPTs From Scratch
Project Overview
GPTs From Scratch
An open source GPT-based model implementation using PyTorch, such as Llama & Mistral. Aim is to gain practical expertise in training and deploying these models to production, as well as a theoretical understanding of the underlying working principles.