|
i'm an AI/ML engineer based in the US, currently building production AI systems at Reallytics.ai and Verticiti. most of my work revolves around getting large language models to do useful things in production — not toy demos, actual systems handling real traffic. before this, i spent years at Afiniti and Cloud Kinetics doing the grunt work of making ML models reliable at scale. fraud detection, voice analytics, enterprise search — the kind of stuff that breaks at 3am and you have to fix. what keeps me going: that moment when an AI agent you built actually solves a problem you didn't explicitly program it for. still hits different every time. right now i'm deep into:
|
|
|
Agentic AI Workflows — Production AI Agents |
RAG Enterprise Search — Retrieval-Augmented Generation |
|
Voice AI Platform — Real-Time Speech AI |
LLM Fine-Tuning (LoRA/QLoRA) — Parameter-Efficient Fine-Tuning |
|
RLHF LLM Optimization — Reinforcement Learning from Human Feedback |
Sentinel Fraud Detection — Explainable AI |
i'm not going to pretend i use everything equally. here's what i actually reach for day-to-day:
the full picture (click to expand)
| daily drivers | Python, PyTorch, FastAPI, Docker, Git, VS Code |
| LLM & GenAI | LangChain, LlamaIndex, HuggingFace Transformers, vLLM, PEFT/LoRA/QLoRA |
| vector & data | FAISS, ChromaDB, Pinecone, PostgreSQL, MongoDB, Redis, Kafka, Elasticsearch |
| cloud & MLOps | AWS (SageMaker, Bedrock, Lambda, ECS), GCP Vertex AI, Azure OpenAI |
| ML frameworks | TensorFlow, scikit-learn, XGBoost, LightGBM, ONNX |
| infrastructure | Kubernetes, Terraform, GitHub Actions, MLflow, Weights & Biases |
i commit a lot. sometimes it's good code, sometimes it's "fix: typo in typo fix".
i publish research notes daily — not polished papers, just honest writeups of what i'm learning and building. think of it as a public lab notebook for generative AI, LLM fine-tuning, RAG, and agentic systems.
Automl For Complex Workflows And Pipelines
|
|
Explainability Techniques For Computer Vision Mode
|
Ai Safety And Alignment Engineering
|
⭐ Starred wislertt/leetcode-py (2026-04-22)
⭐ Starred yeahhe365/Prisma (2026-04-22)
⭐ Starred worldbench/awesome-vla-for-ad (2026-04-22)
⭐ Starred ydyhello/Awesome-VLM-Streaming-Video (2026-04-22)
⭐ Starred TIGER-AI-Lab/VLM2Vec (2026-04-22)
⭐ Starred berylliumsec/nebula (2026-04-22)
⭐ Starred thubZ09/vision-language-model-research (2026-04-22)
⭐ Starred starVLA/starVLA (2026-04-22)
topics discovered daily by a multi-model AI research engine (GPT-4.1, Grok-3, DeepSeek R1, Llama-4)
🔬 AutoML for Complex Workflows and Pipelines
🔬 Fine-Tuning and Customization of LLMs with Parameter-Efficient Techniques
🔬 Retrieval-Augmented Generation (RAG) Systems at Scale
🔬 Reinforcement Learning for Robotics and Autonomous Systems
🔬 Real-time Time Series Forecasting with Streaming Data
🔬 Explainability Techniques for Computer Vision Models
📌 Prompt Template Engine with Variable Injection — Production Pattern (Python) (2026-04-22)
📌 Kubernetes Pod Scaler for GPU Workloads — Production Pattern (Python) (2026-04-21)
📌 Kubernetes Pod Scaler for GPU Workloads — Production Pattern (Python) (2026-04-19)
🤖 Profile auto-updated on 2026-04-22 19:24 UTC


