I design end-to-end data pipelines, generative AI systems, and fraud detection solutions processing millions of daily transactions across GCP and Azure. Published researcher with formal evaluation frameworks for RAG and graph-based anomaly detection.
Compares three LLM agent deployment patterns — Managed Agents, DIY Factory (Python + GCP), and Direct API — across 45 controlled measurements on financial data governance tasks. Reveals that prompt caching inverts the expected cost hierarchy: DIY accumulates 21.68x context tokens while managed caching reduces costs by 90%. Includes security analysis for regulated fintech (PCI-DSS, SBS Peru, Ley 29733) with a practical decision framework for data sovereignty.
Introduces the GRAFID framework with three novel metrics (Feature Richness Index, Graph Signal Gain, Cost-Effectiveness Ratio) to determine when GNNs outperform XGBoost for fraud detection. Evaluated on IEEE-CIS (590K transactions) and Credit Card EU (284K) datasets with 20+ model configurations, statistical validation (Wilcoxon, McNemar, Cohen's d), and multi-seed reproducibility.
Demonstrates that data quality improvements (ground truth correction, embedding deduplication) yielded a 27% improvement in Precision@5 with zero code changes and zero cost, outperforming neural reranking approaches. Based on 28 formal evaluations with 52 reference questions.
Design and prototype of an autonomous two-wheeled mobile robot using inverted pendulum control systems, with stress simulations and material validation in Autodesk Inventor.
Distributed architecture for natural language query (NLQ) processing translating business questions into SQL. Asynchronous WebSocket orchestration with adaptive timeouts (5s→40s), semantic engine using open-source LLMs for NLQ-SQL with ontological schema mappings (50+ business terms), hierarchical fallbacks, and conversational interface on WhatsApp. Democratized BI access for non-technical enterprise users.
Powered by HuggingFace Inference API (all-MiniLM-L6-v2, 384 dimensions).
Ask anything about my professional background, technical skills, research, or projects. Powered by Gemini.
Professional and technical questions only. 5 queries per day.
Open to collaborations, research partnerships, and new opportunities.