Data & Platform Engineer (10+ years)
I build scalable data platforms, production ML systems, and AI-powered applications across banking, FMCG, and SMEs.
- Design lakehouse & distributed data systems
- Build end-to-end data pipelines (batch & streaming)
- Develop ML & LLM-powered applications (RAG, agents)
- Productionize systems with cloud-native & DevOps practices
- Languages: SQL, Python, Java, Rust
- Data: Lakehouse, Medallion, Spark/Trino, BigQuery, Iceberg, dbt, Airflow, CDC, Data Quality
- ML/AI: MLOps, RAG, LLMs, Vector DBs
- Infra: Docker, Kubernetes, CI/CD, IaC, Cloud
-
Thailand Air Quality Insights
End-to-end GCP pipeline (Airflow + dbt + BigQuery) for PM2.5 analytics with data quality checks and BI-ready marts. -
E-commerce Pipeline
Medallion architecture with production-grade data modeling. -
Weather Pipeline
DAG-based ingestion pipeline into a cloud warehouse. -
Churn Prediction
ML system with FastAPI inference + containerized deployment. -
Face Recognition
Deep learning system with embedding-based inference and model optimization. -
Small Language Model
Transformer + Rust inference + ONNX optimization. -
Data Platform (OSS)
Lakehouse platform with Iceberg + Trino, autoscaled via KEDA. -
RAG Running Coach
Retrieval-based AI system with evaluation pipeline. -
AI Body Weight Assistant
Multi-agent AI system (ADK + Gemini) with orchestrator pattern, human-in-the-loop workflow, and personalized fitness planning.
- LinkedIn: https://linkedin.com/in/dannykhant
- Email: dannypmkhant@gmail.com


