class DataEngineer:
    def __init__(self):
        self.name = "Abhishek"
        self.role = "Data Engineer & AI/ML Practitioner"
        self.company = "GHD"
        self.location = "Mumbai, India 🇮🇳"
        self.email = "abhishek2f24@gmail.com"

    @property
    def expertise(self):
        return [
            "Large-scale Data Pipelines & ETL Architectures",
            "Azure Cloud Services & Databricks",
            "Real-time Streaming Analytics (Apache Kafka)",
            "Machine Learning Model Deployment (Azure ML, MLOps)",
            "Business Intelligence & Power BI",
            "Geospatial & Telematics Data Engineering",
        ]

    @property
    def passions(self):
        return ["Chess", "Stock Markets", "MLOps", "Cloud Computing"]

    def __str__(self):
        return "Building data systems that scale, learn, and deliver business impact."
End-to-end telematics pipeline for Codelco that processes millions of GPS records from mining trucks. Includes haversine distance calculation, elevation profiling from JP2 rasters, WGS84-to-UTM coordinate transforms, and Azure SQL bulk-insert workflows. Stack: Python · PySpark · Azure SQL · Pandas · Rasterio
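As a rough illustration of the haversine step mentioned above, here is a minimal standard-library sketch. It is not the project's actual code: the function names `haversine_km` and `track_length_km`, the mean Earth radius constant, and the `(lat, lon)` tuple layout are all assumptions made for this example.

```python
import math

# Assumed constant: mean Earth radius in kilometres.
EARTH_RADIUS_KM = 6371.0088


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two WGS84 points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    # Clamp to 1.0 so floating-point rounding cannot push asin out of its domain.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


def track_length_km(points):
    """Sum of segment distances along an ordered list of (lat, lon) tuples."""
    return sum(haversine_km(*p, *q) for p, q in zip(points, points[1:]))
```

For millions of GPS records, the same formula would more likely be applied column-wise (for example over Pandas or PySpark columns) than point by point as shown here.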
Two-stage classification pipeline (Logistic Regression + Random Forest) deployed as an Azure ML Batch Endpoint. Integrated with Azure Data Factory orchestration and Power BI dashboards for operational visibility. Stack: Azure ML · ADF · Power BI · scikit-learn · Python
Production-grade real-time event streaming platform using Apache Kafka and PySpark Structured Streaming on Azure Databricks. Designed for high-throughput, low-latency analytics on live data feeds. Stack: Apache Kafka · PySpark · Azure Databricks · Delta Lake
Conversational AI and semantic search engine leveraging transformer-based embeddings and vector similarity. Deployed as a scalable API service with a Flask front-end interface. Stack: Python · HuggingFace · Flask · Azure · NLP
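The vector-similarity part of a semantic search engine like the one described above can be sketched with plain Python. This is an illustrative assumption, not the project's implementation: the names `cosine_similarity` and `top_k` are hypothetical, and the short vectors in the usage note stand in for real transformer embeddings.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def top_k(query_vec, corpus, k=3):
    """Return the k (doc_id, score) pairs most similar to query_vec.

    corpus is a hypothetical dict mapping doc_id -> embedding vector.
    """
    scored = [(doc_id, cosine_similarity(query_vec, vec))
              for doc_id, vec in corpus.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]
```

With a toy corpus such as `{"a": [1, 0], "b": [0, 1], "c": [1, 1]}` and the query `[1, 0]`, `top_k` ranks `"a"` first and `"c"` second. In practice the vectors would come from an embedding model, and a vector index would typically replace the linear scan once the corpus grows.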
Multi-dataset Power BI report ingesting data via CKAN OData API endpoints using a reusable Power Query function pattern, with scheduled weekly refresh on Power BI Service. Stack: Power BI · Power Query · OData API · DAX
Production-ready PowerShell automation that inventories Azure Resource Group assets (including dependent resources, activity metrics, auth identity providers, and instance counts) for cloud migration assessments. Stack: PowerShell · Azure CLI · Azure Resource Manager

