Skip to content
View abhishek2f24's full-sized avatar
๐Ÿ‘‹
๐Ÿ‘‹

Block or report abhishek2f24

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
abhishek2f24/README.md

Typing SVG


Visitors GitHub followers Twitter Follow


๐Ÿ‘จโ€๐Ÿ’ป About Me

class DataEngineer:
    def __init__(self):
        self.name        = "Abhishek"
        self.role        = "Data Engineer & AI/ML Practitioner"
        self.company     = "GHD"
        self.location    = "Mumbai, India ๐Ÿ‡ฎ๐Ÿ‡ณ"
        self.email       = "abhishek2f24@gmail.com"

    @property
    def expertise(self):
        return [
            "Large-scale Data Pipelines & ETL Architectures",
            "Azure Cloud Services & Databricks",
            "Real-time Streaming Analytics (Apache Kafka)",
            "Machine Learning Model Deployment (Azure ML, MLOps)",
            "Business Intelligence & Power BI",
            "Geospatial & Telematics Data Engineering",
        ]

    @property
    def passions(self):
        return ["โ™Ÿ๏ธ Chess", "๐Ÿ“ˆ Stock Markets", "๐Ÿš€ MLOps", "โ˜๏ธ Cloud Computing"]

    def __str__(self):
        return f"Building data systems that scale, learn, and deliver business impact."

๐Ÿ› ๏ธ Tech Stack

โ˜๏ธ Cloud & Infrastructure

Azure Azure Databricks Azure ML Azure Data Factory GCP AWS Docker

๐Ÿ”ฅ Big Data & Streaming

Apache Kafka Apache Spark PySpark Hadoop HBase

๐Ÿ Languages & Frameworks

Python SQL PowerShell Flask

๐Ÿค– AI / ML

Azure ML TensorFlow PyTorch scikit-learn

๐Ÿ“Š Data & BI

Power BI Tableau Azure SQL MySQL


๐Ÿš€ Featured Projects

โ›๏ธ Mining Vehicle Telematics Analytics

End-to-end telematics pipeline for Codelco processing millions of GPS records from mining trucks. Includes haversine distance, elevation profiling from JP2 rasters, WGS84โ†’UTM coordinate transforms, and Azure SQL bulk-insert workflows.

Stack: Python ยท PySpark ยท Azure SQL ยท Pandas ยท Rasterio

๐Ÿ”ฎ Incident Prediction ML System

Two-stage classification pipeline (Logistic Regression + Random Forest) deployed as an Azure ML Batch Endpoint. Integrated with Azure Data Factory orchestration and Power BI dashboards for operational visibility.

Stack: Azure ML ยท ADF ยท Power BI ยท scikit-learn ยท Python

โšก Real-Time Streaming Analytics

Production-grade real-time event streaming platform using Apache Kafka and PySpark Structured Streaming on Azure Databricks. Designed for high-throughput, low-latency analytics on live data feeds.

Stack: Apache Kafka ยท PySpark ยท Azure Databricks ยท Delta Lake

๐Ÿง  NLP Chatbot & Semantic Search

Conversational AI and semantic search engine leveraging transformer-based embeddings and vector similarity. Deployed as a scalable API service with a Flask front-end interface.

Stack: Python ยท HuggingFace ยท Flask ยท Azure ยท NLP

๐Ÿ“Š California Open Data Power BI Report

Multi-dataset Power BI report ingesting data via CKAN OData API endpoints using a reusable Power Query function pattern, with scheduled weekly refresh on Power BI Service.

Stack: Power BI ยท Power Query ยท OData API ยท DAX

โ˜๏ธ Azure Resource Inventory Tool

Production-ready PowerShell automation to inventory Azure Resource Group assets โ€” including dependent resources, activity metrics, auth identity providers, and instance counts โ€” for cloud migration assessments.

Stack: PowerShell ยท Azure CLI ยท Azure Resource Manager


๐Ÿ“ˆ GitHub Stats

ย 

๐Ÿค Connect With Me

LinkedIn Twitter Gmail Stack Overflow HackerRank YouTube


Pinned Loading

  1. LLM_ChatGPT LLM_ChatGPT Public

    Implementation of various LLMs in python

    Jupyter Notebook

  2. NoSQL-Schema-Extraction NoSQL-Schema-Extraction Public

    Python

  3. rasa-chatbot rasa-chatbot Public

    Python

  4. MultiClass-Sentence-Classification MultiClass-Sentence-Classification Public

    This repository contains file from preprocessing to deployment(as Azure HTTP trigger function)

    PowerShell

  5. ReinforcementLearning ReinforcementLearning Public

    Jupyter Notebook

  6. OpenCV_projects OpenCV_projects Public

    Jupyter Notebook