ali-pic

Ali Bin Kashif

About Me

Ali's Image

Greetings! I am Ali.

I'm a data professional with over a year of experience and with a proven ability to deliver short or long-term projects in data engineering, data workflow automation, data warehousing, and BI realm. I've worked in different domains like e-commerce, SaaS, hospitality, and mobile apps. My passion is to partner with my clients to deliver top-notch, scalable data solutions that maximize your ROI and drive business growth.

My approach? Not just code, but understand the problem and business outcome first, then architect an effective solution using the right technologies and tools.

I specialize in the following data solutions:
✔️ Building data warehouses using modern cloud platforms and technologies.
✔️ Developing and automating data pipelines, real-time streaming & ETL processes.
✔️ Building highly intuitive, interactive dashboards.
✔️ Data Migration (Heterogenous and Homogenous).
✔️ Data Extraction, Scraping, Cleaning, Transformation, and Modelling.
✔️ Data strategy advisory & technology selection/recommendation.
✔️ Data Applications with API integrations.

Looking for someone to get your data problems solved?
Contact me today to explore solutions tailored to your needs!

Let's drive business growth together!

Work

Completed Projects

Ongoing Projects

Jobs

Certifications

Technical Skills

Programming Languages & Dev Tools

power-bi-2021

Python

external-SQL-development-files-those-icons-flat-those-icons

SQL

amazon-web-services

Docker

fastapi-logo

FastAPI

flask-logo

Flask

Cloud Platforms & Version Control

google-cloud

Google Cloud

amazon-web-services

AWS

snowflake

Snowflake

amazon-web-services

Databricks

dbt

dbt

git

Git

github

GitHub

Databases, Warehouse and ETL Tool

mysql-logo

MySQL

postgreesql

PostgreSQL

mongodb

MongoDB

bigquery

BigQuery

bigquery

Redshift

bigquery

GCS

bigquery

S3 Bucket

airflow-logo

Airflow

Data Analytics & Visualization

pandas

Pandas

microsoft-excel-2019--v1

PySpark

Matplotlib

seaborn-logo

Seaborn

power-bi-2021

Power BI

google-looker

Looker

microsoft-excel-2019--v1

Excel

microsoft-excel-2019--v1

QuickSight

Experience

Data Engineer

November 2024 - Present

MarketLytics

At MarketLytics, I specialize in developing scalable data solutions that drive actionable insights and operational and cost efficiency. My work involves building robust pipelines, integrating advanced analytics, and delivering impactful products for our clients.

  • Collaborated with the DE team and built a data pipeline which ingest raw data from Stripe, PayPal, Shopify, Thrivecart, and Clickfunnels into BigQuery with incremental syncs. Created views in BigQuery with scheduled queries and Looker dashboards for unified revenue tracking, reducing manual revenue tracking time by 80% and delivering accurate, actionable reports for business stakeholders.

  • Led a complete refactor of a client’s data infrastructure that was previously scattered across 30+ Google Cloud Functions with no CI/CD or documentation. I consolidated these into a single FastAPI-based microservice, containerized with Docker, and deployed on Cloud Run. Using Terraform, I automated the entire infrastructure setup and introduced GitHub Actions for CI/CD, ensuring automated builds, testing, and deployments. I also integrated Slack alerts and structured logging for better observability. This transformation reduced deployment time by 80%, improved reliability, and made the system fully maintainable with clear version control and documentation.

  • Built and deployed Muawin, a real-time anomaly detection system that uses the SARIMA model to monitor key metrics of e-commerce and send automated Slack alerts and reports. The product was containerized with Docker, deployed on Cloud Run, and uses Flask as a backend.

  • Designed a cost-effective pipeline to migrate web analytics data from BigQuery → GCS → Snowflake, achieving 90% cost savings over the client’s existing solution.

  • Developed complex queries for data transformation and analytics in Databricks and Metabase, enabling data-driven decision-making for stakeholders.

  • Built an automated pipeline to fetch data from the GA4 API into BigQuery, enabling seamless integration of web analytics data for further analysis.

  • Implemented Slack Alerts system in data pipelines for notifications of successful runs and if any errors that occurred, so that prompt action can be made.

  • Built a real-time revenue attribution pipeline triggered by webhooks, transforming and sending session-mapped data into GA4.

  • Built custom GPTs and chatbots integrated with BigQuery, allowing clients to chat with their data and instantly generate tables, visualizations, and aggregated data and get quick insights simply by writing prompts.


Data and AI Trainee

June 2024 - August 2024

MarketLytics

  • Developed an efficient anomaly detection system using Python and the Prophet model to identify unusual patterns and trends.
  • Integrated web analytics data from Google Analytics 4 into Google BigQuery.
  • Implemented real-time alerts on Slack with comprehensive report generation.
  • Created the API using Flask and containerized using Docker.
  • Deployed the pipeline on Google Cloud Run, ensuring a robust and scalable architecture for the pipeline.

  • Product Impact: Muawin helps clients monitor their web analytics data effectively, enhancing their ability to respond swiftly and accurately to data irregularities.

Associate Software Developer

2023 - 2023

The Game Storm Studios Pvt. Ltd.

  • Worked on various Android and iOS games using C# and Unity.
  • Worked closely with senior developers to understand project requirements.
  • Collaborated with the design team to implement user-friendly interfaces.

Game Developer Apprentice

June 2023 - August 2023

Mindstorm Studios

  • Collaborated with other developers and assisted in developing mobile games.
  • Participated in code reviews and contributed to improving code quality.
  • Gained hands-on experience, learned industry best practices and professional behaviour.

PROJECTS

Crypto Data Pipeline — AWS Glue, Lambda, S3, Snowflake, Airflow

A scalable crypto data pipeline that ingests from API, transforms, stores, and loads crypto currencies historical and intra day data using AWS services and Snowflake. Airflow orchestrates the entire pipeline and, CI/CD fully automated through GitHub Actions — built with industry best practices in mind.

Webhooks to BigQuery and GA4 Realtime Data Pipeline

Built this real-time pipeline to track e-commerce purchase events along with customer attribution and session data in real-time by sending structured payloads to both Google Analytics 4 (GA4) and BigQuery (for session data), ensuring marketing teams get actionable insights while storing clean data for revenue attributions.

BigQuery to Snowflake Data Migration Pipeline

​Developed a cost-effective data migration pipeline, reducing expenses by over 90% for transferring 300GB monthly from BigQuery to Snowflake. Utilized Python to automate data export to Google Cloud Storage and configured Snowflake for efficient ingestion, achieving seamless scalability and enhanced performance

Portfolio Highlights

Glimpses of my work, certifications and achievements.

  • All
  • Work
  • Courses
  • Achievements

Crypto Data Pipeline

Python | AWS Glue | Lambda | Snowflake
| Airflow | S3

Python Automated Data Extraction

Large PDF to Structured JSON for AI Agent

BigQuery to Snowflake Data Migration Pipeline

Python | BigQuery | GCS | Snowflake

Webhooks to BigQuery and GA4 Realtime Data Pipeline

Python | BigQuery | Webhooks | Google Cloud Run | GA4

Web Analytics Data Pipeline with Anomaly Alerts

Python | BigQuery | Google Cloud Run | Docker | GA4

GenAI and LLM Powered RAG Chabot for Universities

Python | FastAPI | Langchain | Pinecone

ETL Pipeline for Next Cola's Inventory Analysis

PYTHON, MYSQL, APACHE AIRFLOW, AWS S3

Books Website Data Scrapping

Web Scraping | Python | Scrapy

Sales Dashboard

Power BI | SQL

Telecom 5G Launch Dashboard

Power BI

FMCG Supply Chain Dashboard

POWER BI

Kaggle Profile

Expert Rank

Revenue Insights Dashboard

POWER BI

Pakistan's E-Commerce Dataset Analysis

Python

Hearth Healthcare Data Analysis

Python

Sales Analytics Ad-hoc Insights

SQL | MySQL DATABASE

Exploratory Data Analysis on Al-Quran Dataset

Python

IBA DATATHON 2.0

winner BI Track

Data Manipulation in SQL

DATACAMP

Certified Data Analyst Associate

DATACAMP

Data Analyst in Python

DATACAMP

Power BI Fundamentals

DATACAMP

Intermediate SQL Course

DATACAMP

Marketing Survey Analysis Report

Google Looker Studio

Education & Activites

NED University of Engineering & Technology

BS, Computer Science (2020-2024)

University life has a big impact on my personality grooming. It has improved my communication, team work, problem-solving and leadership skills.

IBA Datathon 2.0

Winner of Business Intelligence Track

Participated and won in Datathon 2.0 organized by IBA Data Science Society held at the Institute of Business Administration, Karachi on 28 April 2024,

NSA NEDUET

Manager Graphics Domain (2020-2021)

NSA is a media society in NED University. I served as a co-director and manager of graphics domain, responsible for managing the team members and all the graphics work of the society.

ACM NUCES - Developer's Day 2023

Winner of Game Development Competition

Participated and won the game development competition sponsored by The Game Storm Studios Pvt. Ltd. held on May 2023.

Student's Welfare Society

Founder (2021 - Present)

Founder and lead of a welfare society for students which is carrying out philanthropy work like Ramazan drives, Ration drives, Winter drives etc. for the needy people in our city, Karachi.

Get in Touch

Location:

Karachi, Pakistan

Call:

+92 333 3552460