shahidmalik4/README.md

Hi, I'm Shahid Malik 👋

Data Analyst → Analytics Engineering
Building reliable pipelines, scalable data models, and the infrastructure that makes analytics trustworthy.


👨‍💻 About Me

4+ years working with sales, CRM, and supply chain data. I understand what breaks pipelines, what makes dashboards unreliable, and what actually drives decisions for business teams.

Lately, my work has been focused on:

  • dbt models
  • ELT pipelines
  • Airflow workflows
  • data quality checks
  • Snowflake data warehousing

The goal throughout: analytics that people can actually trust.

What I've shipped at work:

  • Reduced manual reporting by 30–40%
  • Built data quality frameworks that improved forecasting accuracy across planning teams
  • Developed data systems used by leadership, contributing to:
    • PKR 89M+ revenue growth
    • 11% margin improvement

🛠 Tech Stack

| Layer | Tools |
| --- | --- |
| Transformation | dbt, SQL |
| Orchestration | Airflow |
| Warehouse | Snowflake, PostgreSQL |
| Language | Python, FastAPI |
| Infra | Docker, Git, Linux |
| Analytics | Power BI, Looker, Excel, Pandas |

Concepts & Practices

ELT / ETL Pipelines · Dimensional Modeling · Data Quality Frameworks · Pipeline Orchestration · Modular Data Modeling · Analytics Engineering


🚀 Featured Projects

End-to-end analytics engineering projects covering pipeline design, data modeling, orchestration, and dashboard delivery.

| # | Project | Stack |
| --- | --- | --- |
| 1 | dbt-airflow-data-pipeline | dbt · Airflow · Metabase · Postgres · FastAPI |
| 2 | analytics-pipeline-fastapi-dbt | dbt · Streamlit · Postgres · FastAPI |
| 3 | python-elt-demo | Python · Pandas · Postgres · SQLAlchemy · Docker |

📂 Explore all repositories →


📫 Let's Connect

If you're building a modern data stack, hiring for AE roles, or working on supply chain and revenue analytics — I'd love to connect.


Pinned

  1. 120-years-of-olympics-analysis-using-powerbi

    The "120 Years of Olympic Games: A Comprehensive Review" project aimed to analyze historical Olympic Games data spanning more than a century. Utilizing data sourced from Kaggle, the project focused…

    2

  2. crm-analytics-using-powerbi

    The "Boosting Profitability: Optimizing Sales via CRM and Analytics" project involved the extraction, manipulation, and analysis of sales data sourced from SQL Server. Utilizing MySQL for data tran…

    2

  3. pandas_marketing_campaign_analysis

    Imagine that Freedom ran a recent marketing campaign to promote the value proposition of how the debt relief program helps people achieve financial freedom. Suppose the cost of this campaign was $6…

    Jupyter Notebook 1

  4. harnessing-sql-for-sales-insights-and-improvement

    The "Harnessing SQL for Sales Insights and Improvement" project involved comprehensive analysis of sales data extracted from SQL Server, imported into MySQL for data preparation, cleaning, normaliz…

    1

  5. investigating-patterns-in-120-years-of-olympic-data

    The project involved the extraction, cleaning, and manipulation of an Olympic Games dataset sourced from Kaggle. Using Excel for initial cleaning and MySQL for data preparation, normalization, and ana…

    1

  6. pyspark-airflow-postgres-etl

    This project sets up an ETL pipeline using PySpark and Apache Airflow to extract data from a PostgreSQL database, transform it, and load it into a Railway PostgreSQL cloud database. The PySpark scr…

    Python 1