This project conducts a comprehensive SQL analysis on e-commerce data from Target Brazil (2016-2018). The goal is to extract meaningful insights to guide strategic business decisions, including:
- π Regional expansion
- π Shipping optimization
- π³ Payment method analysis
- π Customer retention strategies
The entire analysis is performed using Google BigQuery.
- BigQuery Dataset: π View Here
- Datasets: π Google Drive Link
The dataset consists of 100,000+ orders placed in Brazil from 2016 to 2018, covering:
- π₯ Customers & Sellers
- π¦ Order details & items
- π Geolocation data
- π° Payment information
- π· Product attributes
- β Customer reviews
The data is stored in 8 CSV files:
- π
customers.csv- Customer details (location, ID, etc.) - πͺ
sellers.csv- Seller information - π
order_items.csv- Order details (items, price, shipping, etc.) - π
geolocation.csv- Customer & seller location data - π³
payments.csv- Payment details (type, value, installments, etc.) - βοΈ
reviews.csv- Customer feedback & ratings - π¦
orders.csv- Order timestamps & statuses - π
products.csv- Product descriptions, weight, dimensions, etc.
The SQL analysis leverages advanced querying techniques such as:
- π Window functions
- π Common Table Expressions (CTEs)
- π Complex Joins
- π Orders grew steadily from 2016 to 2018.
- π― Peak order months: May, July, and August.
- β³ Most orders are placed in the Afternoon (13:00 - 18:00 hrs).
- π High customer density: Minas Gerais (MG) & Rio de Janeiro (RJ).
- π Low customer density: Roraima (RR) & AmapΓ‘ (AP) - potential for targeted marketing.
- π° Order costs increased by 20% (2017-2018, Jan-Aug).
- π Top spending states: SΓ£o Paulo (SP), Minas Gerais (MG), ParanΓ‘ (PR).
- π― Identified top 10 highest-spending customers for loyalty programs.
- π Longest delivery times: Roraima (RR), AmapΓ‘ (AP), Amazonas (AM) (>23 days avg.)
- π Fastest deliveries: SΓ£o Paulo (SP), ParanΓ‘ (PR), Minas Gerais (MG) (<15 days avg.)
- π² Highest freight costs: ParaΓba (PB), Acre (AC), RondΓ΄nia (RO).
- π³ Credit Cards dominate across all years.
- π Voucher usage is declining from 2017 onwards.
- π Orders with 1-10 installments are most common; very few use >10 installments.
- πΎ SQL (Google BigQuery Legacy SQL)
- βοΈ Google BigQuery (Cloud-based Data Analysis)
- π Data Visualization (Tableau, Google Data Studio)
- π
SQL Target Data Analysis.sql- Contains 50+ SQL queries used in the project. - π
Target SQL Business Case.pdf- Visual summary of insights & trends. - π
Project Target SQL Description.pdf- Detailed project description & objectives.
- π Enhance Delivery Optimization: Reduce delivery times in delayed regions.
- π³ Improve Payment Strategies: Address declining voucher usage, promote installment-based purchases.
- π Regional Expansion: Identify opportunities in states with fewer customers.
- π― Personalized Marketing: Use customer spending insights for targeted campaigns.
For questions, feedback, or collaboration opportunities, feel free to reach out:
- π LinkedIn: Venkata Sai Teja Mothukuri
π This project showcases data-driven decision-making for e-commerce growth and optimization using SQL & BigQuery. π