
Actual footage from production (every single day)
๐ Professional Landing Page โข ๐ฎ Interactive Animation โข ๐ Download CV
๐ Landing Page
|
๐ฎ Interactive Animation
|
๐ Professional CV
|
๐ Explore the Full Experience โ
Real-time data โข Interactive elements โข Professional design โข Open source
Fork this repository and customize! Complete implementation with landing page, interactive animations, and auto-updating workflows.
Last updated: automatically every morning โข Status: ๐ฅ Everything is fine ๐ฅ
Data Engineer | Open Data Advocate | Analytics Pipeline Architect
"In Data We Trust, In Backups We Believe"
Transforming public data into accessible insights. Building scalable data solutions with open-source tools.
Digital Ecosystem โข Featured Project โข Tech Stack โข Other Projects โข Achievements โข Connect
๐ฏ Core Mission: Democratizing Data Access
Building bridges between complex public datasets and accessible insights
- ๐ 4+ Contributors on Osservatorio platform with growing community
- โก <100ms analytics query performance optimization
- ๐ 65% Test Coverage across production-ready codebases
- ๐ 15+ Open Source repositories supporting data democracy
๐ญ Osservatorio - Open Data Analytics Platform
Osservatorio democratizes access to Italian statistical data through automated pipelines and intuitive visualizations. Growing community with 4+ active contributors and production-ready infrastructure.
- Robust ETL pipelines for ISTAT data with automatic retries and circuit breakers
- Interactive Streamlit dashboards (React coming soon) for demographic and socio-economic analysis
- Multi-format export (CSV, Excel, Parquet) for maximum interoperability
- Contributor-friendly architecture with complete documentation and 65% test coverage
- Active community with regular discussions and collaborative development
Implementing hybrid persistence (DuckDB + SQLite to PostgreSQL) for <100ms analytics queries. Seeking contributors for data modeling and performance optimization. Join the discussion โ
๐ DataProfiler - High-Performance Data Quality Analysis
Fast, lightweight library and CLI tool for CSV and JSON data profiling written in Rust. Handles large files (GB+) with intelligent sampling and provides professional HTML reports.
- โก Lightning-fast analysis: Milliseconds for small files, ~3s for 115MB with 99.6% accuracy
- ๐ Comprehensive profiling: Auto-detects data types, nulls, duplicates, outliers, format inconsistencies
- ๐ Scalable architecture: Smart sampling for large datasets without memory issues
- ๐จ Professional output: Colored terminal display and HTML reporting
- ๐ฆ Rust performance: Zero-runtime dependencies, memory-safe, ultra-fast execution
Category | Technologies | Status |
---|---|---|
Data Processing | Python, pandas, numpy, dbt-core | ๐ข Production Ready |
Systems Programming | Rust, CLI tools, high-performance computing | ๐ก Actively Learning |
Storage & DB | DuckDB, PostgreSQL, SQLite, Parquet | ๐ข Optimized |
Analytics & BI | streamlit, Power BI, Plotly, Excel | ๐ข Dashboard Heaven |
Orchestration | Poetry, GitHub Actions, Docker, Kubernetes | ๐ก Continuously Improving |
Philosophy | No vendor lock-in, 100% reproducible | ๐ฅ Always On Fire |
data_stack = {
"orchestration": ["dbt-core", "Python 3.11+", "Poetry", "Docker"],
"systems": ["Rust", "CLI tools", "performance-critical apps"],
"storage": ["DuckDB", "PostgreSQL", "SQLite", "Parquet"],
"analytics": ["pandas", "numpy", "streamlit"],
"visualization": ["Power BI", "Plotly", "Excel"],
"current_status": "๐ฅ Everything is fine ๐ฅ"
}
- Data Modeling: Multi-layer architectures (
staging โ core โ marts
) - Pipeline Design: ETL/ELT with integrated validations and audit trails
- Performance Engineering: Query optimization, Rust CLI tools, sub-second data processing
- API Integration: SDMX, JSON, XML parsing from government sources
Miniature Modern Data Stack
|
๐ฏ ATS-ResearchATS Parsing Optimization Research
|
๐ CruscottoPMIBusiness Intelligence per PMI
|
๐ DashboardsBI-ExcelTemplate Excel avanzati per BI
|
Click above to play! A nostalgic breakout game powered by your GitHub activity
๐ Quick Stats: Focus on data engineering โข Automated ETL pipelines โข Open Source advocate โข 85% Python, SQL, Power BI
- Consulting on data engineering and analytics architecture
- Collaborations on open data initiatives and public sector projects
- Speaking engagements on democratizing data access
- Mentoring junior data professionals
- Contributors for Osservatorio project expansion
- Data partnerships with Italian public institutions
- Open source maintainers for knowledge sharing
๐ Professional Experience โข ๐ Download CV โข ๐ญ Featured Project โข ๐ Support Work
๐ Building the future of open data access โข ๐ฏ One pipeline at a time โข ๐ค Together with the community
Available for: Data Engineering Consulting โข Open Source Collaboration โข Technical Mentoring
โจ This entire ecosystem is open source - Fork it, customize it, make it yours!