This is the code repository for Time Series Analysis with Spark, published by Packt.
Yoni Ramaswami
Written by Databricks Senior Solutions Architect Yoni Ramaswami, whose expertise in Data and AI has shaped innovative digital transformations across industries, this comprehensive guide bridges foundational concepts of time series analysis with the Spark framework and Databricks, preparing you to tackle real-world challenges with confidence. From preparing and processing large-scale time series datasets to building reliable models, this book offers practical techniques that scale effortlessly for big data environments. You’ll explore advanced topics such as scaling your analyses, deploying time series models into production, Generative AI, and leveraging Spark's latest features for cutting-edge applications across industries. Packed with hands-on examples and industry-relevant use cases, this guide is perfect for data engineers, ML engineers, data scientists, and analysts looking to enhance their expertise in handling large-scale time series data. By the end of this book, you’ll have mastered the skills to design and deploy robust, scalable time series models tailored to your unique project needs—qualifying you to excel in the rapidly evolving world of big data analytics.
- Understand the core concepts and architectures of Apache Spark
- Clean and organize time series data
- Choose the most suitable modeling approach for your use case
- Gain expertise in building and training a variety of time series models
- Explore ways to leverage Apache Spark and Databricks to scale your models
- Deploy time series models in production
- Integrate your time series solutions with big data tools for enhanced analytics
- Leverage GenAI to enhance predictions and uncover patterns
- What Are Time Series?
- Why Time Series Analysis?
- Introduction to Apache Spark
- End-to-End View of a Time Series Analysis Project
- Data Preparation
- Exploratory Data Analysis
- Building and Testing Models
- Going at Scale
- Going to Production
- Going Further with Apache Spark
- Recent Developments in Time Series Analysis
Yoni Ramaswami Yoni Ramaswami is a Senior Solutions Architect at Databricks with two decades of experience in IT, data, and AI. Recognized for his contributions to projects spanning digitally innovative technologies across industries, Yoni combines thought leadership, architecture, and implementation expertise. Originally from Mauritius, Yoni earned his Diplôme d'Ingénieur from UTC in France and Chalmers in Sweden, grounding his global perspective in both technical rigour and cultural insight. When not devising practical, high-impact solutions, he can be found exploring the lush landscapes of Mauritius with his son.
