Skip to content

PacktPublishing/Time-Series-Analysis-with-Spark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

141 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Time Series Analysis with Spark, First Edition

This is the code repository for Time Series Analysis with Spark, published by Packt.

A practical guide to processing, modeling, and forecasting time series with Apache Spark

Yoni Ramaswami

      Free PDF       Amazon      

About the book

Written by Databricks Senior Solutions Architect Yoni Ramaswami, whose expertise in Data and AI has shaped innovative digital transformations across industries, this comprehensive guide bridges foundational concepts of time series analysis with the Spark framework and Databricks, preparing you to tackle real-world challenges with confidence. From preparing and processing large-scale time series datasets to building reliable models, this book offers practical techniques that scale effortlessly for big data environments. You’ll explore advanced topics such as scaling your analyses, deploying time series models into production, Generative AI, and leveraging Spark's latest features for cutting-edge applications across industries. Packed with hands-on examples and industry-relevant use cases, this guide is perfect for data engineers, ML engineers, data scientists, and analysts looking to enhance their expertise in handling large-scale time series data. By the end of this book, you’ll have mastered the skills to design and deploy robust, scalable time series models tailored to your unique project needs—qualifying you to excel in the rapidly evolving world of big data analytics.

Key Learnings

  • Understand the core concepts and architectures of Apache Spark
  • Clean and organize time series data
  • Choose the most suitable modeling approach for your use case
  • Gain expertise in building and training a variety of time series models
  • Explore ways to leverage Apache Spark and Databricks to scale your models
  • Deploy time series models in production
  • Integrate your time series solutions with big data tools for enhanced analytics
  • Leverage GenAI to enhance predictions and uncover patterns

Chapters

  1. What Are Time Series?
  2. Why Time Series Analysis?
  3. Introduction to Apache Spark
  4. End-to-End View of a Time Series Analysis Project
  5. Data Preparation
  6. Exploratory Data Analysis
  7. Building and Testing Models
  8. Going at Scale
  9. Going to Production
  10. Going Further with Apache Spark
  11. Recent Developments in Time Series Analysis

Get to know Author

Yoni Ramaswami Yoni Ramaswami is a Senior Solutions Architect at Databricks with two decades of experience in IT, data, and AI. Recognized for his contributions to projects spanning digitally innovative technologies across industries, Yoni combines thought leadership, architecture, and implementation expertise. Originally from Mauritius, Yoni earned his Diplôme d'Ingénieur from UTC in France and Chalmers in Sweden, grounding his global perspective in both technical rigour and cultural insight. When not devising practical, high-impact solutions, he can be found exploring the lush landscapes of Mauritius with his son.

Other Related Books

About

Time Series Analysis with Spark, published by Packt

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors