Skip to content

Latest commit

 

History

History
122 lines (89 loc) · 6.95 KB

r-predictive-model-introduction.md

File metadata and controls

122 lines (89 loc) · 6.95 KB
title titleSuffix description author ms.author ms.reviewer ms.date ms.service ms.subservice ms.topic monikerRange
Tutorial: Develop predictive model in R
SQL machine learning
In this four-part tutorial series, you develop data to train a predictive model in R with SQL machine learning.
VanMSFT
vanto
monamaki
05/29/2024
sql
machine-learning
tutorial
>=sql-server-2016||>=sql-server-linux-ver15||=azuresqldb-mi-current

Tutorial: Develop a predictive model in R with SQL machine learning

[!INCLUDE SQL Server 2016 SQL MI]

::: moniker range=">=sql-server-ver15||>=sql-server-linux-ver15" In this four-part tutorial series, you use R and a machine learning model in SQL Server Machine Learning Services or on Big Data Clusters to predict the number of ski rentals. ::: moniker-end ::: moniker range="=sql-server-2017" In this four-part tutorial series, you use R and a machine learning model in SQL Server Machine Learning Services to predict the number of ski rentals. ::: moniker-end ::: moniker range="=sql-server-2016" In this four-part tutorial series, you use R and a machine learning model in SQL Server R Services to predict the number of ski rentals. ::: moniker-end ::: moniker range="=azuresqldb-mi-current" In this four-part tutorial series, you use R and a machine learning model in Azure SQL Managed Instance Machine Learning Services to predict the number of ski rentals. ::: moniker-end

Imagine you own a ski rental business and you want to predict the number of rentals that you'll have on a future date. This information will help you get your stock, staff, and facilities ready.

In the first part of this series, you'll get set up with the prerequisites. In parts two and three, you'll develop some R scripts in a notebook to prepare your data and train a machine learning model. Then, in part three, you'll run those R scripts inside a database using T-SQL stored procedures.

In this article, you'll learn how to:

[!div class="checklist"]

  • Restore a sample database

In part two, you'll learn how to load the data from a database into a Python data frame, and prepare the data in R.

In part three, you'll learn how to train a machine learning model in R.

In part four, you'll learn how to store the model in a database, and then create stored procedures from the R scripts you developed in parts two and three. The stored procedures run on the server to make predictions based on new data.

Prerequisites

::: moniker range=">=sql-server-ver15||>=sql-server-linux-ver15"

Restore the sample database

The sample database used in this tutorial has been saved to a .bak database backup file for you to download and use.

::: moniker range=">=sql-server-ver15||>=sql-server-linux-ver15"

Note

If you are using Machine Learning Services on Big Data Clusters, see how to Restore a database into the SQL Server big data cluster master instance. ::: moniker-end

::: moniker range=">=sql-server-2017||>=sql-server-linux-ver15"

  1. Download the file TutorialDB.bak.

  2. Follow the directions in Restore a database from a backup file in Azure Data Studio, using these details:

    • Import from the TutorialDB.bak file you downloaded.
    • Name the target database TutorialDB.
  3. You can verify that the restored database exists by querying the dbo.rental_data table:

    USE TutorialDB;
    SELECT * FROM [dbo].[rental_data];

::: moniker-end ::: moniker range="=azuresqldb-mi-current"

  1. Download the file TutorialDB.bak.

  2. Follow the directions in Restore a database to Azure SQL Managed Instance in SQL Server Management Studio, using these details:

    • Import from the TutorialDB.bak file you downloaded.
    • Name the target database TutorialDB.
  3. You can verify that the restored database exists by querying the dbo.rental_data table:

    USE TutorialDB;
    SELECT * FROM [dbo].[rental_data];

::: moniker-end

Clean up resources

If you're not going to continue with this tutorial, delete the TutorialDB database.

Next step

In part one of this tutorial series, you completed these steps:

  • Installed the prerequisites
  • Restored a sample database

To prepare the data for the machine learning model, follow part two of this tutorial series:

[!div class="nextstepaction"] Prepare data to train a predictive model in R