Skip to content

An introduction to Data Science using Snowflake

License

Notifications You must be signed in to change notification settings

pyladiesams/snowflake-apr2023

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

An introduction to Data Science using Snowflake

Workshop description

In this session, you will learn how Snowflake can enable and accelerate end-to-end Data Science workflows. We will train and deploy a model using the Snowpark library for Python and build a Streamlit app that allows business users to consume the results of our model predictions in a simple web app - all using pure Python, without any front-end experience required.

Requirements

Usage

  • Clone the repository and navigate to the workshop folder
  • Install the required libraries with conda: conda env create -f jupyter_env.yml
  • Activate new environment: conda activate snowpark_pyladies
  • Update connection.json with your Snowflake credentials
  • Install the Snowflake extension for VS code
  • Run the scripts and the notebooks
  • After deploying the model, open a terminal window and run streamlit run 3-Snowpark_Streamlit_Revenue_Prediction.py
  • Congratulations! You have just created a cool app to showcase your predictive powers 🚀

Quick Recap ⛄ : You've successfully performed data engineering tasks and trained a Linear Regression model to predict future ROI (Return On Investment) of variable advertising spend budgets across multiple channels using Snowpark for Python and scikit-learn. And then you created a Streamlit application that uses that model to generate predictions on new budget allocations based on user input.

Video record

Re-watch this YouTube stream

Keep-up the learning

For an up-to-date versions, please refer to the QuickStart Guide.

Credits

This workshop was set up by @pyladiesams and @avecaile and @iamontheinet