Peter Hoffmann – 12 Factor Apps for Data-Science with Python
BooksPython for Data Analysis, 2nd Edition
IPython Interactive Computing and Visualization Cookbook, Second Edition
Jupyter for Data Science
- Cloud Computing
- Infrastructure as a Service (IaaS)
- Platform as a Service (PaaS)
- Desktop as a Service (DaaS)
- Managed software as a Service (MSaaS)
- Mobile backend as a Service (MBaaS)
- Information Technology Management as a Service (ITMaaS)
Centralized hosting of business applications dates back to the 1960s. Starting in that decade, IBM and other mainframe providers conducted a service bureau business, often referred to as time-sharing or utility computing. Such services included offering computing power and database storage to banks and other large organizations from their worldwide data centers.
Snowflake, The Only Data Warehouse Built for the Cloud
Welcome to the Snowflake Documentation
Key Concepts & Architecture
Snowflake is an analytic data warehouse provided as Software-as-a-Service (SaaS). Snowflake provides a data warehouse that is faster, easier to use, and far more flexible than traditional data warehouse offerings.
Snowflake’s data warehouse is not built on an existing database or “big data” software platform such as Hadoop. The Snowflake data warehouse uses a new SQL database engine with a unique architecture designed for the cloud. To the user, Snowflake has many similarities to other enterprise data warehouses, but also has additional functionality and unique capabilities.
The Jupyter Notebook, Formerly known as the IPython Notebook
The Jupyter Notebook
github.com/jupyter/jupyter/wiki/Jupyter-kernels, Jupyter kernels
How do I install different languages in Jupyter Notebook?
Data Scientist / Services for Machine Learning
- WSGI Service
- Jupyter
- pandas – Python Data Analysis Library
- scikit-learn – Machine Learning in Python
- StatsModels – Statistics in Python
- SciPy
- NumPy
- matplotlib
- DASK
- Airflow
- Apache Aurora
- Apache Mesos
- Apache Parquet
- Apache Arrow
- Grafana
- graylog
- Sentry
- Prometheus
- Docker
Jupyter
web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text
pandas
Apache Parquet
Apache Parquet is a columnar storage format available to any project in the Hadoop ecosystem, regardless of the choice of data processing framework, data model or programming language.
Apache Arrow
A cross-language development platform for in-memory data.
Reading and Writing the Apache Parquet Format
Amit Kumar – Let’s Talk About GIL!
SymPy
SymPy is a Python library for symbolic mathematics. It aims to become a full-featured computer algebra system (CAS) while keeping the code as simple as possible in order to be comprehensible and easily extensible. SymPy is written entirely in Python.