Created with Sketch.
Machine Learning Guide
68 minutes | Jan 29, 2022
MLA 020 Kubeflow
Conversation with Dirk-Jan Kubeflow (vs cloud native solutions like SageMaker) Dirk-Jan Verdoorn - Data Scientist at Dept Agency Kubeflow. (From the website:) The Machine Learning Toolkit for Kubernetes. The Kubeflow project is dedicated to making deployments of machine learning (ML) workflows on Kubernetes simple, portable and scalable. Our goal is not to recreate other services, but to provide a straightforward way to deploy best-of-breed open-source systems for ML to diverse infrastructures. Anywhere you are running Kubernetes, you should be able to run Kubeflow. TensorFlow Extended (TFX). If using TensorFlow with Kubeflow, combine with TFX for maximum power. (From the website:) TensorFlow Extended (TFX) is an end-to-end platform for deploying production ML pipelines. When you're ready to move your models from research to production, use TFX to create and manage a production pipeline. Alternatives: Airflow MLflow
75 minutes | Jan 13, 2022
MLA 019 DevOps
Chatting with co-workers about the role of DevOps in a machine learning engineer's life Expert coworkers at Dept Matt Merrill - Principal Software Developer Jirawat Uttayaya - DevOps Lead The Ship It Podcast (where Matt features often) Devops tools Terraform Ansible Pictures (funny and serious) Which AWS container service should I use? A visual guide on troubleshooting Kubernetes deployments Public Cloud Services Comparison Killed by Google aCloudGuru AWS curriculum
6 minutes | Nov 7, 2021
MLA 018 Descript
(Optional episode) just showcasing a cool application using machine learning Dept uses Descript for some of their podcasting. I'm using it like a maniac, I think they're surprised at how into it I am. Check out the transcript & see how it performed. Descript The Ship It Podcast How to ship software, from the front lines. We talk with software developers about their craft, developer tools, developer productivity and what makes software development awesome. Hosted by your friends at Rocket Insights. AKA shipit.io Brandbeats Podcast by BASIC An agency podcast with views on design, technology, art, and culture. Explore the new microsite at www.brandbeats.basicagency.com
64 minutes | Nov 6, 2021
MLA 017 AWS Local Development
Show notes: ocdevel.com/mlg/mla-17 Developing on AWS first (SageMaker or other) Consider developing against AWS as your local development environment, rather than only your cloud deployment environment. Solutions: Stick to AWS Cloud IDEs (Lambda, SageMaker Studio, Cloud9 Connect to deployed infrastructure via Client VPN Terraform example YouTube tutorial Creating the keys LocalStack Infrastructure as Code Terraform CDK Serverless
60 minutes | Nov 5, 2021
MLA 016 SageMaker 2
Part 2 of deploying your ML models to the cloud with SageMaker (MLOps) MLOps is deploying your ML models to the cloud. See MadeWithML for an overview of tooling (also generally a great ML educational run-down.) SageMaker Jumpstart Deploy Pipelines Monitor Kubernetes Neo
47 minutes | Nov 4, 2021
MLA 015 SageMaker 1
Show notes Part 1 of deploying your ML models to the cloud with SageMaker (MLOps) MLOps is deploying your ML models to the cloud. See MadeWithML for an overview of tooling (also generally a great ML educational run-down.) SageMaker DataWrangler Feature Store Ground Truth Clarify Studio AutoPilot Debugger Distributed Training And I forgot to mention JumpStart, I'll mention next time.
52 minutes | Jan 18, 2021
MLA 014 Machine Learning Server
Server-side ML. Training & hosting for inference, with a goal towards serverless. AWS SageMaker, Batch, Lambda, EFS, Cortex.dev
47 minutes | Jan 3, 2021
MLA 013 Customer Facing Tech Stack
Client, server, database, etc.
31 minutes | Nov 9, 2020
MLA 012 Docker
Use Docker for env setup on localhost & cloud deployment, instead of pyenv / Anaconda. I recommend Windows for your desktop.
42 minutes | Nov 8, 2020
MLG 032 Cartesian Similarity Metrics
Show notes at ocdevel.com/mlg/32. L1/L2 norm, Manhattan, Euclidean, cosine distances, dot product Normed distances link A norm is a function that assigns a strictly positive length to each vector in a vector space. link Minkowski is generalized. p_root(sum(xi-yi)^p). "p" = ? (1, 2, ..) for below. L1: Manhattan/city-block/taxicab. abs(x2-x1)+abs(y2-y1). Grid-like distance (triangle legs). Preferred for high-dim space. L2: Euclidean. sqrt((x2-x1)^2+(y2-y1)^2. sqrt(dot-product). Straight-line distance; min distance (Pythagorean triangle edge) Others: Mahalanobis, Chebyshev (p=inf), etc Dot product A type of inner product. Outer-product: lies outside the involved planes. Inner-product: dot product lies inside the planes/axes involved link. Dot product: inner product on a finite dimensional Euclidean space link Cosine (normalized dot)
34 minutes | Nov 8, 2020
MLA 011 Practical Clustering
Kmeans (sklearn vs FAISS), finding n_clusters via inertia/silhouette, Agglomorative, DBSCAN/HDBSCAN
26 minutes | Oct 28, 2020
MLA 010 NLP packages: transformers, spaCy, Gensim, NLTK
NLTK: swiss army knife. Gensim: LDA topic modeling, n-grams. spaCy: linguistics. transformers: high-level business NLP tasks.
24 minutes | Nov 6, 2018
MLA 009 Charting tools
matplotlib, Seaborn, Bokeh, D3, Tableau, Power BI, QlikView, Excel
24 minutes | Oct 26, 2018
MLA 008 Exploratory Data Analysis
EDA + charting. DataFrame info/describe, imputing strategies. Useful charts like histograms and correlation matrices.
16 minutes | Oct 16, 2018
MLA 007 Jupyter Notebooks
Run your code + visualizations in the browser: iPython / Jupyter Notebooks.
19 minutes | Jul 19, 2018
MLA 006 Salary
Salary based on location, gender, age, tech... from O'Reilly.
26 minutes | Jun 9, 2018
MLA 005 Shapes & Sizes
Dimensions, size, and shape of Numpy ndarrays / TensorFlow tensors, and methods for transforming those.
17 minutes | May 24, 2018
MLA 003 Storage: HDF, Pickle, Postgres
Comparison of different data storage options when working with your ML models.
17 minutes | May 24, 2018
MLA 002 Numpy & Pandas
Some numerical data nitty-gritty in Python.
11 minutes | May 24, 2018
MLA 001 Certificates & Degrees
Reboot on the MLG episode, with more confident recommends.
Terms of Service
Do Not Sell My Personal Information
© Stitcher 2022