Data Science Imposters Podcast
47 minutes | Jul 26, 2021
Casual Talk, Space, and Bitemporal Tables
Jordy and Antonio meet after a vacation hiatus or two. They talk space, life, and are even able to sneak in some data science. If you’d like to see our show continue, reach out to the hosts.
32 minutes | Jun 8, 2021
3 Projects That Can Make You a Household* Name
*Household name is a bit of an exaggeration – but certainly a known name in the ML community. Today we are joined by Alex Beutel, KDD Cup co-chair and senior staff research scientist & tech lead / manager at Google, to discuss the three projects for this year’s KDD Cup. The diversity in these projects, and in the possible ways to solve them, is amazing. Even if you didn’t get a chance to officially enter the competition this year, think about how you would solve them, and then listen to how the winners of the contest were able to achieve top scores. These are the three projects that we discuss during the show:
https://compete.hexagon-ml.com/practice/competition/39/
https://ogb.stanford.edu/kddcup2021/
http://www.yunqiacademy.org/
“… KDD 2021, the longest running and largest interdisciplinary conference on data science where the biggest names in the industry and academia come together to drive innovation in AI, machine learning, computer vision and more. Originally planned to take place in Singapore, the conference will be 100% virtual this year. As an added bonus, organizers will offer key content twice, on eight-hour intervals to ensure equitable access across time zones.” Okay, I borrowed that last part from one of their press releases, but if you don’t know KDD, you better ask somebody.
36 minutes | May 31, 2021
‘Data Science’ is an evolution. What does it mean now in the Workplace? w/ Erin Stanton
We didn’t always have ‘Data Science’ but we’ve had data forever. We’re joined by Erin Stanton to learn about the evolution of Data Science in her work and personal life. Erin Stanton has 13+ years of experience in big data and data science, and she currently runs the Global Client Support organization for Virtu Analytics. Erin is known for the big energy she brings to everything she does. Since recently completing her Master’s in Data Science at UC Berkeley, she has been championing machine learning and AI techniques to answer client questions in her day job.
34 minutes | Mar 29, 2021
What are NFTs and why do you care? (Fixed episode link)
Shain and Calvin join us to tell us all about NFTs. NFT stands for non-fungible token, meaning each one represents a unique item, unlike currency, which is fungible and can be aggregated or divided. NFTs could become a bigger industry than digital currencies, so will you be an early adopter or a latecomer?
34 minutes | Mar 9, 2021
I don’t get the vaccine rollout … is using tech for a vaccine appt immoral …
Antonio and Jordy are back at it. No guests, just the two hosts catching up after 3 months of relative silence. Antonio explains to Jordy why he is befuddled by the vaccine rollout … in addition, he asks whether the script he wrote to find a vaccine appointment is immoral … We’d like to hear from you so please reach out.
49 minutes | Dec 28, 2020
Can you remember this? We talk memory with John Graham, Grandmaster of Memory, and USA Memory Champion
What would you remember if you had a better memory? In this episode we speak to the 2018 USA Memory Champion and Grandmaster of Memory, John Graham, aka memoryjohn.com. As imposters, we have lots of questions. John answers them with the exactness that you’d expect from someone who continues to train their memory. John is a great coach, and he even runs us through a mental exercise where we memorize something – listen and see if you can follow along. If you’re interested in learning more, check out John’s website. You can also join Jordy and Antonio as they read Moonwalking with Einstein. Imagine what’s possible if you applied these techniques in 2021.
73 minutes | Dec 14, 2020
Math in Data Science with Professor Margot Gerritsen
Today we are joined by Professor Margot Gerritsen from Stanford University to talk to us about Math in Data Science, Diversity in Data Science, and some of the ideas behind a Growth Mindset. She is co-founder and co-director of the global Women in Data Science (WiDS) conference, reaching more than 120,000 participants annually in more than 60 countries and inspiring thousands of women around the world to pursue careers in STEM or data science. If you haven’t seen Margot’s TEDxStanford talk, you should do so now. Also, the Women in Data Science (WiDS) Conference is taking place on March 8, 2021. Click on the link above for more details, follow WiDS on Twitter @WiDS_Worldwide, or check out their LinkedIn profile: https://www.linkedin.com/company/women-in-data-science-wids-at-stanford-university Margot recommended the book Mindset by Carol S. Dweck, Ph.D. Antonio is already a few chapters in. Here’s the Goodreads link for those interested in learning more. Additional details about Margot: Margot Gerritsen is a prominent Stanford professor who radiates a passion for the computational sciences, including data science. She is professor of energy resources engineering at Stanford, the former director of the Stanford Institute for Computational and Mathematical Engineering, and until recently, the senior associate dean at Stanford’s School of Earth, Energy & Environmental Sciences. Margot is also the Chair of the Board of Trustees of SIAM. She has spent nearly her entire lifetime using mathematics to solve a wide variety of complex problems, including reservoir modeling, coastal ocean flows, and sail design for America’s Cup yachts.
43 minutes | Nov 30, 2020
Discovering interesting rules: Association Rule Learning
We’ve been doing this podcast for over three years, and I’m surprised that we had never come across and discussed the topic of association rules. Association rule learning is a data mining technique that allows you to find rules in your data. It’s pretty intuitive when you see it, but we were surprised how hard it was to talk about during the show. Some resources that we encountered to get our heads around the topic:
An example we take a look at during our conversation: http://www.saedsayad.com/association_rules.htm
Association Rules: https://towardsdatascience.com/association-rules-2-aa9a77241654
mlxtend Association rules documentation and example: http://rasbt.github.io/mlxtend/user_guide/frequent_patterns/association_rules/
Extracting Rules from Medical Health Records: https://www.sciencedirect.com/science/article/pii/S1877050917319658
Association Rule Learning Wikipedia Article: https://en.wikipedia.org/wiki/Association_rule_learning
We know that talking about some of these topics is difficult; sometimes it’s hard to follow, and sometimes we don’t convey the message easily. We hope that our conversations spark some ideas and inspire you to try some of the code yourself or do your own research. Please email us if our shows have spoken to you in one way or another. Also, the first person to email us will get a Data Science Imposters Podcast face mask.
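For listeners who want to try the idea from this episode in code, here is a minimal sketch of how an association rule is usually scored. It is not taken from the episode or from any of the linked resources: the toy shopping baskets and the helper names `support` and `rule_metrics` are made up for illustration, and the three measures (support, confidence, lift) follow their standard textbook definitions.

```python
# Hypothetical toy baskets, invented purely for illustration.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = frozenset(itemset)
    hits = sum(1 for basket in transactions if itemset <= basket)
    return hits / len(transactions)


def rule_metrics(antecedent, consequent, transactions):
    """Score the rule `antecedent -> consequent`.

    support    = share of baskets containing both sides of the rule
    confidence = support(both) / support(antecedent)
    lift       = confidence / support(consequent)
    """
    s_both = support(set(antecedent) | set(consequent), transactions)
    s_antecedent = support(antecedent, transactions)
    s_consequent = support(consequent, transactions)
    if s_antecedent == 0 or s_consequent == 0:
        raise ValueError("both sides of the rule must occur at least once")
    confidence = s_both / s_antecedent
    lift = confidence / s_consequent
    return {"support": s_both, "confidence": confidence, "lift": lift}


# Score the rule "people who buy diapers also buy beer".
metrics = rule_metrics({"diapers"}, {"beer"}, transactions)
print(metrics)
```

On these made-up baskets the rule has a confidence of about 0.75 and a lift of about 1.25. A lift above 1 suggests the two items show up together more often than they would if they were independent, which is roughly what makes a rule "interesting".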
53 minutes | Nov 16, 2020
Stock Market: Investing and Data Science
Jordy and Antonio talk about the stock market. They focus on the data behind stock market movements and also on how you can get started investing in the stock market. Whether you have never traded in the market, are a buy-and-hold investor, or are an avid investor – you’ll enjoy this spirited conversation that could have gone on for much longer. We were excited to do this episode, and when we were done, we realized that there’s so much more to talk about. We won’t know whether to do a follow-up episode unless we hear from our listeners. We look forward to your feedback.
62 minutes | Nov 2, 2020
How Emerging Data Scientists Can Path Creative Careers (And How Employers Can Help Get Them There)
This episode is the panel discussion that we moderated two weeks ago for the Tom Tom Cities Rising Summit (https://www.tomtomfoundation.org/cities-rising-summit). Here’s the description provided for the event: Join regional data science leaders for a discussion that will touch on best practices for college graduates to mid-career professionals who are seeking rewarding employment opportunities, as well as insights into how organizations can build the data science team that best suits their needs. Listen to the perspective of these fantastic panelists:
Miriam Friedel – Director of Software Engineering at Capital One
Kerry Guerrero – Senior Data Scientist at S&P Global
Renee Teate – Director of Data Science at HelioCampus
Subscribe, tell your friends, and if you’d like us to moderate an event at your conference, please reach out.
39 minutes | Oct 19, 2020
Is Feature Engineering a low hanging fruit?
In this episode, Antonio explains to Jordy what he knows about feature engineering from work, Kaggle projects, and general research. Antonio talks about featuretools which he was able to use recently. Feature Engineering appears to be an area that could really enhance machine learning in significant ways. What do you think? Do we need experts to tell us what’s important or is there another way?
43 minutes | Oct 5, 2020
Can Team Sports Take Advantage of Reinforcement Learning?
In team sports, you have players – sometimes a couple, sometimes many players – all going for a goal. This is very different from games or sports with one player. The idea of collaboration makes everything harder. In this episode, Edward Rusu tells us all about Multiagent Reinforcement Learning (aka MARL). Edward is a researcher and developer. He’s very excited about the work he’s doing and cannot wait to share it with you all. We enjoyed our time with Edward, and he’s always up for connecting with anyone interested in the topic. If you’re interested, feel free to reach out to him via LinkedIn: https://www.linkedin.com/in/edward-rusu/ This is our second show about reinforcement learning. We strongly believe that hearing about a topic makes it easier to relate to and will also remove some of the fear of trying out the techniques yourself. Whether you are an avid developer or just come here to learn a bit more, do not be afraid to jump in and try new things. If you’ve gotten all the way down here, drop us an email, give us a nice review, send us an advertiser, or just smile that you read this.
38 minutes | Sep 21, 2020
Do computers learn using positive reinforcement or negative reinforcement?
Nothing feels more like Artificial Intelligence than when a computer learns by itself through repeated simulations. Computers can now master games by simply playing them over and over against themselves. This is reinforcement learning. Today we have David Stroud on the show to explain a bit of reinforcement learning to us. He’ll give us the foundation needed to follow and understand Reinforcement Learning. David Stroud is a Reinforcement Learning Researcher and Lecturer of Information Systems and Quantitative Methods at Troy University. You can find more information about Reinforcement Learning and links to other resources on David’s GitHub page below: https://github.com/RL-Research-Cohiba/Reinforcement_Learning After listening to the episode, do you think computers learn using positive or negative reinforcement? We want to hear your thoughts.
32 minutes | Sep 7, 2020
Did AI write this blog or was it Liam?
Liam Porr wrote ‘Feeling Unproductive? Maybe you should stop overthinking’. The post had thousands of visitors and made it to the top of some blogging site. Liam did come up with the title and some of the text, but he didn’t do it alone. Liam had help from GPT-3. GPT-3 is the latest readily available natural language processing technology. Should we be worried that our podcast will soon be obsolete? NLP to sound and we’re done.
57 minutes | Aug 24, 2020
A/B Testing, Data Science, & more with Eric Schles
We asked Eric Schles to come on the show to explain A/B Testing to us. He didn’t stop there. He explained a few other things along the way and, as you’d expect from someone with his vast experience in technology and data science, our conversation took some twists and turns. Eric is writing a Data Science book – https://github.com/EricSchles/datascience_book. Go check it out and encourage him to finish.
42 minutes | Jul 28, 2020
Data Analytics and Predictions made easier
We connect with Shanif Dhanani, Co-founder and CEO of Apteo, to learn how he and his team are making data analytics and predictions easier. If you’re looking to do this in your organization, like we’ve done, you will want to listen and hear about some of the hurdles and how they are being solved.
47 minutes | Jul 13, 2020
Big Data through the eyes and mouth of Arun Murthy, co-founder of Hortonworks & CPO of Cloudera
In this episode, we get to talk to an insider of the big data movement. Arun Murthy is one of the founders of Hortonworks and has spent well over a decade in the big data industry. We ask Arun to walk us through what has changed in the industry, his journey, what is next at Cloudera, and what that brings for enterprise users. Subscribe, join our mailing list, and send us a note to say hello.
41 minutes | Jun 15, 2020
Police Encounters: A data project w/ D. Brian Burghart
According to FatalEncounters.org, there have been 934 police-involved deaths between January 1, 2020 and June 12, 2020. How does a lifelong, award-winning journalist, the former editor/publisher of the Reno News & Review and a former journalism instructor at the University of Nevada, Reno, create one of the leading data sources for US fatal encounters with the police? This is our conversation today. Fatal Encounters is a 501(c)(3) public charity. If you’re interested in donating your time or money, please visit https://fatalencounters.org/donate/ Post-edit Corrections: “The Geezer grant was for $12,000, not $17,000 and the university I was doing the AI stuff with was UMass, not Harvard.”
43 minutes | May 28, 2020
Broadcast Radio-inspired AI technology personalizes song transitions w/ Zack Zalon
“The future of streaming music is more than just playlists of songs with long gaps of silence,” says Co-Founder and CEO of Super Hi-Fi, Zack Zalon. We are excited to have Zack on the show to tell us a bit about his company, data, and technology he uses. Like our show? Subscribe. Write a review. Tell us: (antonio|jordy)@datascienceimposters.com, on twitter @dsimposters, or on our reddit channel r/dsimposters
42 minutes | May 18, 2020
The OODA loop & the Onion Diagram in Data Science w/ David Purdy
We’re still in the midst of COVID-19. We wanted to see how someone like David Purdy would approach this problem from a data and data science perspective. He didn’t let us down, introducing high-level concepts like the OODA loop, developed by Colonel John Boyd for the combat operations process, and his own Onion Diagram, which describes the layers within data science and strategy. David Purdy has worked at Uber and Goldman Sachs, worked on autonomous vehicles, and advises startups on data science and strategy. David is an old friend of the show and has one of the most downloaded episodes, on Building a Machine Learning Platform: https://datascienceimposters.com/2017/11/26/building-a-machine-learning-platform-interview-with-dr-david-purdy/