Created with Sketch.
The Internet Report
18 minutes | 18 days ago
Ep. 29: 2020 Election—The Internet Held Strong with a Few App Performance Glitches (Week of Nov. 2- 8)
This is The Internet Report, where we uncover what’s working and what’s breaking on the Internet—and why. This week, we’re pleasantly surprised to say that the network did not break, and there were no major election-night outages to report. However, that’s not to say we didn’t catch performance glitches in the days and weeks around the big night. Watch this week’s episode, as we cover performance issues at a Secretary of State website as well as why CNN’s election map website was so slow to load for many.
15 minutes | a month ago
Ep. 28: 2020 Election Special: Going Under the Hood on State Election Websites (Week of Oct. 19-25)
We’ve got an election coming up here in the US, and over the last several weeks, we have been analyzing a dozen or so state election websites to take a closer look at how they’re hosted (e.g., do they use a CDN or are they self-hosted?) and to monitor them for outages. In this episode, we discuss the pros and cons of each hosting method and dive into some examples we’ve seen where election websites have had unexpected performance degradation. Catch this week’s episode to go under the hood on the websites powering the upcoming presidential election—and don’t forget to get out there and vote!
7 minutes | a month ago
Ep. 27 No, Twitter Wasn’t Hacked and Zayo Goes Bump in the Night (Week of Oct. 12-18)
. In this week’s episode, we discuss two notable outages that happened last week. The first, at Twitter, took place on October 15 around 5:30 pm PST and impacted users’ ability to tweet or re-tweet. According to Twitter’s official statement, an internal system error was the culprit—putting to bed any theories of another hack. The second outage took place at the transit provider, Zayo, in the early morning hours of October 13. Although the outage seemed to mostly involve interfaces on the US west coast, Denver and the southwest (as well as a handful of other global locations), the impact of the outage was not very severe due to the time of the outage, which was outside of US business hours. Watch this week’s episode to hear more about these two outages.
19 minutes | 2 months ago
Ep. 26 The case of an overloaded database and what happens when a bug bites (Week of Oct. 4-11)
This is The Internet Report, where we uncover what’s working and what’s breaking on the Internet—and why. In this week’s episode, we dive into a recent outage at Slack that caused intermittent issues for its enterprise users (including ourselves) for nearly a full day. The cause, as noted by Slack, was on the backend and related to an overloaded database. Next, we dig into another outage at Microsoft. According to their statement, a bug in an internal update seems to have revoked the routes to a number of devices that were believed to be unhealthy—thereby creating congestion in the rest of their network. This explanation jives with the increased packet loss we observed during this time period. Don’t miss this week’s episode, where we walk through these outages in depth
16 minutes | 2 months ago
Ep. 25: Microsoft's Monday Outage Is a Lesson in App Complexity; Plus, Digging into Telstra’s BGP Hijack (Week of Sept. 28-Oct. 4)
This is The Internet Report, where we uncover what’s working and what’s breaking on the Internet—and why. On today’s episode, we dive into a recent Azure AD disruption that significantly impacted access to Microsoft cloud services and apps (as well as third-party apps) for nearly three hours. We then went under the hood on a recent BGP hijacking in which Telstra began announcing routes to services that didn’t belong to it, such as Quad9. Catch this episode to hear our take on these incidents, and see below for show links, some additional commentary on these outages, and a sneak preview of next week’s episode.
13 minutes | 2 months ago
Ep. 24The TikTok Shutdown Showdown Continues, and WeChat Gets Muzzled (Week of Sept. 14-20)
On today’s episode, Angelique and I cover off on a couple outages that occurred over the past week. First, we discuss an application outage at Instagram that occurred on September 17th and lasted around 30 minutes. We also discuss a network outage on September 14th on the AWS backbone near Columbus, Ohio. This outage was a little more widespread, affecting nearly 100 interfaces and lasting around 30 minutes. Next, we dive into the upcoming bans on WeChat and TikTok, which have now been temporarily extended by a Federal judge, and then we walk through some of the network architecture differences between these two applications and how a potential shutdown could be enforced.
11 minutes | 2 months ago
Ep. 23: You’ve Got Questions, We’ve Got Answers: Upstream Providers and the Reality of SLAs ( Week of Sept. 7-13)
It was another quiet week on the Internet, so we wanted to spend some time answering your questions around some recent outages. Catch this episode as we discuss how you can understand the upstream relationships of the services you rely on to assess your risk profile. We also cover why SLAs fall short in protecting your business in the event of an outage, and why you need to proactively collaborate with your providers to solve issues faster.
44 minutes | 3 months ago
Ep. 22: Even the Internet Enjoys a Long Weekend; Plus, Digging Into a Recent CDN Outage (Week of Aug. 31- Sept. 6)
The Internet held up reasonably well over the past week, all things considered. There were no major outages to report, which is a welcome repose for those impacted by the major outages the week prior. While it’s not an outage that occurred this past week, we did want to spend some time covering the recent Verizon Edgecast outage that occurred on August 21st. Watch this episode as we dive into this application-level outage to understand exactly what happened and who might have been impacted.
44 minutes | 3 months ago
Ep. 21 Under the Hood On the CenturyLink / Level 3 Outage (Week of Aug. 24-30)
This is the Internet Report, where we uncover what’s working and what’s breaking on the Internet—and why. It was a rough week on the Internet last week, with outages and incidents across multiple services and providers including Slack, Zoom, AWS, and Verizon. However, in today’s episode we’re going to focus exclusively on Sunday’s CenturyLink / Level 3 outage that according to Cloudflare, caused a significant 3.5% drop in global Internet traffic, making it one of the most significant internet outages ever recorded.
22 minutes | 3 months ago
Ep. 20 An IXP And A Streaming Music Provider Walk Into An Outage Bar (Week of Aug. 17-23)
his is the Internet Report, where we uncover what’s working and what’s breaking on the Internet—and why. On this week’s episode, Archana and I cover some recent outages that made headlines. This includes the Spotify outage, caused by an expired TLS certificate, that prevented users from accessing its platform. We also cover off on a widespread outage at Cogent during (what seems to be) a maintenance window. Then, we go “under the hood” on the prolonged outage at an IXP on August 18th to understand exactly what infrastructure was impacted and which downstream providers were subsequently impacted. We’re also joined by our guest, Prabhnit Singh, who currently leads ThousandEyes’ Internet & WAN product line, to discuss why we’re seeing an increased number of outages caused by expired TLS certificates and to cover some examples of past high-profile outages.
20 minutes | 3 months ago
Ep. 19 Fortnite’s Epic Battle Against the “Apple Tax”; And, The Evolution of Cloud Connectivity (Week of Aug. 10-16)
On this week’s episode, Archana and I cover recent headlines concerning social media platform, TikTok, and the gaming provider, Epic Games. TikTok appears to have gained some additional time (now 90 days) before the US government will enforce its ban on the service. Gaming provider, Epic Games, recently made news when its game Fortnite was removed from Apple’s App Store and Google’s Play Store for violating their Terms of Service. Epic was quick to file a lawsuit claiming the tech giants were in violation of anti-competition laws. The outcome of this case will be one to watch, and can have far-reaching impacts for developers. Next up, we speak with William Collins, Lead Cloud Architect at a Fortune 100 company, about cloud connectivity, on-ramp services and the difference between the “Big 3” on-ramp services.
30 minutes | 4 months ago
Ep. 18 Time’s Running Out on TikTok; Plus, This Ain't Your Dad’s SatComms—But Does It Live Up to the Hype?(Week of Aug. 3-9)
This is the Internet Report, where we uncover what’s working and what’s breaking on the Internet—and why. On this week’s episode, Mike sat down with our guest, Ray Hunter, the senior network consultant at Globis in the Netherlands, to talk about SatComms and the role they play in connecting users, and what effect the mass deployment of Low Earth Orbital (LEO) satellites will have on networks and service delivery. We also discuss a recent move by the US to ban financial transactions between TikTok’s parent company, ByteDance, and US citizens, effectively removing financial incentives to serve US citizens. While not an outright ban, it does raise questions about how an outright ban even be enforced, and what that means for the broader conversation around Internet sovereignty.
10 minutes | 4 months ago
Ep. 17 Cogent's Midsummer's Night Outage and Telstra's Weekend DNS Mishap Prove Not All Outages Are Equal (Week of July 27-Aug. 2)
This is the Internet Report, where we uncover what’s working and what’s breaking on the Internet—and why. On this week’s episode, Archana and I discuss a small number of outages that hit certain regions of the globe over the past week. This includes an outage that caused a midday disruption for people trying to connect to Reddit, a weekend DNS issue at Telstra, and a Cogent outage in EMEA and NA that had the signatures of a maintenance window. We also revisit Cloudflare’s root cause analysis concerning their recent DNS outage and answer some of the open-ended questions we had.
26 minutes | 4 months ago
Ep. 16 Ransomware Attack Leaves Garmin Users Stuck Without a Paddle (Week of July 20-26)
On this week’s episode, I am joined by Deepak Ravi from our Dublin technical sales engineering team to discuss a recent outage at Garmin. Garmin confirmed that it was a victim of a ransomware attack, which took down several of its services including its website functions, customer support, customer facing applications, and company communications. In this episode, we walk through what we observed in the ThousandEyes platform during the time of the attack, and what the impacts were on users attempting to access Garmin services.We’re also joined by ThousandEyes’ CISO, Alexander Anoufriev, to talk about what ransomware attacks are, how they manifest and how organizations can protect themselves against future attacks.
26 minutes | 4 months ago
Ep. 15 Do Outages Come in 3’s? Diving Into Last Week’s Outages at GitHub, WhatsApp and Cloudflare (Week of July 13-19)
On this week’s episode, we cover a couple of significant application-layer outages at Github and WhatsApp that occurred over the past week. Then, Archana and I do a deep-dive into a network-related outage at Cloudflare that affected the availability of its popular DNS service for approximately 30 minutes. We’ll share what we saw through our vantage points in the ThousandEyes platform, and you can read Cloudflare’s full explanation of the incident on their blog/
21 minutes | 5 months ago
Ep. 14 India Swipes Left on TikTok, GCP Outage Hits Multiple AZs, & Cloud Networking 101 for Enterprises (Week of June 29-July 5)
On this week’s episode, we cover a recent move by the government of India to ban many Chinese-owned applications, including TikTok, which reportedly has more than 600,000,000 downloads in India. We also talk through a two-hour-long outage at Google Cloud Platform that affected multiple of its availability zones within a single region—highlighting that availability zones may be architected differently between providers—and briefly cover outages at Slack and Comcast, too. After our review of this week’s highlights, I sat down with Atif Khan, CTO of Alkira and former co-founder of Viptela to talk enterprise cloud strategy.
13 minutes | 5 months ago
Ep. 13: Broadband Goes Bust, Again; Plus, Satellite Meets SD-WAN (Week of June 22-28)
This week’s episode is brought to you by the letter “O” for outages — in particular, there were a number of broadband providers, globally, that suffered localized outages this past week. After we run down our top headlines, including a satellite provider rolling out managed SD-WAN, we take a look at outages in Comcast and AT&T’s networks. Make sure you join us next week to hear from Atif Khan, CTO at Alkira, as we talk about multi-cloud networking.
17 minutes | 5 months ago
Ep. 12: Major T-Mobile Outage Caused By Fiber Cut, and Talking Cloud Architecture at Scale with Uber (Week of June 15-21)
This is the Internet Report, where we uncover what’s working and what’s breaking on the Internet—and why. On this week’s episode, we cover a widespread T-Mobile outage that took down its cellular network for several hours and elicited a rare condemnation from the FCC. The culprit, according to the carrier, was a fiber cut—highlighting the need for redundancy and resiliency in the nation’s cellular networks. We also cover an issue with What’s App’s privacy settings that sent users scrambling to Twitter, as well as a recent move by Russia to “un-ban” the messenger app, Telegram. Then, stay tuned as we go one-on-one with Jason Black, the Head of Global Network Infrastructure at Uber Technologies, to discuss how Uber approaches its cloud architecture.
15 minutes | 5 months ago
Ep. 11 Excuse Me, Your BGP Is Leaking (Week of June 8-15)
On this week’s episode, we discuss a recent BGP-related outage at a major public cloud provider, as well as a recent announcement that Cogent Networks has rolled out RPKI in an effort to strengthen its BGP route security. We’re also joined by Kemal Sanjta, principal engineer on our customer success team and our resident expert on Internet routing and security, to chat about these events. Catch this week’s episode here to dive into BGP with us.
22 minutes | 6 months ago
Ep. 10: It’s ALWAYS DNS! (Week of May 25-May 31)
On this week’s episode of the Internet Report, I’m joined by my colleague, Michael Batchelder (aka Binky), to discuss a DNS-related service disruption that affected users trying to access Amazon.com. We also talk about a recently discovered DNS vulnerability that could leave DNS providers susceptible to DNS amplification DDoS attacks. If you’re curious about what went wrong with Amazon’s service last week and want to know more about the role of DNS and why it’s so important, don’t miss this episode.
Terms of Service
© Stitcher 2020