Stream Your Data Using Nothing But Python’s Requests Library

Understand when to stream data, how to configure a pipeline and learn from my mistakes building streaming pipelines.

Share
Light blue stream of data on a dark blue background.
Artist’s rendering of streaming data. Photo by Conny Schneider on Unsplash.

The Forgotten Streaming Library Hidden In Python

When you think of building streaming data pipelines you likely think of using a library like Apache Kafka that is tailor-made for streaming.

Cloud platforms like Google Cloud Platforms (GCP) and Amazon Web Services (AWS) support the streaming of data into their respective SQL environments. GCP, in particular, even provides services like the Billing API or Google Analytics connections that stream source data on a recurring basis.

On top of the infrastructure giants countless third-party “code-less” platforms have taken on the heavy lifting of setting up, maintaining and scaling streaming infrastructure.

Even though third parties sell platforms that will alleviate the “challenging” builds like streaming pipelines, as a data engineer, it is still important to know how to configure a streaming pipeline.

But going from creating relatively simple batch load jobs to configuring dynamic streaming pipelines can be, understandably, intimidating.

Aside from not understanding the concept of streaming data, it can be difficult to even know where to begin when creating such pipelines.

Enter your Fisher-Price “My First Streaming Pipeline”: Requests.

Build Your Pipeline To A Data Engineering Career

You’ve reached the limit of the public preview. The full version of this post includes the implementation details: The code, the edge cases, and the "why" behind the architecture.

When you join PipelineToDE, you get:

  • The DA → DE Pathway Course: A structured roadmap to bridge the gap between analysis and engineering.
  • Weekly Senior Deep Dives: Fresh, tactical insights on Python, Cloud (GCP/AWS), and modern orchestration delivered every week.
  • Production-Ready Blueprints: Access to 80+ protected stories and code repos from my time in the trenches as a Senior DE
  • The DE Job Board (Coming Soon): Exclusive access to a curated board of high-agency Data Engineering roles.