What is this pattern about? The Pipeline design pattern (also known as Chain of Command pattern) is a flexible way to handle a sequence of actions, where each handler in the chain processes the input data and passes it to the next handler. This pattern is commonly used in scenarios involving data processing, web scraping,… Continue reading A Practical Example of the Pipeline Pattern in Python
Articles on Data
OpenStreetMap, Overpass API and Python
By Judith Vockenroth on 17 April 2024
OpenStreetMap (OSM) is known for being an open source project that allows people to browse the world map and to plan routes. However it is more than that. Among others it provides a read-only API that allows users to query for very diverse map data: Overpass API Data structure of OSM To understand the structure… Continue reading OpenStreetMap, Overpass API and Python
Using Python in Excel with PyXLL
By PyBites Team on 10 October 2023
In this podcast episode we talk with Tony Roberts, creator of PyXLL (pronounced “pixel”). Listen here: Also released on our YouTube channel: We dive into the backstory of this Python Excel Add-in, why he built it, features, success stories, the business side of things, and also how it compares to Microsoft’s recently announced Python in… Continue reading Using Python in Excel with PyXLL
Meet Will Raphaelson: From Script to Production Flow With Prefect & Marvin AI
By PyBites Team on 25 September 2023
This week Robin Beer – one of our coaches – interviews Will Raphaelson, Principal Product Manager at Prefect. 😍 They talk about his use of Python, Prefect as a tool and its philosophy, open source + business and Marvin AI. 🐍 💪 And of course they share cool wins and books they are reading. 💡… Continue reading Meet Will Raphaelson: From Script to Production Flow With Prefect & Marvin AI
Making plots in your terminal with plotext
By Bob Belderbos on 22 August 2022
In this blog post a quick script to plot the frequency of our blog articles in the terminal. It’s good to see that we’re getting back on track 🙂 The code gist is here. First we import the libraries we are going to use. As always we separate Standard Library modules from 3rd party ones… Continue reading Making plots in your terminal with plotext
Data engineering involves more Python than you might think!
By PyBites Team on 29 June 2022
Listen here: This week we have Christo back on the show to talk about his experience in the data engineering field. He shares some valuable tips how to become a more effective data engineer which, surprisingly or not, increasingly requires a well-rounded Python developer skill set. Enjoy and feel free to reach out to Christo below … Christo’s website: https://www.christoolivier.com… Continue reading Data engineering involves more Python than you might think!
Case study: How to parse nested JSON
By Michael Aydinbas on 3 June 2022
I was asked to help parse a JSON file that is delivered by the iTunes Store Customer Reviews API JSON endpoint. It is not so important how this API works or if there are better APIs for this. Instead, let’s assume that we found our favorite API to work with and that our request makes… Continue reading Case study: How to parse nested JSON
Analyzing covid-19 data with pandas and matplotlib
By Cedric Sambre on 13 July 2020
One moment I was solving a pandas challenge, next I was analyzing real world covid-19 data with python…
Introduction to Building Custom Apache Airflow Operators
By Christo Olivier on 10 April 2020
If you work in data engineering, then the chance are high that you are using or have used Apache Airflow.
Automating PyBites Review Post Using Github API and collections.defaultdict
By Bob Belderbos on 13 November 2018
In this post I share a quick script I produced last week to automate a portion of our review post. There are some nice idioms that you might find useful.