According to the Zen of Python, “There should be one– and preferably only one –obvious way to do it.” It’s a good principle for designing a program: the more ways there are of doing something, the more confusing the software becomes, along with a host of other problems. In reality, though, there almost always is more than one way to accomplish something. The quotation even displays this fact: it places the dash in two different ways, neither of which are the obvious way.
Articles on BeautifulSoup
From Webscraper to Wordcloud
By Cedric Sambre on 27 November 2019
After going through the web scraping learning paths, I decided to get my hands dirty and apply my freshly gathered knowledge on a real life project. I explain some difficulties you might encounter while scraping and I also show some libraries that can help you visualizing data you have obtained.
Building a Python Tips API with Django REST Framework and Deploying it to Digital Ocean
By Bob Belderbos on 5 March 2019
In this article I will show you how to build a simple API for our growing collection of Python tips. First we make a simple Django app, defining the model. Next we use Django REST Framework to make an API supporting common CRUD operations. Then we will test it out using curl, Postman and Django REST’s browser front-end. Lastly we deploy the API to Digital Ocean so we can start using it via our Slack with a Slash Command, which I will cover in the next article. Sounds exciting? You bet it is! Let’s jump straight in!
Generating Beautiful Code Snippets with Carbon and Selenium
By Bob Belderbos on 26 February 2019
Did you notice our Python tips lately? They looks more sexy, don’t they? That’s thanks to Carbon which lets you create beautiful images of your source code. As much as I love its interface though, what if we can automate this process generating the image for us? That’s what we did and posting new tips to Twitter is now a breeze. In this article I will show you how using a bit of BeautifulSoup and selenium. Enjoy!
How to Parse Hidden HTML With Selenium Headless Mode and Deploy it to Heroku
By Bob Belderbos on 19 February 2019
Ever wondered how you scrape hidden (or JS generated) HTML? Selenium is your friend. Ever wondered how to run it without a browser popping up? Use headless mode. How would you run it remotely? Use Heroku. And how about autoposting to Slack and Twitter? With the right libraries and API setup little code is needed. In this 10 step guide I will show you how to build a Packt Free Learning Notifier which will accomplish all these tasks. Ready to learn some nice automation skills in Python?
Code Challenge 22 – Packt Free Ebook Web Scraper – Review
By PyBites Team on 12 June 2017
In this article we review last week’s Packt free ebook code challenge. We really scratched our own itch building both a notification service and an ebook download manager.
Next Time I Will Use Pandas to Parse Html Tables
By Bob Belderbos on 8 June 2017
Last week I did some html table parsing with BeautifulSoup and regex. It turns out there is an easier way to do this: Pandas.
Code Challenge 22 – Packt Free Ebook Web Scraper
By PyBites Team on 5 June 2017
Hi Pythonistas, a new week, a new ‘bite’ of Python coding! This week we will do some web scraping of Packt’s daily free ebook, sending out daily notifications. This week we even have a unique opportunity to sponsor the Python Community, read on … and happy coding!
Code Challenge 21 – Electricity Cost Calculation App – Review
By PyBites Team on 5 June 2017
In this article we review this week’s code challenge #21 for which we got some nice PRs.
Create a Simple Web Scraper with BeautifulSoup4
By Julian Sequeira on 11 January 2017
Learn to create a simple web scraper in Python using BeautifulSoup4