Displaying 1 to 17 from 17 results

requests-html - Pythonic HTML Parsing for Humans™

  •    HTML

This library intends to make parsing HTML (e.g. scraping the web) as simple and intuitive as possible.

MechanicalSoup - A Python library for automating interaction with websites.

  •    Python

A Python library for automating interaction with websites. MechanicalSoup automatically stores and sends cookies, follows redirects, and can follow links and submit forms. It doesn't do JavaScript. MechanicalSoup was created by M Hickford, who was a fond user of the Mechanize library. Unfortunately, Mechanize was incompatible with Python 3 until 2019 and its development stalled for several years. MechanicalSoup provides a similar API, built on Python giants Requests (for HTTP sessions) and BeautifulSoup (for document navigation). Since 2017 it is a project actively maintained by a small team including @hemberger and @moy.

Soup - Web Scraper in Go, similar to BeautifulSoup

  •    Go

soup is a small web scraper package for Go, with its interface highly similar to that of BeautifulSoup.

scrape-url - Scrape URLs with CSS selectors

  •    Javascript

Scrape URLs with CSS selectors and returns elements with jQuery-like interface.See example.js for more information.




Lyricaly - :musical_note: Lyricaly gets Lyrics delivered to your Terminal for any Song

  •    Python

:musical_note: Lyricaly gets Lyrics delivered to your Terminal for any Song. Uses Python beautifulsoup4 to scrap lyrics. pypi: lyricaly

househunterbot - Use Python, Google Spreadsheet, Google Shortener and CALLR API to automate your apartment search in Paris

  •    Python

Use Python, Google Spreadsheet, Google Shortener and CALLR API to automate your apartment search in Paris. Read the related article on the CALLR blog.


PacPaw - Pawn package manager for SA-MP

  •    Python

PacPaw is pawn package manager for SAMP wrriten in python and is still under developement.It mainly relies on webscraping with BeautifulSoup.In addition to it it also helps scripters for gathering snippets based on pawn and function references documented for SA-MP.

Data-Wrangling-with-Python - Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices

  •    Jupyter

Data is the new Oil and it is ruling the modern way of life through incredibly smart tools and transformative technologies. But oil does not come out in its final form from the rig. It has to be refined through a complex processing network. Similarly, data needs to be curated, massaged and refined to be used in intelligent algorithms and consumer products. This is called wrangling and (according to Forbes) all the good data scientists spend almost 60-80% of their time on this, each day, every project. It involves scraping the raw data from multiple sources (including web and database tables), imputing, formatting, transforming – basically making it ready, to be used flawlessly in the modeling process. This course aims to teach you all the core ideas behind this process and to equip you with the knowledge of the most popular tools and techniques in the domain. As the programming framework, we have chosen Python, the most widely used language for data science. We work through real-life examples, not toy datasets. At the end of this course, you will be confident to handle a myriad array of sources to extract, clean, transform, and format your data for the great machine learning app you are thinking of building. Hop on and be the part of this exciting journey.

wikipedia-reference-scraper - Wikipedia API wrapper for references

  •    Python

I just graduated from Physiology department, University of Ibadan. I started typing my final year project some days before submission deadline. I made use of Wikipedia for my literature review because each page is supported with enough references. The next task was to copy and paste the references. This was a lot of work considering the fact that some pages has over 200 references and I wasn't working with just one page. I decided to make use of Wikipedia API wrappers but all the ones I checked didn't do what I needed. So I decided to write a simple script that scraped Wikipedia page. It pulls the references from a Wikipedia page and saves the references in a file.

easy-scraping-tutorial - Simple but useful Python web scraping tutorial code.

  •    Jupyter

In these tutorials, we will learn to build some simple but useful scrapers from scratch. Get to know how we can read web page and select sections you need or even download files. If you understand Chinese, you are lucky! I made Chinese video + text tutorials for all of these contents. You can find it in 莫烦Python. Learning from code, I made two options for you.

CoWaPS - CodeWarsProfileScraper. Only meant for practicing scraping.

  •    Python

This is a web scraper written in python to scrape the profile of a user on code wars(a competitive programming website). Although the CodeWars API is publicly available I made this scraper just for the love of scraping. You need requests, BeautifulSoup and texttable to run this script.

Track-Stargazers - Have fun tracking your project's stargazers

  •    Javascript

Have fun tracking your project's stargazers. I saw this post at Codementor by Ionică Bizău which lead me to his project, so I decided to create something cool of my own.





We have large collection of open source products. Follow the tags from Tag Cloud >>


Open source products are scattered around the web. Please provide information about the open source projects you own / you use. Add Projects.