Beautiful Soup - Python HTML/XML parser
Beautiful Soup is a Python HTML/XML parser designed for quick-turnaround projects like screen-scraping. Beautiful Soup parses anything you give it, and does the tree traversal stuff for you. You can tell it "Find all the links", or "Find all the links of class externalLink", or "Find all the links whose URLs match 'foo.com'", or "Find the table heading that's got bold text, then give me that text."
- It won't choke if you give it bad markup
- It provides Pythonic idioms for navigating, searching, and modifying a parse tree
- It converts incoming documents to Unicode and outgoing documents to UTF-8
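
The requests and features described above can be sketched roughly as follows. This is a minimal illustration, not the canonical usage: it assumes the Beautiful Soup 4 package (`bs4`) is installed and uses Python's built-in "html.parser" backend. The sample markup and all variable names are made up for the example.

```python
import re
from bs4 import BeautifulSoup  # assumption: the bs4 package (Beautiful Soup 4) is installed

# Hypothetical, deliberately sloppy markup: unclosed <p>, <td>, <tr> and <table>.
html = """
<p>Links:
<a class="externalLink" href="http://foo.com/a">Foo</a>
<a href="/local">Café</a>
<table><tr><th><b>Name</b></th><th>Size</th></tr>
<tr><td>x<td>1
"""

# Parsing does not raise on the bad markup.
soup = BeautifulSoup(html, "html.parser")

# "Find all the links"
all_links = soup.find_all("a")

# "Find all the links of class externalLink"
external = soup.find_all("a", class_="externalLink")

# "Find all the links whose URLs match 'foo.com'"
foo_links = soup.find_all("a", href=re.compile(r"foo\.com"))

# "Find the table heading that's got bold text, then give me that text"
bold_heading = next(th for th in soup.find_all("th") if th.find("b") is not None)
heading_text = bold_heading.get_text()

# Modify the parse tree: rewrite one link's href attribute.
soup.find("a", href="/local")["href"] = "/moved"

# Text inside the tree is Unicode (str); serialized output is UTF-8 bytes.
as_bytes = soup.encode("utf-8")
```

Searching with `find_all` and a keyword filter keeps each request to one line, and a compiled regular expression can stand in wherever a plain string value would go.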