Very Low Activity

Project Summary

  Analyzed 2 days ago based on code collected 2 days ago.

A Python HTML/XML parser for quick turnaround projects like screen-scraping.
1. Beautiful Soup won't choke if you give it bad markup. It yields a parse tree that makes approximately as much sense as your original document. This is usually good enough to collect the data you need and run away.
2. Beautiful Soup provides a few simple methods and Pythonic idioms for navigating, searching, and modifying a parse tree: a toolkit for dissecting a document and extracting what you need. You don't have to create a custom parser for each application.
3. Beautiful Soup automatically converts incoming documents to Unicode and outgoing documents to UTF-8. You don't have to think about encodings, unless the document doesn't specify an encoding and Beautiful Soup can't autodetect one.

Tags

  html python parser xml

Share

In a Nutshell, Beautiful Soup...

 

Activity

30 Day Summary May 19 2013 — Jun 18 2013

12 Month Summary Jun 18 2012 — Jun 18 2013

  • 0 Commits Down -3 (100%) from previous 12 months
  • 0 Contributors Down -1 (100%) from previous 12 months

Community

Ratings

4 users rate this project:
4.25
   
4.3/5.0
 
Click to add your rating
 
Review this Project!
 
 
 

Creative Commons License Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License . Ohloh ® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.