Browsing projects by Tag(s)

Select a tag to browse associated projects and drill deeper into the tag cloud.

Showing page 1 of 7

Hpricot is a very flexible HTML parser, based on Tanaka Akira's HTree and John Resig's JQuery, but with the scanner recoded in C (using Ragel for scanning.) I've borrowed what I believe to be the best ideas from these wares to make Hpricot heaps of fun to use.

4.28
   
  0 reviews  |  72 users  |  27,902 lines of code  |  3 current contributors  |  Analyzed 3 days ago
 
 

Expat is a fast, non-validating, stream-oriented XML parsing library.

3.6
   
  0 reviews  |  39 users  |  22,710 lines of code  |  1 current contributor  |  Analyzed 8 days ago
 
 

Markdown is a text-to-HTML conversion tool for web writers. Markdown allows you to write using an easy-to-read, easy-to-write plain text format, then convert it to structurally valid XHTML (or HTML).

4.0
   
  0 reviews  |  34 users  |  0 current contributors
 
 

A Python HTML/XML parser for quick turnaround projects like screen-scraping. 1. Beautiful Soup won't choke if you give it bad markup. It yields a parse tree that makes approximately as much sense as your original document. This is usually good enough to collect the data you need and run away. ... [More] 2. Beautiful Soup provides a few simple methods and Pythonic idioms for navigating, searching, and modifying a parse tree: a toolkit for dissecting a document and extracting what you need. You don't have to create a custom parser for each application. 3. Beautiful Soup automatically converts incoming documents to Unicode and outgoing documents to UTF-8. You don't have to think about encodings, unless the document doesn't specify an encoding and Beautiful Soup can't autodetect one. [Less]

4.25
   
  0 reviews  |  15 users  |  1,966 lines of code  |  0 current contributors  |  Analyzed 7 days ago
 
 

Texy is one of the most complex lightweight markup language. It allows adding of images, links, nested lists, tables and has full support for typography and CSS. Texy allows you to enter content using an easy to read Texy syntax which is filtered into structurally valid XHTML. No knowledge of HTML is required.

4.83333
   
  0 reviews  |  14 users  |  4,611 lines of code  |  2 current contributors  |  Analyzed 5 days ago
 
 

MarkdownJ is the pure Java port of Markdown (a text-to-html conversion tool written by John Gruber.)

3.0
   
  0 reviews  |  12 users  |  5,616 lines of code  |  2 current contributors  |  Analyzed 9 days ago
 
 

Nokogiri is a libxml wrapper. It features an HTML, XML, SAX, and Reader parser, as well as XPath and CSS interfaces for searching. Nokogiri is also a drop in replacement for Hpricot.

5.0
 
  0 reviews  |  11 users  |  44,528 lines of code  |  28 current contributors  |  Analyzed about 1 year ago
 
 

Html Agility Pack is an agile HTML parser library that proposes a read/write DOM and supports plain XPATH or XSLT. It allows you to parse "out of the web" HTML files. The parser is very tolerant with "real world" malformed HTML. The object model is very similar to what proposes System.Xml, but for HTML documents (or streams).

4.5
   
  0 reviews  |  6 users  |  111,631 lines of code  |  3 current contributors  |  Analyzed 7 months ago
 
 

EQdkp Plus is an Open Source DKP system written in PHP/Mysql created to administer DKP points especially for World of Warcraft.

4.75
   
  0 reviews  |  5 users  |  691,940 lines of code  |  6 current contributors  |  Analyzed 8 months ago
 
 

reStructuredText format is a document description format. Like other LaTex or DocBook it can be converted toward a multitude of formats. These formats have usually invading syntax which, if it is necessary for very specific documents, becomes useless when it is used to quickly creating a simple ... [More] document. RST has a so simple syntax that it becomes almost invisible. JRST is a Java ReStructuredText parser enabling to create a tree representation document. It becomes easy to generate document representation towards differents fomats. [Less]

4.75
   
  0 reviews  |  3 users  |  237,875 lines of code  |  5 current contributors  |  Analyzed 8 days ago
 
 
 
 

Creative Commons License Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License . Ohloh ® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.