Browsing projects by Tag(s)

Select a tag to browse associated projects and drill deeper into the tag cloud.

Showing page 1 of 1

The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries. Tika is a project of the Apache Software Foundation, and was formerly a subproject of Apache Lucene.

5.0
 
  0 reviews  |  18 users  |  58,581 lines of code  |  7 current contributors  |  Analyzed about 12 hours ago
 
 

mime4j provides a parser, MimeStreamParser , for e-mail message streams in plain rfc822 and MIME format. The parser uses a callback mechanism to report parsing events such as the start of an entity header, the start of a body, etc. If you are familiar with the SAX XML parser interface you should ... [More] have no problem getting started with mime4j. The parser only deals with the structure of the message stream. It won't do any decoding of base64 or quoted-printable encoded header fields and bodies. This is intentional - the parser should only provide the most basic functionality needed to build more complex parsers. However, mime4j does include facilities to decode bodies and fields and the Message class described below handles decoding of fields and bodies transparently. [Less]

0
 
  0 reviews  |  4 users  |  35,856 lines of code  |  4 current contributors  |  Analyzed 7 days ago
 
 

Java parser for MIME message streams. The low-level streaming parser uses a callback mechanism to report parsing events (similar to SAX). Messages may also be parsed into a JavaMail MimeMessage-like structure using temporary files for large attachments. Note that this code base has been developed further as ApacheMime4j

0
 
  0 reviews  |  0 users  |  28,442 lines of code  |  0 current contributors  |  Analyzed 5 days ago
 
 
 
 

Creative Commons License Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License . Ohloh ® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.