The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries. Tika is a project of the Apache Software Foundation, and was formerly a subproject of Apache Lucene.
mime4j provides a parser, MimeStreamParser, for e-mail message streams in plain RFC 822 and MIME format. The parser uses a callback mechanism to report parsing events such as the start of an entity header, the start of a body, etc. If you are familiar with the SAX XML parser interface you should ...
Java parser for MIME message streams. The low-level streaming parser uses a callback mechanism to report parsing events (similar to SAX). Messages may also be parsed into a JavaMail MimeMessage-like structure, using temporary files for large attachments. Note that this code base has been developed further as Apache Mime4j.
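The callback (SAX-like) parsing style mentioned in the two MIME parser descriptions above can be illustrated with a small sketch. This is not the mime4j API: the `MessageHandler` interface, its method names, and the `CallbackParserSketch` class are hypothetical and exist only for illustration. The sketch also assumes a whole message held in memory as a string, with unfolded header lines and a single plain-text body, whereas a real streaming parser reads incrementally and handles folded headers and multipart bodies.

```java
import java.util.ArrayList;
import java.util.List;

public class CallbackParserSketch {

    /** Hypothetical event handler; NOT the interface used by mime4j. */
    interface MessageHandler {
        void startHeader();
        void field(String name, String value);
        void endHeader();
        void body(String text);
    }

    /** Walks a simple message (header lines, blank line, body) and reports events. */
    static void parse(String message, MessageHandler handler) {
        String[] lines = message.split("\r?\n", -1);
        handler.startHeader();
        int i = 0;
        // Header section: every line up to the first empty line.
        for (; i < lines.length && !lines[i].isEmpty(); i++) {
            int colon = lines[i].indexOf(':');
            if (colon > 0) {
                handler.field(lines[i].substring(0, colon).trim(),
                              lines[i].substring(colon + 1).trim());
            }
        }
        handler.endHeader();
        // Body: everything after the blank separator line.
        StringBuilder body = new StringBuilder();
        for (int j = i + 1; j < lines.length; j++) {
            if (j > i + 1) {
                body.append('\n');
            }
            body.append(lines[j]);
        }
        handler.body(body.toString());
    }

    /** Convenience handler that records each event as a string. */
    static List<String> collectEvents(String message) {
        List<String> events = new ArrayList<>();
        parse(message, new MessageHandler() {
            public void startHeader() { events.add("startHeader"); }
            public void field(String name, String value) { events.add("field " + name + "=" + value); }
            public void endHeader() { events.add("endHeader"); }
            public void body(String text) { events.add("body " + text); }
        });
        return events;
    }

    public static void main(String[] args) {
        String msg = "From: alice@example.org\r\nSubject: Hello\r\n\r\nHi Bob";
        collectEvents(msg).forEach(System.out::println);
    }
}
```

As with SAX, the parser owns the control flow and the caller only supplies a handler, so a message can be processed without first building a full object tree.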