Libxml2 is the XML C parser and toolkit developed for the Gnome project (but usable outside of the Gnome platform).
Includes the xmllint tool for checking documents for well-formedness, validating documents against a DTD or XML Schema, and pretty printing XML input.
Rome is a set of Atom/RSS Java utilities that make it easy to work in Java with most syndication formats. Today it accepts all flavors of RSS (0.90, 0.91, 0.92, 0.93, 0.94, 1.0 and 2.0) and Atom 0.3 feeds. Rome includes a set of parsers and generators for the various flavors of feeds, as well as converters to convert from one format to another. The parsers can give you back Java objects that are either specific to the format you want to work with, or a generic normalized SyndFeed object that lets you work with the data without bothering about the underlying format.
Expat is a fast, non-validating, stream-oriented XML parsing library.
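To illustrate what "stream-oriented" means here, the following is a minimal sketch using Python's standard-library Expat binding (xml.parsers.expat) rather than the C API. The helper name collect_events is an assumption made for this example; the parser reports events to callbacks as it reads the input instead of building a document tree.

```python
import xml.parsers.expat


def collect_events(xml_text):
    """Return (kind, value) events in the order Expat reports them.

    collect_events is a hypothetical helper for illustration only.
    """
    events = []
    parser = xml.parsers.expat.ParserCreate()
    # Ask Expat to merge adjacent character data into a single callback.
    parser.buffer_text = True

    def on_start(name, attrs):
        events.append(("start", name))

    def on_end(name):
        events.append(("end", name))

    def on_text(data):
        # Skip whitespace-only text between elements.
        if data.strip():
            events.append(("text", data))

    parser.StartElementHandler = on_start
    parser.EndElementHandler = on_end
    parser.CharacterDataHandler = on_text
    # The second argument marks this chunk as the final piece of input.
    parser.Parse(xml_text, True)
    return events


if __name__ == "__main__":
    for event in collect_events("<feed><title>Hello</title></feed>"):
        print(event)
```

Because the parser is non-validating, it checks only well-formedness; malformed input raises xml.parsers.expat.ExpatError.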
Parse RSS and Atom feeds in Python
Markdown is a text-to-HTML conversion tool for web writers. Markdown allows you to write using an easy-to-read, easy-to-write plain text format, then convert it to structurally valid XHTML (or HTML).
SimplePie puts the 'simple' back into 'really simple syndication'. Flexible enough to suit newbies and veterans alike, SimplePie's focus has been two-fold: speed and ease of use. By thinking about the most useful ways to handle blogs, news sites, and podcasts, we've come up with an API that makes it easy to do cool things with your feeds.
Spirit is an object-oriented, recursive descent parser generator framework implemented using template meta-programming techniques. Expression templates allow Spirit to approximate the syntax of Extended Backus Normal Form (EBNF) completely in C++.
The Spirit framework enables a target grammar to be written exclusively in C++. EBNF grammar specifications can mix freely with other C++ code and, thanks to the generative power of C++ templates, are immediately executable.
Apache PDFBox is an open source Java PDF library for working with PDF documents. This project allows creation of new PDF documents, manipulation of existing documents, and extraction of content from documents. Apache PDFBox also includes several command line utilities.
* PDF to text extraction
* Merge PDF Documents
* PDF Document Encryption/Decryption
* Lucene Search Engine Integration
* Fill in form data FDF and XFDF
* Create a PDF from a text file
* Create image
Apache PDFBox is an effort undergoing incubation at The Apache Software Foundation (ASF), sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consist
A runtime for VRML and X3D virtual worlds and models.
OpenVRML includes a core runtime library, parsers for VRML97 and VRML-format X3D, an OpenGL renderer, and a Mozilla Web browser plug-in.
eden is a serializer/deserializer library that uses a subset of the ECMAScript notation. It allows data to be exchanged and interpreted while keeping its structure and type.
args4j is a small Java class library that makes it easy to parse command line options/arguments in your CUI application.
This project is a modern C++ library with a focus on portability and program correctness. It strives to be easy to use right and hard to use wrong. Thus, it comes with extensive documentation and thorough debugging modes. The library provides a platform abstraction layer for common tasks such as interfacing with network services, handling threads, or creating graphical user interfaces. Additionally, the library implements many useful algorithms such as data compression routines, linked lists, binary search trees, linear algebra and matrix utilities, machine learning algorithms, XML and text parsing, and many other general utilities.
popt is a command line option parsing library. While it is similar to getopt, it contains a number of enhancements, including:
1) popt is fully reentrant
2) popt can parse arbitrary argv[] style arrays while getopt(3) makes this quite difficult
3) popt allows users to alias command line arguments
4) popt provides convenience functions for parsing strings into argv[] style arrays
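popt itself is a C library, and the sketch below does not use its API. As a loose Python analogue of two of the ideas listed above (splitting a string into an argv-style array and expanding user-defined aliases), it relies on the standard shlex module. The names to_argv, expand_aliases, and the ALIASES table are hypothetical and exist only for this illustration.

```python
import shlex

# Hypothetical alias table: one option name expands to a list of arguments.
ALIASES = {"--fast": ["--level", "1"]}


def to_argv(command_line):
    """Split a command-line string into an argv-style list, honoring quotes."""
    return shlex.split(command_line)


def expand_aliases(argv, aliases=ALIASES):
    """Replace each aliased argument with its expansion; keep the rest as-is."""
    expanded = []
    for arg in argv:
        expanded.extend(aliases.get(arg, [arg]))
    return expanded


if __name__ == "__main__":
    argv = to_argv('prog --name "two words" --fast')
    print(expand_aliases(argv))
```

Neither function keeps global state, so both can be called on any list of arguments, not only on the process's own command line.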
Smarty-like template parser written in C++.
A framework of frameworks for rapid application development in Python. It includes packages for XML and XHTML parsing and generating, SNMP manager, SMI query API, Cisco-style CLI framework, QA automation, program control, and more.
Open source Java library which includes various lightweight XML processing tools.
Major features are:
Generating POJO by DTD;
XML-POJO mapping via Java5 annotations or DTD;
XML manipulations using POJO without SAX/DOM;
Preprocessing of XML documents using expression language;
Binary XML;
RMI friendly XML;
Exporting to JSON;
XML marshall/unmarshall;
LPeg is a new pattern-matching library for Lua, based on Parsing Expression Grammars (PEGs). In this text, I assume you are familiar with PEGs. If you are not, you can get a quick start reading the Wikipedia Entry for PEGs. The nice thing about PEGs is that they have a formal basis (instead of being an ad-hoc set of features), allow an efficient and simple implementation, and do most things we expect from a pattern-matching library (and more, as we can define entire grammars).
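For readers unfamiliar with PEGs, here is a minimal sketch of the core operators (literal, sequence, ordered choice, repetition) written as plain Python functions. It is not LPeg and does not mirror LPeg's API; all names (lit, seq, choice, many, match) are assumptions made for this example. Each parser takes the text and a position and returns the new position, or None on failure.

```python
def lit(s):
    """Match the literal string s at the current position."""
    def parse(text, pos):
        return pos + len(s) if text.startswith(s, pos) else None
    return parse


def seq(*parsers):
    """Match every parser in order; fail if any one fails."""
    def parse(text, pos):
        for p in parsers:
            pos = p(text, pos)
            if pos is None:
                return None
        return pos
    return parse


def choice(*parsers):
    """Ordered choice: return the result of the first alternative that matches."""
    def parse(text, pos):
        for p in parsers:
            result = p(text, pos)
            if result is not None:
                return result
        return None
    return parse


def many(p):
    """Greedy zero-or-more repetition; stops when p fails or makes no progress."""
    def parse(text, pos):
        while True:
            result = p(text, pos)
            if result is None or result == pos:
                return pos
            pos = result
    return parse


def match(p, text):
    """Return how many characters p consumes from the start of text, or None."""
    return p(text, 0)


# A tiny grammar: Sum <- Number ('+' Number)*, Number <- Digit Digit*
digit = choice(*[lit(c) for c in "0123456789"])
number = seq(digit, many(digit))
sum_expr = seq(number, many(seq(lit("+"), number)))

if __name__ == "__main__":
    print(match(sum_expr, "12+3+45"))
    print(match(choice(lit("a"), lit("ab")), "ab"))
```

The second call shows how ordered choice differs from alternation in a context-free grammar: the first alternative that matches wins, so the longer alternative "ab" is never tried.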
A toolchain consisting of a C compiler and various tools for binary object handling.
An implementation of John Gruber’s Markdown text to HTML, written in C.
Hubbub is an HTML5 compliant parsing library, written in C. It is currently in development for use with NetSurf and is intended to be suitable for use in other projects too.
The XMLToolbox brings support for loading/saving XML files within GNU Octave. Two different formats are supported: generic XML, and a format compatible with the GEODISE XMLToolbox (http://www.geodise.org).
This is a simple templatized C++ library for parsing command line arguments. The library provides a simple, flexible object-oriented interface to the command line that automates argument parsing, USAGE creation and type casting.
Vaniglia is a Java library composed of a number of lightweight, very specific, and performance-oriented Java components.
Currently the following components are implemented:
- Command Protocol
- Crypto
- Extensions Framework
- RollingFileDailyFolderAppender for Log4J
- RollingFileFoldersBackupAppender for Log4J
- Parser
- Performance Monitor
- Polling
- Objects Pool
- Socket Communication Framework
- State Machine
- Template Engine
- Text Table
- Time Utilities
- Vaniglia Message Queue
XQilla is an XQuery and XPath 2 library and command line utility written in C++, implemented on top of the Xerces-C library.
eXtensible Binary Universal Protocol (XBUP) is an attempt to create a universal, platform-independent protocol for general use. It aims to use the best unary-binary encoding and the most logical tree structure, based on strong arguments.
JSON Parser/Constructor for Lua
Uses LPeg to decode JSON and tables to encode JSON
libextractor is a library used to extract meta-data from files of arbitrary type. libextractor is part of the GNU project.
The goal is to provide developers of file-sharing networks or WWW-indexing bots with a universal library to obtain simple keywords to match against queries.
Currently, libextractor supports the following formats: HTML, PDF, PS, OLE2 (DOC, XLS, PPT), OpenOffice (sxw), StarOffice (sdw), DVI, MAN, FLAC, MP3 (ID3v1 and ID3v2), NSF (NES Sound Format), SID, OGG, WAV, EXIV2, JPEG, GIF, PNG, TIFF, DEB, RPM, TAR(.GZ), ZIP, ELF, FLV, REAL, RIFF (AVI), MPEG, QT and ASF.
Also, various additional MIME types are detected.
This is a parser for VRML 1.0c based on flex and bison.
Faxpp is a small, fast and conformant XML pull parser written in C with an API that can return strings in any encoding including UTF-8 and UTF-16.