Select a tag to browse associated projects and drill deeper into the tag cloud.
A Python based HTML parser/tokenizer based on the WHATWG HTML specification for maximum compatibility with major desktop web browsers.
Jodd is an open-source Java utility library and set of frameworks. Jodd tools enriches JDK with many powerful and feature rich utilities. It helps with everyday task, makes code more robust and reliable. Jodd frameworks is set of lightweight application frameworks, compact yet powerful. Designed
The tool can copy tile and content from URL which is provided by you. For example, if you want to copy a post from Sina blog to a BBS, you can use this tool to copy title and content separately for you automatically. So, what you need to do, it is just to paste! The tool will also format the
Always ,We download many movies from HDC(A Chinese Private Tracker),but as time goes by,we forget all the movie information about contents,actors,picture information except the name displayed in local HDD.So I hope to write a java application to solve the problem. The Process WILL be as follows:
Main Features: want an easy-use web page parser? with built-in vb.net script supporting(including a small IDE environment) to deal complicated situation using IE / Htmlparser Core to parse ajax / html page built-in c/s struction allow you parse some of hardcore page with capacha or ip streagy
It's a general Markup Langauge parser. Any kind of markup language can be processed, including html, xhtml, wml, xml and so on. The powerful feature is that it can deal with wrong format html content.
This project use HTML parser to analysis some OJ's pending contests page.And make a summary page of these contests.
Copyright
©
2013
Black Duck Software, Inc.
and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a
Creative Commons Attribution 3.0 Unported License
. Ohloh
®
and the Ohloh logo are trademarks of
Black Duck Software, Inc.
in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.