Browsing projects by Tag(s)

Select a tag to browse associated projects and drill deeper into the tag cloud.

Showing page 1 of 2

A Python based HTML parser/tokenizer based on the WHATWG HTML specification for maximum compatibility with major desktop web browsers.

5.0
 
  0 reviews  |  5 users  |  11,971 lines of code  |  7 current contributors  |  Analyzed 5 days ago
 
 

An online validation service for HTML and XHTML and other markup languages.

5.0
 
  0 reviews  |  2 users  |  244,646 lines of code  |  8 current contributors  |  Analyzed 4 days ago
 
 
Compare

Jodd is an open-source Java utility library and set of frameworks. Jodd tools enriches JDK with many powerful and feature rich utilities. It helps with everyday task, makes code more robust and reliable. Jodd frameworks is set of lightweight application frameworks, compact yet powerful. Designed ... [More] following the CoC, DRY and SCS principles, it makes development simple, but not simpler; you get 90% of the features with 10% of usual effort. Special attention is put into creating reusable and fastest possible code and still keeping it small, under 1.2 MB. Jodd is free software, released under the terms of the BSD license. [Less]

5.0
 
  0 reviews  |  1 user  |  148,342 lines of code  |  6 current contributors  |  Analyzed 5 days ago
 
 

Simple lightweight html parser written in .net

0
 
  0 reviews  |  0 users  |  4,017 lines of code  |  0 current contributors  |  Analyzed about 20 hours ago
 
 

The tool can copy tile and content from URL which is provided by you. For example, if you want to copy a post from Sina blog to a BBS, you can use this tool to copy title and content separately for you automatically. So, what you need to do, it is just to paste! The tool will also format the ... [More] content part for you, which means that you don't to modify the web content by hand. Now the tool supports Sina blogs, Blogspot blogs. [Less]

0
 
  0 reviews  |  0 users  |  44 lines of code  |  0 current contributors  |  Analyzed about 2 years ago
 
 

Hattrick Transfers

0
 
  0 reviews  |  0 users  |  0 current contributors
 
 

Always ,We download many movies from HDC(A Chinese Private Tracker),but as time goes by,we forget all the movie information about contents,actors,picture information except the name displayed in local HDD.So I hope to write a java application to solve the problem. The Process WILL be as follows: ... [More] 1.Iterator the directory,obtain the movie name; 2.Login the hdc website,search and obtain relatively information(So you need a HDC account) 3.Fullfill a Template,and generate a HTML document in the directory All the code is still in close because I need to finish the main function and be admitted by HDC Admin.... I have to use new version of httpClient,and can be explored in my spare time. so the schedule will be slower than your imagination. [Less]

0
 
  0 reviews  |  0 users  |  0 current contributors  |  Analyzed about 2 years ago
 
 

Main Features: want an easy-use web page parser? with built-in vb.net script supporting(including a small IDE environment) to deal complicated situation using IE / Htmlparser Core to parse ajax / html page built-in c/s struction allow you parse some of hardcore page with capacha or ip streagy ... [More] "slow and steady" is better than "fast and broken" 主要特性: 容易使用的网络采集程序 内置的vb.net脚本语言(包含IDE环境)使得程序能够处理复杂网页情况(例如与大型数据库互动等) 使用IE/Htmalparser 双内核处理Ajax/普通Html网页 内建C/S结构允许用户使用多台客户端分布采集,同时亦能处理带有验证码或者ip策略的网站 对采集要求高的用户来说,慢点采总比采不到要好吧 :) Used Open-Source Project: CsexWB ==> a super good IE-Core(Trient) wrapper Code Editor ==> a c# IDE HtmlParser2003 ==> HtmlParser(JAVA) in C# versiton log4net ==> no need to describe, everybody should use it SharpICTCLT ==> chinese word segmentation in c# Used DLL FetionSDK ==> CMCC Fetion SDK DotRAS ==> Nice RAS Handler in C# [Less]

0
 
  0 reviews  |  0 users  |  118,797 lines of code  |  0 current contributors  |  Analyzed 6 days ago
 
 

It's a general Markup Langauge parser. Any kind of markup language can be processed, including html, xhtml, wml, xml and so on. The powerful feature is that it can deal with wrong format html content.

0
 
  0 reviews  |  0 users  |  0 current contributors  |  Analyzed 3 days ago
 
 

This project use HTML parser to analysis some OJ's pending contests page.And make a summary page of these contests.

0
 
  0 reviews  |  0 users  |  0 current contributors  |  Analyzed 2 days ago
 
 
 
 

Creative Commons License Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License . Ohloh ® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.