Browsing projects by Tag(s)


This project implements many popular data-mining and machine-learning algorithms. Every candidate algorithm must be suitable for implementation on a distributed and/or parallel computing platform, such as Hadoop. The ultimate goal of this project is to solve storage and computation for very large datasets, especially high-dimensional ones. I know this is a very difficult topic; if you would like to join the challenge, please mail me: moonblue333@hotmail.com. Thanks to Wei.Dong at cs.princeton.edu for LSH.

Additionally, there is a proof-of-concept distributed database, attached as ting-0.5.0.zip. For more information, please refer to http://www.sadbit.com or sadbit333.appspot.com. (Do not ask for the source-code password for ting-0.5.0.zip; the binary is OK. Asking for the password for any other package is OK.)

The research focus in 2009:
1) How to prepare the input data, for example with special normalization, so that LSH gets a better 3-rate.
2) How to construct a better kernel-LSH to fit the final similarity metric, such as EMD or grid-feature.
3) Search for and research better similarity-metric algorithms. (So far, EMD and grid-feature are better, at least better than the original L1 and L2.)

I will update this summary to introduce all of the main implemented algorithms:
Hash-Family: LocalitySensitiveHash, ConsistentHash, PerfectHash, MinimalPerfectHash, BloomFilter(Hash), CuckooHash, DynamicHash, ExtendableHash, LinearHash
Image-Processing: Color-Space Transformation, Edge-Histogram, EMD, ImageGridFeatureExtraction, Others
Dimension-Reduction/Feature-Extraction: LLE, Wavelet, PCA, ICA
AI-Related: ANN, SVM
Distribution-Computing: Paxos, (TODO) Failure-Detection-Algorithms
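
As an illustration of the LocalitySensitiveHash entry and the L2 metric mentioned in the research notes, here is a minimal sketch of the standard random-projection (p-stable) LSH scheme for Euclidean distance. It is not taken from this project's code: the function names (make_l2_hash, make_signature, build_index) and the parameters (width, k, seed) are hypothetical and chosen only for this example.

```python
import math
import random


def make_l2_hash(dim, width, rng):
    """Build one hash h(v) = floor((a . v + b) / width) for L2 distance."""
    # a has Gaussian entries (2-stable), b is a random offset in [0, width).
    a = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    b = rng.uniform(0.0, width)

    def h(v):
        projection = sum(ai * vi for ai, vi in zip(a, v))
        return math.floor((projection + b) / width)

    return h


def make_signature(dim, width, k, seed=0):
    """Concatenate k independent hashes into one bucket key."""
    rng = random.Random(seed)
    hashes = [make_l2_hash(dim, width, rng) for _ in range(k)]
    return lambda v: tuple(h(v) for h in hashes)


def build_index(points, signature):
    """Group point indices by signature; nearby points tend to share a bucket."""
    buckets = {}
    for idx, point in enumerate(points):
        buckets.setdefault(signature(point), []).append(idx)
    return buckets
```

A query would hash the query vector with the same signature function and compare it only against the points in the matching bucket. Larger values of k make buckets more selective, and a larger width makes collisions between distant points more likely.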


The general problems addressed by this utility are:
1. Guaranteeing that the staging table required to SWITCH a partition in or out is always created correctly ‘just in time’, with the right indexes, columns, foreign keys, partition-specific check constraint, and filegroup corresponding to the target partition of a partitioned table.
2. Ensuring that partition management scripts stay in sync with index changes or foreign key constraints on partitioned tables over time. Without this utility, any change to a partitioned table’s DDL would require an equivalent change in a TSQL-based partition management script, along with the associated testing.
3. Providing an easy mechanism to quickly empty a selected partition of a partitioned table with a single command line.
4. Allowing fine-tuned partition data loading, including scenarios in which you want to create a staging table, populate it, and create indexes later, prior to a SWITCH in operation.

The utility must be run from the command line or within an SSIS package. Command-line parameters determine its behavior. You may perform one of four different functions depending on the COMMAND parameter. Additional parameters define the connection (server, integrated security, etc.), database, schema, partitioned table name, and so on. You can identify the partition to manage either by explicit partition number (/p:) or by specifying a string representation of a value that can be passed to the partition range function to determine a partition number (/v:).
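
To illustrate how a value given with /v: can resolve to a partition number, here is a small sketch of the lookup a SQL Server partition range function performs (the same mapping that $PARTITION reports). It does not reproduce the utility's own code: the function name partition_number and its arguments are hypothetical, and the boundary values are made up for the example.

```python
from bisect import bisect_left, bisect_right


def partition_number(value, boundaries, range_right=False):
    """Return the 1-based partition number for value.

    boundaries must be sorted ascending. With RANGE LEFT (the default
    here) a boundary value belongs to the partition on its left; with
    RANGE RIGHT it belongs to the partition on its right.
    """
    if range_right:
        return bisect_right(boundaries, value) + 1
    return bisect_left(boundaries, value) + 1


# Hypothetical boundaries: three boundary values define four partitions.
boundaries = [10, 20, 30]
left_result = partition_number(10, boundaries)                     # RANGE LEFT
right_result = partition_number(10, boundaries, range_right=True)  # RANGE RIGHT
```

With these boundaries, the value 10 falls in partition 1 under RANGE LEFT and in partition 2 under RANGE RIGHT, which is why the range direction matters when choosing between /p: and /v:.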


Managing SQL Server table partitions requires creating staging tables with indexes and constraints that match a specific partition of a permanent table. This utility creates staging tables on demand, eliminating the need to modify scripts when table or index structures change.


Copyright © 2013 Black Duck Software, Inc. and its contributors, Some Rights Reserved. Unless otherwise marked, this work is licensed under a Creative Commons Attribution 3.0 Unported License. Ohloh® and the Ohloh logo are trademarks of Black Duck Software, Inc. in the United States and/or other jurisdictions. All other trademarks are the property of their respective holders.