India rocks! Everything Indian! Indian blog!: March 2011

Friday, March 18, 2011

Following figure represents Service-based architecture and its relation to virtualization-pattern roles

See more at msdn.microsoft.com

Following figure represents the relationships between a WSDL service description and information that is stored in a UDDI service registry.

See more at msdn.microsoft.com

A list of open source frameworks which can be used for data processing and data mining

· HDFS – A distributed file system that provides high throughput access to application data

Hadoop Eco-system

·

Pig – A high – level data-flow language and execution framework for parallel computation

· ZooKeeper – A high – performance coordination service for distributed applications

· Hive – A data warehouse infrastructure that provides data summarization and ad hoc querying

Mahout – A scalable machine learning and data mining library

Hbase – A scalable, distributed database that supports structured data storage for large tables

Avro – A data serialization system.

MapReduce – A software framework for distributed processing of large data sets on compute clusters.

Chukwa – A data collection system for managing large distributed systems

Hadoop Common – The common utilities that support the other Hadoop sub – projects
Read more at itsitspace.blogspot.com

India rocks! Everything Indian! Indian blog!