musings and one liners

Tag: NoSQL

Greenplum to have it’s own Hadoop Distribution

Reported and analysed by Tony Baer in OnStrategies Perspectives, and reported by Derrick Harris in GigaOm’s in EMC, NetApp Make It a Big Day for Big Data Star Hadoop, we learn that EMC is using the on-going EMC World conference to its potential, and is announcing that they’re growing the Database division with the decision to sell their own Hadoop distribution with value add management tools and integration.… Continued

May 9, 2011
Yahoo Mulls Spinoff for Hadoop Software Unit

Yahoo is considering to turn Hadoop into a business, as reported by the Wall Street Journal. Ovum’s Tony Baer has a more detailed analysis at his blog in Yahoo to Hadoop: Show me the Money.

In the long run, we also expect IBM to make a stab at Hadoop and related technologies by extending its InfoSphere offerings -– it can see Cloudera-Informatica and Cloudera-MicroStrategy raise it one with its own InfoSphere DataStage and Cognos offerings, before it even talks about partnerships.… Continued

April 28, 2011
Google’s Megastore

I don’t think I’ve written about Google’s Megastore yet, so here’s a quick summary of worthwile resources.

Megastore is the data engine supporting the Google Application Engine. It’s a scalable structured data store providing full ACID semantics within partitions but lower consistency guarantees across partitions.… Continued

April 26, 2011
Necessity is the mother of NoSQL

The 451 group’s Matt Aslett argues that Necessity is the mother of NoSQL.

Necessity is particularly relevant when looking at the history of the NoSQL databases. While it is easy for the incumbent database vendor to dismiss the various NoSQL projects as development playthings, it is clear that the vast majority of NoSQL projects were developed by companies and individuals in response to the fact that the existing database products and vendors were not suitable to meet their requirements with regards to the other five factors: scalability, performance, relaxed consistency, agility and intricacy.… Continued

April 21, 2011
MySQL pre-releases integrated Memcached

MySQL just announced a pre-release snapshot which comes with an integrated Memcached plugin accessing the InnoDB storage engine directly: NoSQL to InnoDB with Memcached

The ever-increasing performance demands of web-based services have generated significant interest in providing NoSQL access methods to MySQL.… Continued

April 12, 2011
Structure Big Data Roundup

Good number of articles from Derrick Harris over at GigaOm rounding up the Structure Big Data Conference. First, there’s a look at Hadoop, Cloudera, and alternatives to Cloudera from IBM, DataStax, Hadapt etc. in As Big Data Takes Off, the Hadoop Wars Begin, and second there’s a piece about Why Big Data Startups Should Take a Narrow View:

[…] analyzing social media data is not the same, either in technique or in purpose, as analyzing user data to feed a recommendation engine for a site like Netflix.… Continued

March 29, 2011
Cloudera’s Olson Says Data Will Transform Industry

Great Bloomberg interview with Cloudera CEO Mike Olson on open source and big data.

Via the 451 group

March 26, 2011
Meet Mapr, a Competitor to Hadoop Leader Cloudera

Meet Mapr, a Competitor to Hadoop Leader Cloudera.

They are said to be building a proprietary replacement for the Hadoop Distributed File System that’s allegedly three times faster than the current open-source version. It comes with snapshots and no NameNode single point of failure (SPOF), and is supposed to be API-compatible with HDFS, so it can be a drop-in replacement.… Continued

March 25, 2011
Microsoft Graph DB Trinity

Microsoft published information about it’s research project Trinity, a hypergraph DB.

Trinity is a graph database and computation platform over distributed memory cloud. As a database, it provides features such as highly concurrent query processing, transaction, consistency control.… Continued

March 23, 2011