Oracle doing Hadoop and NoSQL

September 30th, 2011

Oracle docs show plans for Hadoop, NoSQL about the Oracle Loader for Hadoop, and Added Session: Big Data Appliance about Oracle’s Big Data Appliance.

Hadoop Update

September 24th, 2011

There are a few articles collecting dust in my feed reader from a period a couple months ago when I didn’t have much time staying up to date on what’s happening… here’s the rundown of the most important stuff about Hadoop.

Hadoop Appliances?

Yahoo!’s Hortonworks Hadoop distribution

NoSQL Updates

September 21st, 2011

There’s a lot going on in the NoSQL world, or maybe Derrick Harris was just exceptionally busy last night…


Andreessen Horowitz Leads $5.7M Round In Analytics Platform For Hadoop Data Platfora:

Platfora works with existing Hadoop clusters, including Cloudera, MapR, and Amazon EMR, among others, and automatically turns these huge amounts of data into dimensional and predictive dashboards, reports and insights. The company’s server architecture enables sub-second report delivery, analytics overlay, and drill down performance.

I wouldn’t be surprised if the guys building on top of Hadoop made a lot more money than those providing Hadoop itself.

Curt Monash about Forthcoming Oracle appliances, based on information from Oracle’s earnings call (full transcript) last week. There will be an IMDB appliance based on TimesTen for high speed analytics, and a Hadoop appliance for MapReduce jobs, targetted at data preprocessing and feeding into Oracle. It really looks like Oracle is full steam ahead on the appliance strategy, and is also starting to embrace the MapReduce and massively parallel models. All of that is likely to be announced in more details at Oracle Open World.

Reported and analysed by Tony Baer in OnStrategies Perspectives, and reported by Derrick Harris in GigaOm’s in EMC, NetApp Make It a Big Day for Big Data Star Hadoop, we learn that EMC is using the on-going EMC World conference to its potential, and is announcing that they’re growing the Database division with the decision to sell their own Hadoop distribution with value add management tools and integration. I expect to see more soon.

Yahoo is considering to turn Hadoop into a business, as reported by the Wall Street Journal. Ovum’s Tony Baer has a more detailed analysis at his blog in Yahoo to Hadoop: Show me the Money.

In the long run, we also expect IBM to make a stab at Hadoop and related technologies by extending its InfoSphere offerings -– it can see Cloudera-Informatica and Cloudera-MicroStrategy raise it one with its own InfoSphere DataStage and Cognos offerings, before it even talks about partnerships. Today we saw a shot from left field – Yahoo which invented the technology – is now saying it might spin off its Hadoop business to go up against Cloudera, and potentially IBM. In a way, its closing the doors after the horses left the barn as the creator of Hadoop is now part of Cloudera.



For Yahoo, this would clearly be a shot out of its comfort zone, as it is not a tools company. But it is hungry for monetizing its intellectual property, even if that property has already been open sourced. It’s redolent of Sun striving to monetize Java and we all know how that went. Obviously this will be an uphill battle for Yahoo, but at least this would be a spinoff so hopefully there won’t be distractions from the mother ship. Given Yahoo’s fortunes, we shouldn’t be surprised that they are now looking to maximize what they can get out of the family jewels.

More commercial offerings in NoSQL can only be a good thing.

Structure Big Data Roundup

March 29th, 2011

Good number of articles from Derrick Harris over at GigaOm rounding up the Structure Big Data Conference. First, there’s a look at Hadoop, Cloudera, and alternatives to Cloudera from IBM, DataStax, Hadapt etc. in As Big Data Takes Off, the Hadoop Wars Begin, and second there’s a piece about Why Big Data Startups Should Take a Narrow View:

[…] analyzing social media data is not the same, either in technique or in purpose, as analyzing user data to feed a recommendation engine for a site like Netflix. And herein lies the opportunity. […] It’s a situation just begging for startups to fill the void between big data tools and actually using them for a particular task.

So where are the NoSQL startups targetting the financial industry?

Great Bloomberg interview with Cloudera CEO Mike Olson on open source and big data.

Via the 451 group

Meet Mapr, a Competitor to Hadoop Leader Cloudera.

They are said to be building a proprietary replacement for the Hadoop  Distributed File System that’s allegedly three times faster than the  current open-source version. It comes with snapshots and no NameNode  single point of failure (SPOF), and is supposed to be API-compatible  with HDFS, so it can be a drop-in replacement.

Lots of famous names in the company, which probably explains why they were able to raise $9m funding without ever shipping anything. I don’t know enough about them right now, but it’ll be interesting to see if they have enough of an advantage to keep an edge on plain Hadoop and HDFS over time, as the open source version catches up.