September 30th, 2011
September 24th, 2011
There are a few articles collecting dust in my feed reader from a period a couple months ago when I didn’t have much time staying up to date on what’s happening… here’s the rundown of the most important stuff about Hadoop.
- Hardware for Hadoop
- Why you would want an appliance — and when you wouldn’t
- Some notes on Hadoop (mainly) and appliances
- EMC Puts Database & Hadoop in Same Big Data Analytics Box
Yahoo!’s Hortonworks Hadoop distribution
September 21st, 2011
There’s a lot going on in the NoSQL world, or maybe Derrick Harris was just exceptionally busy last night…
- EMC throws lots of hardware at Hadoop
- DataStax gets $11M, fuses NoSQL and Hadoop
- Neo raises $10.6M for Neo4j as graph DBs take off
- NoSQL Database Company Neo Technology Raises $10.6 Million
September 8th, 2011
Platfora works with existing Hadoop clusters, including Cloudera, MapR, and Amazon EMR, among others, and automatically turns these huge amounts of data into dimensional and predictive dashboards, reports and insights. The company’s server architecture enables sub-second report delivery, analytics overlay, and drill down performance.
I wouldn’t be surprised if the guys building on top of Hadoop made a lot more money than those providing Hadoop itself.
July 5th, 2011
Curt Monash about Forthcoming Oracle appliances, based on information from Oracle’s earnings call (full transcript) last week. There will be an IMDB appliance based on TimesTen for high speed analytics, and a Hadoop appliance for MapReduce jobs, targetted at data preprocessing and feeding into Oracle. It really looks like Oracle is full steam ahead on the appliance strategy, and is also starting to embrace the MapReduce and massively parallel models. All of that is likely to be announced in more details at Oracle Open World.
May 9th, 2011
Reported and analysed by Tony Baer in OnStrategies Perspectives, and reported by Derrick Harris in GigaOm’s in EMC, NetApp Make It a Big Day for Big Data Star Hadoop, we learn that EMC is using the on-going EMC World conference to its potential, and is announcing that they’re growing the Database division with the decision to sell their own Hadoop distribution with value add management tools and integration. I expect to see more soon.
April 28th, 2011
In the long run, we also expect IBM to make a stab at Hadoop and related technologies by extending its InfoSphere offerings -– it can see Cloudera-Informatica and Cloudera-MicroStrategy raise it one with its own InfoSphere DataStage and Cognos offerings, before it even talks about partnerships. Today we saw a shot from left field – Yahoo which invented the technology – is now saying it might spin off its Hadoop business to go up against Cloudera, and potentially IBM. In a way, its closing the doors after the horses left the barn as the creator of Hadoop is now part of Cloudera.
For Yahoo, this would clearly be a shot out of its comfort zone, as it is not a tools company. But it is hungry for monetizing its intellectual property, even if that property has already been open sourced. It’s redolent of Sun striving to monetize Java and we all know how that went. Obviously this will be an uphill battle for Yahoo, but at least this would be a spinoff so hopefully there won’t be distractions from the mother ship. Given Yahoo’s fortunes, we shouldn’t be surprised that they are now looking to maximize what they can get out of the family jewels.
More commercial offerings in NoSQL can only be a good thing.
March 29th, 2011
Good number of articles from Derrick Harris over at GigaOm rounding up the Structure Big Data Conference. First, there’s a look at Hadoop, Cloudera, and alternatives to Cloudera from IBM, DataStax, Hadapt etc. in As Big Data Takes Off, the Hadoop Wars Begin, and second there’s a piece about Why Big Data Startups Should Take a Narrow View:
[...] analyzing social media data is not the same, either in technique or in purpose, as analyzing user data to feed a recommendation engine for a site like Netflix. And herein lies the opportunity. [...] It’s a situation just begging for startups to fill the void between big data tools and actually using them for a particular task.
So where are the NoSQL startups targetting the financial industry?
March 26th, 2011
March 25th, 2011
They are said to be building a proprietary replacement for the Hadoop Distributed File System that’s allegedly three times faster than the current open-source version. It comes with snapshots and no NameNode single point of failure (SPOF), and is supposed to be API-compatible with HDFS, so it can be a drop-in replacement.
Lots of famous names in the company, which probably explains why they were able to raise $9m funding without ever shipping anything. I don’t know enough about them right now, but it’ll be interesting to see if they have enough of an advantage to keep an edge on plain Hadoop and HDFS over time, as the open source version catches up.