June 4th, 2010
Cloudera are looking at Considerations for Hadoop and BI in a little series of two articles (2nd part). BI tools traditionally were designed for small volumes of structured data where Hadoop generally stores data in complex formats at scale and processes data on read using MapReduce, so that can be quite a problem, and it’s good to see some guidance around this, because I know it’ll be one of the first questions when we look at it here too. Even though I guess our main interest would first be in storing large amount of non-relational data and query it with custom tools (i.e. MapReduce jobs), and BI only as an afterthought. BI tools is still how people think about this kind of problem.