The magazine--one of my favorites--introduced these innovative companies with this quote:
"In this new category, the editors of SD Times recognize that with exponential data growth comes exponential problem growth. It also creates a storage problem, a retrieval problem, and a problem in understanding it all, so organizations 'doing' Big Data get actionable information that keeps them a step or two ahead of the competition. These vendors are the ones who've tackled the giant data problem with aplomb."
And the winners are:
10gen developed the open-source MongoDB NoSQL Big Data database. IBM and 10gen recently announced that they will jointly craft a new standard for enterprise databases, aimed specifically at the mobile market. See this article about the shake-up happening in the enterprise database market.
The Apache Software Foundation is a community of developers working on open-source software. It is the poster child for successful open-source software.
If you are a software professional, you almost certainly use at least one of its projects: the Apache web server, the Ant application development tools, the ActiveMQ message queue, the Derby relational database, the Flex web browser application development platform, the Lucene search engine, the OpenOffice productivity suite, the Struts web application development framework, the Subversion source code management system, the Tomcat Java app server, and others.
Within the Big Data space, Apache has some of the leading technologies: the Cassandra database platform, HBase for read/write access to large data sets, the Mahout machine learning library, the Hadoop distributed computing platform, and the Pig analytics tool. For those not familiar with Hadoop, it was initially based on papers Google published around 2004 describing how it handled massive amounts of data.
Some of the original Big Data software developers from the Apache Software Foundation and companies such as Yahoo and Google quickly formed their own firms to take advantage of this emerging market (smart guys!). Cloudera is one such vendor specializing in Apache Hadoop.
DataStax was formed to specialize in the Apache Cassandra platform. You can read about them here.
FatCloud makes the FatDB NoSQL database for the Windows .NET platform.
Hortonworks is another leading firm specializing in the Apache Hadoop software.
Objectivity is the maker of a graph database and Big Data database tools.
Pentaho makes an open-source BI software platform and is actively pursuing the Big Data analytics space. As part of that offering, it recently announced its Instaview data visualization product.
Splunk is targeting the Big Data niche of machine-generated data. Websites, servers, mobile phones, and other devices are constantly spitting out huge amounts of data. Splunk wants to help organizations unlock the hidden value in this potentially actionable information.
As you read this list, the term "open-source" should have repeatedly jumped out at you. Either the editors of SD Times are in love with the OSS concept or there is a real revolution going on within the software industry (the real answer may be both, I suppose).
It is also interesting to see the Big Data power cluster of companies founded by individuals originally associated with Apache projects: Cloudera, Hortonworks, and DataStax.
Along with mobile application development, today's hot space for software professionals is Big Data.