How much money did Cloudera raise

Ten promising big data startups

Stephan Wiesend

Big data is seen as a growth market, and a large number of new business areas are emerging here for new companies. We present ten interesting startups.

EnlargeTen promising big data startups
© iStockPhoto / kristian sekulic

While the IT industry is stagnating in areas such as hardware, the Big Data business area is a growth market. A study by IDC promises that worldwide sales in this business area will rise to 32.4 billion dollars by 2017 - with impressive annual growth rates of 27 percent. No wonder that financiers are showering startups in this area with money.

The developer of the successful NoSQL DBMS, MongoDB, has now collected over 300 million dollars from investors and the market leader for Hadoop distributions, Cloudera, has collected a total of 740 million dollars. In German job advertisements too, employers are increasingly looking for specialists with knowledge of Hadoop and NoSQL, but the big data market in this country is still quite small. So it is not surprising that the startups presented below mainly include US start-ups.

Hadoop hype is waning

The market for new companies has become narrower in recent years. In recent years, Hadoop, designed for computing processes with large amounts of data, has been a key issue. In the opinion of experts, however, the market for Hadoop distributions, which simplify its use, has already consolidated in 2012. The market is currently dominated by competitors Hortonworks and Cloudera as well as Amazon AWS. But there are other issues that remain for big data startups. Big data can serve as a basis for optimizing sales or for corporate security. One example in this area is Theta Ray. At the moment, the sensitive combination of new and old technologies, such as NoSQL and conventional DBMS, seems to be establishing itself as a new topic. In our overview we show ten interesting companies from the big data environment.

We start with a quick A-to-Z fact check using a series of pictures and then go into a little more detail about the individual startups ...


Framed offers a solution for the sensitive use of the collected customer data: The company promises not only to analyze previous customer relationships, but also to be able to predict the future behavior of a customer. The web service should, for example, identify those customers who are about to end the business relationship or who are about to make a purchase decision. In addition, the tool analyzes customers in order to enable customized promotional and marketing campaigns. An analytics provider is required to use it; Mix panel, for example, is directly supported. According to the company, customers are already "several hundred" companies.


The open source statistical software or programming language R, which is popular in universities, is an alternative to SAS and SPSS. Microsoft recently acquired Revolution Analytics, which specializes in R, and another startup is RStudio from Boston, which was founded in 2008 as an open source project. Since the end of 2013, RStudio, which enjoys a good reputation in open source circles, has been offering its software in a version for corporate customers. The main products are the platform-independent solutions RStudio IDE, a development environment for R as well as ggplot2 and dplyr. With Shiny, RStudio offers a framework for developing interactive web applications. The founder JJ Allaire is not unknown in specialist circles, so he developed the previously popular Cold Fusion.

EnlargeWith RStudio (web) applications can be developed in R.

Map-D Technologies

Nvidia is a rather unusual investor for a big data startup, the explanation is the technical basis of Map-D Technologies: No CPUs are used for the visualization of large amounts of data, but inexpensive standard graphics cards. The visualization platform can evaluate a database in real time and create interactive graphics. A special storage method for text solves a problem with the use of graphics cards - the low RAM.

The company's demo application, the "TweetMap", which evaluates Twitter feeds in real time and interactively, is particularly impressive. The founder Todd Mostak is known for his thesis at Harvard, "Social Media as Passive Polling: Using Twitter and Online Forums to Map Islamism in Egypt." Next to Nvidia, Google is the main investor. The utility for companies appears to be quite limited and the developer has also waived patent protection.


Parstream wants to create a database platform for the Internet of Things. The basis is the specialized DBMS ParStreamDB, which, in addition to conventional data, can manage streaming data in real time. According to its own information, the company, with teams in Cologne and Silicon Valley, is the market leader in real-time query time. Parstream supports JDBC and ODBC as interfaces as well as a C ++ API. COMPUTERWOCHE recently presented the software in detail. Customers include Mpreis, Etracker and BD4Travel.

EnlargeParstream wants to build a comprehensive analytics platform for the Internet of Things.

Boomerang Commerce

Amazon constantly adjusts the prices of millions of products in order to automatically react to offers from competitors. Boomerang Commerce wants to help its customers with this price competition and promises with a so-called Price Perception Index to ensure that the right products are priced low and suitable products are priced high. The system can test and simulate pricing strategies; Boomerang monitors the prices and possible stocks of a customer's competitors as SaaS in real time. Customers include Staples, Sears, DHGate, Radio Shar and Groupon Goods.