See the following -

3 Emerging Open Source Data Analytics Tools Beyond Apache Spark

On the data analytics front, profound change is in the air, and open source tools are leading many of the changes. Sure, you are probably familiar with some of the open source stars in this space, such as Hadoop and Apache Spark, but there is now a strong need for new tools that can holistically round out the data analytics ecosystem. Notably, many of these tools are customized to process streaming data...Streaming data analytics are needed for improved drug discovery...While Apache Spark grabs many of the headlines in the data analytics space, given billions of development dollars thrown at it by IBM and other companies, several unsung open source projects are also on the rise. Here are three emerging data analytics tools worth exploring:

How Apache Kafka is Powering a Real-Time Data Revolution

Two years ago, Neha Narkhede co-founded a company called Confluent to build on her team's work with Apache Kafka. In this interview, we talk about how lots of companies are deploying Kafka and how that has led to a very busy GitHub repo. Narkhede will keynote at All Things Open in Raleigh, NC next week. Q: What was it like leaving LinkedIn to start your own company? Narkhede: It was a great experience and a natural extension of the mission that my co-founders and I had been working on for the past several years—of bringing Apache Kafka and our vision for a new future for a company's data architecture built around streaming data to the forefront...