Tutorial: Spark application architecture and clusters

Before you begin your journey as an Apache Spark programmer, you should have a solid understanding of the Spark application architecture and how applications are executed on a Spark cluster. This article closely examines the components of a Spark application, looks at how these components work together, and looks at how Spark applications run on… Continue reading Tutorial: Spark application architecture and clusters

Review: MXNet deep learning shines with Gluon

When I reviewed MXNet v0.7 in 2016, I felt that it was a promising deep learning framework with excellent scalability (nearly linear on GPU clusters), good auto-differentiation, and state-of-the-art support for CUDA GPUs. I also felt that it needed work on its documentation and tutorials, and needed a lot more examples in its model zoo.… Continue reading Review: MXNet deep learning shines with Gluon

IDG Contributor Network: The future is cloudy, with a chance of success

I would have titled this post, “How to be a rainmaker in the cloud,” except the term rainmaker often refers to the selling process, which is already succeeding, and that success is a key contributing factor to why so many cloud initiatives are all wet. If nothing else, the popularity of cloud services has made… Continue reading IDG Contributor Network: The future is cloudy, with a chance of success

Real-time data processing with data streaming: new tools for a new era

Today, there are many data sources—such at IoT devices, user interaction events from mobile applications, financial service transactions, and health monitoring systems—that broadcast critical information in real time. Developers working with these data sources need to think about the architecture to capture real time streaming data at varying scales and complexities. To read this article in… Continue reading Real-time data processing with data streaming: new tools for a new era

Bossies 2018: The Best of Open Source Software Awards

Remember Snort? Or Asterisk? Or Jaspersoft or Zimbra? Heck, you might still be using them. All of these open source champions—InfoWorld Best of Open Source Software Award winners 10 years ago—are still going strong. And why not? They’re still perfectly useful. Ten years ago these tools were among the best answers to pressing needs in… Continue reading Bossies 2018: The Best of Open Source Software Awards

The best open source software for data storage and analytics

The best open source software for data storage and analytics Image by IDG Nothing is bigger these days than data, data, data. We have more data than ever before, and we have more ways to store and analyze it—SQL databases, NoSQL databases, distributed OLTP databases, distributed OLAP platforms, distributed hybrid OLTP/OLAP platforms. Our 2018 Bossie… Continue reading The best open source software for data storage and analytics