You’d be forgiven for passing by the announcement of Apache Spark 2.3. After all, it’s a point release, isn’t it? Sure, there will be some bug fixes, maybe an improvement or two to the MLlib framework, maybe an extra operator or something, but nothing all that major. That will be saved for Apache Spark 3.0,… Continue reading What’s new in Apache Spark? Low-latency streaming and Kubernetes
The core premise of data democratization is that every one of us can use, interpret, and work with data to make organizational decisions. Yet a recent survey found only 20 percent of business leaders consider themselves data literate and fewer than 25 percent feel proficient at working with data. Those are surprising numbers because these… Continue reading IDG Contributor Network: Helping your workforce become data-literate
Josh Klahr is vice president of product management at AtScale. Over the past few years, there has been a subtle but significant shift in the way that data is structured in databases. Whereas yesterday’s databases were typically limited to storing data in rows and tables, today’s modern databases often make use of nested data structures.… Continue reading How to bring BI and analytics to modern nested data structures
About seven years ago, I developed a 13-part functional model for what a big data and analytics (BDA) solution needed to contain. Key components included things like data governance/master data management, integration of data sources, integration of BDA with the underlying IT infrastructure, and collaboration capabilities. Chief among them, however, was search. Think about it:… Continue reading IDG Contributor Network: The wonders of AI—or the shortcomings of search?
Today’s big data analytics market is quite different from the industry of even a few years ago. The coming decade will see change, innovation, and disruption ripple through every segment of this global industry. In the recently published annual update to its market study, Wikibon, the analyst group of SiliconAngle Media, found that the worldwide… Continue reading Big data analytics: The cloud-fueled shift now under way
In the era of digital transformation, omnichannel marketing, web-scale applications, and the internet of things (IoT), cost-effectively scaling the performance of existing applications is one of the most challenging issues facing enterprise architects and CTOs. In-memory data grids (IMDGs) meet this challenge, delivering massive speed and scalability gains without the need to rip and replace… Continue reading IDG Contributor Network: Adding speed and scalability to existing applications with in-memory data grids
PyTorch is a Python-based tensor computing library with high-level support for neural network architectures. It also supports offloading computation to GPUs. A product of Facebook’s AI research team and open sourced a little more than a year ago, PyTorch has fast become the first choice of many deep learning practitioners. To read this article… Continue reading PyTorch tutorial: Get started with deep learning in Python