Graph database vendors are broadening their applications by adding enterprise-focused features to help customers who are dealing with the burdens of huge troves of business-critical data.
In a move that is highlighting the growing maturity of the graph database marketplace, Neo4j recently unveiled its latest product, Neo4j for Graph Data Science, which is designed to make it easier for enterprises to use graph machine learning to expand their capabilities. Another vendor, Katana Graph, recently announced a collaboration with Intel to port and optimize its Katana Graph engine on Intel Xeon scalable processors, Xeon-based clusters and on Intel’s upcoming discrete GPUs.
Meanwhile, TigerGraph, recently unveiled the results of a new graph data management benchmark study that uses nearly 5TB of raw data on a cluster of machines to show the performance benefits enterprises can potentially receive using its graph database.
Graph databases are purpose-built to store and navigate what are called data “relationships,” according to documentation from Amazon Web Services. Relationships in graph databases are critical to bring disparate data together, using “nodes” to store data entities, and “edges” to store relationships between entities. Edges in graph databases include a start node, an end node, a type, and a direction. An edge can describe parent-child relationships, actions, ownership, and more. The number and kind of relationships a node in a graph database can have is unlimited.
Graph databases continue to gain new interest from enterprises because they connect all of a company’s internal and external datasets and pipelines for analysis, which can then create and deliver broad business insights that might not have been possible in the past.
Neo4j for Graph Data Science, Built for Enterprises
The latest version 1.4 of Neo4j for Graph Data Science specifically targets enterprise customers with graph-native machine learning functionality that’s being made available for business use. These capabilities are important for enterprises, according to Neo4j, because organizations don’t always know how to represent connected data for use in machine learning models. Version 1.4 includes graph embedding algorithms that learn the structure of a user’s graph, rather than relying on predetermined formulas to calculate specific features like centrality scores. Using AI, the updated product calculates the shape of the surrounding network for each piece of data inside of a graph, enabling far better machine learning predictions, according to the vendor. It can make predictions for fraud detection, tracking customer or patient journeys, drug discovery research and more.
For enterprises that are working to extract value from large stores of critical business data, graph databases can be an important tool, Mike Matchett, principal analyst of research firm, Small World Big Data, told EnterpriseAI.
“The recent news from graph database suppliers highlights the emergence of high performance, highly scalable graph solutions,” said Matchett. “This market is not just offering high-quality graph-based databases with native, advanced graph analysis support, but also provides solutions that can tackle very large data sets very quickly.”
These advances are allowing businesses to apply graph-based approaches to huge scales of network analysis, such as deeper healthcare models for testing and tracking for the transmission of diseases, and other very large data-driven challenges, he said. Such queries were formerly limited to highly-structured (OLAP) or parallel query (Hadoop) kinds of analyses, he added.
“In particular, AI, machine learning and graph ‘theory’ have a large logical connection, particularly in neural networking, but it hasn’t been practical until now to build integrated technology solutions that are both scalable and performant over real large data sets,” said Matchett.
“Graph databases are fast evolving to scale to larger data sets and process graph-based analytics, faster than ever before,” he added. “Graph representations can provide a far more natural and flexible way to model just about any data set [compared to data structured in tables/records with a fixed key-based schema], while intelligent graph analytics can analyze and extract insights about relationships embedded in that data that are otherwise simply not practical to process.”
Graph Databases Haven’t Yet Caught Traction with Enterprises
Graph-based approaches, including graph databases, remain underappreciated and overlooked by many enterprises, said Matchett. “But with increasing support for high performance and high scale, more barriers are coming down that might impede a much wider adoption. We think the opportunity for enterprises to leverage existing graph technologies to solve many pressing problems, including problems that are otherwise unsolvable, is already huge. The real barrier to adoption is one of awareness and training in graph approaches outside of the greenfield, advanced-thinking teams on the cutting edge.”
First published at Enterprise AI
- Neo4j advances machine learning compatibility for its graph database
- Top 5 Open Source Data Tools For Every Data Scientist
- What is a Graph Database? What is it Used For?
- Graph Database: What Is It and Why It Matters For Businesses?
- Analyst Commentary: Neo4j looks to break down barriers between graph analysis and data science