Introduction to Aura Graph Analytics

Introduction

Aura Graph Analytics is Neo4j’s on-demand cloud environment for running Graph Data Science workloads.

What You’ll Learn

By the end of this lesson, you’ll be able to:

Explain when Aura Graph Analytics is preferable to local GDS installations
Create and manage ephemeral GDS Sessions using the Python client
Project graphs remotely from AuraDB using gds.graph.project.remote()
Navigate the Cargo 2000 logistics dataset we’ll use throughout this module

The Business Question

We have 5 months of air cargo shipping data from a real freight forwarding company.

Our goal: Find inefficiencies in our routing decisions.

Are we using the fastest routes?
What alternatives exist that we’ve never tried?
How much time (and money) could we save?

This module teaches you to answer these questions using Graph Data Science.

Ephemeral compute sessions

Instead of installing GDS on a local database, you spin up ephemeral compute sessions that connect to your data source.

What you’ll learn

Using Aura Graph Analytics, we’ll:

Create and manage GDS Sessions
Project the logistics network remotely from AuraDB
Run pathfinding algorithms (Dijkstra, Yen’s K-Shortest Paths)
Compare historical routes against optimal paths
Write results back to the database

For centrality and community detection algorithms, see the Graph Algorithms module.

Theory vs practical

Each concept will be framed by our classic Movies dataset with follow-up practical lessons where you’ll apply your knowledge to a real-world logistics use-case.

When to Use Aura Graph Analytics

Aura Graph Analytics is useful when:

Running heavy GDS workloads without impacting your production database
Scaling compute resources independently from storage
Working with AuraDB instances that don’t have GDS installed locally
Needing temporary, cost-efficient analytics environments

GDS Sessions

A GDS Session is an ephemeral compute unit for running GDS workloads. There are three types:

Attached: Data source is an AuraDB instance
Self-managed: Data source is a self-managed Neo4j DBMS
Standalone: Data source is not Neo4j-based (e.g., pandas DataFrames)

Sessions automatically expire after inactivity (default: 1 hour, max: 7 days) and have a hard limit of 7 days total lifetime.

Get your credentials: Dashboard

To get your credentials, head to Aura:

Get your credentials: Account Page

Go to your account page, and then the API Keys tab to generate credentials.

The GdsSessions Object

With the GDS python client installed, create the entry point for session management:

python

from graphdatascience.session import GdsSessions, AuraAPICredentials # (1)

sessions = GdsSessions(
    api_credentials=AuraAPICredentials( # (2)
        client_id="my-aura-api-client-id",
        client_secret="my-aura-api-client-secret"
    )
)

Import session management classes from the GDS Python client
Authenticate with your Aura API credentials from the account page

Creating a GDS Session

Use get_or_create() to create or reconnect to a session:

python

gds = sessions.get_or_create(
    session_name="my-session", # (1)
    memory=SessionMemory.m_8GB,
    db_connection=DbmsConnectionInfo( # (2)
        aura_instance_id="mydbid",
        username="neo4j",
        password="my-password"
    ),
    ttl=timedelta(hours=2) # (3)
)

# Verify the connection
gds.verify_connectivity()

Session name allows you to reconnect to existing sessions
Point the session at your AuraDB instance as the data source
Time-to-live controls automatic expiration to manage costs

The `gds` object

The returned gds object works like the standard GraphDataScience client.

Remote Projection

Projecting a graph into a GDS Session uses gds.graph.project() with a Cypher query containing gds.graph.project.remote():

python

G_collab, result = gds.graph.project(
    "example-graph",    #Title of the graph
    """
    CALL {
      MATCH (a1:EntryPoint)-[r:HAS_HISTORICAL_ROUTE]->(:HistoricalRoute)-[:TERMINATES_AT]->(a2:Destination) # (1)
      WHERE a1 <> a2
      RETURN a1 AS source,
            a2 AS target,
            type(r) AS relType
    }
    RETURN gds.graph.project.remote(source, target, { # (2)
      sourceNodeLabels: labels(source),
      targetNodeLabels: labels(target),
      relationshipType: relType
    })
    """,
    undirected_relationship_types=["*"] # (3)
)

The CALL block defines which nodes and relationships to project from AuraDB
gds.graph.project.remote() sends the projection to the GDS Session
Undirected relationship types are specified outside the Cypher query

Managing Sessions

List active sessions:

python

sessions.list()

View your active sessions

In your Graph Analytics dashboard, you can see which sessions you are currently running.

the aura graph analytics dashboard with a session running.

Graph vs Session vs Projection

There are three containers here:

Main graph (AuraDB)
Aura Graph Analytics Session
Projected graph

Projection workflow

Your projection lives inside the session, entirely separate from the main graph.

a diagram showing the projection that lives inside the session

Delete a session

Delete a session when finished:

python

gds.delete()
# or
sessions.delete(session_name="my-session")

Deleting releases resources and stops costs. Data not written back is lost.

The Supply Chain Dataset

Throughout this module, we’ll analyze the Cargo 2000 freight forwarding dataset—real air cargo logistics data from IATA.

The graph contains:

EntryPoint nodes (source airports)
Destination nodes (delivery airports)
DepartureWarehouse, ArrivalWarehouse, TransferPoint nodes
RECEPTION, DEPARTURE, TRANSPORT, DELIVERY relationships

Each relationship has an effectiveMinutes property—actual time for that process step.

Run the notebook cells to get a sense of the data.

Understanding the Data Model

The Cargo 2000 dataset has two layers:

Raw Operations Layer: EntryPoint → RECEPTION → DepartureWarehouse → DEPARTURE → TransferPoint → TRANSPORT → ArrivalWarehouse → DELIVERY → Destination

Historical Summary Layer: EntryPoint -[:HAS_HISTORICAL_ROUTE]→ HistoricalRoute -[:TERMINATES_AT]→ Destination

The raw layer shows every step. The summary layer captures which origin-destination pairs we’ve actually shipped.

We’ll use both: raw operations for finding optimal routes, historical summaries for comparing against past decisions.

Summary

Aura Graph Analytics provides:

On-demand, ephemeral compute for GDS workloads
Separation of compute from your database
Automatic session expiration to control costs
The same algorithm API as the standard GDS client

Next: Creating sessions and running complete GDS workflows.

Graph Data Science in Practice

GDS Foundations

Community Detection for Fraud

GDS Python Client

Aura Graph Analytics

Introduction to Aura Graph Analytics

Introduction

What You’ll Learn

The Business Question

Ephemeral compute sessions

What you’ll learn

Theory vs practical

When to Use Aura Graph Analytics

GDS Sessions

Get your credentials: Dashboard

Get your credentials: Account Page

The GdsSessions Object

Creating a GDS Session

The `gds` object

Remote Projection

Managing Sessions

View your active sessions

Graph vs Session vs Projection

Projection workflow

Delete a session

The Supply Chain Dataset

Understanding the Data Model

Summary

Chatbot

Graph Data Science in Practice

GDS Foundations

Community Detection for Fraud

GDS Python Client

Aura Graph Analytics

Introduction to Aura Graph Analytics

Introduction

What You’ll Learn

The Business Question

Ephemeral compute sessions

What you’ll learn

Theory vs practical

When to Use Aura Graph Analytics

GDS Sessions

Get your credentials: Dashboard

Get your credentials: Account Page

The GdsSessions Object

Creating a GDS Session

The gds object

Remote Projection

Managing Sessions

View your active sessions

Graph vs Session vs Projection

Projection workflow

Delete a session

The Supply Chain Dataset

Understanding the Data Model

Summary

Chatbot

The `gds` object