Execution Modes and Configuration

Introduction

Most GDS algorithms support five execution modes. Each mode determines where your results go and how you can use them.

[Figure: diagram showing five execution modes connected to a central node]

Understanding these modes is essential for building effective GDS workflows.

What You’ll Learn

By the end of this lesson, you’ll be able to:

Choose the appropriate execution mode (stream, stats, mutate, write, estimate) for your task
Build efficient workflows by combining modes strategically
Configure algorithm parameters to control behaviour
Use estimate mode to plan resource requirements before running expensive algorithms

The Five Execution Modes

Mode	What it does
stream	Returns results to your query
stats	Returns summary statistics only
mutate	Stores results in the projection
write	Persists results to your database
estimate	Checks memory requirements

The Syntax Pattern

All algorithms follow the same pattern:

cypher

CALL gds.<algorithm>.<mode>( // (1)
  'graph-name', // (2)
  { configuration } // (3)
)
YIELD <results> // (4)
RETURN <what you want>

(1) Algorithm name and execution mode
(2) Name of your in-memory projection
(3) Optional configuration map
(4) Each mode yields different result fields

The mode determines what gets yielded and where results are stored.

Project a graph

cypher

MATCH (source:Actor)-[r:ACTED_IN]->(:Movie)<-[:ACTED_IN]-(target:Actor) // (1)
WITH gds.graph.project('actor-network', source, target) AS g // (2)
RETURN g.graphName, g.nodeCount, g.relationshipCount // (3)

(1) Match pairs of actors who appeared in the same movie
(2) Project the graph and assign the result to the variable g
(3) Return the graph name, node count, and relationship count from g
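Re-running the projection query typically fails if a graph named actor-network already exists in the graph catalog. The following is a minimal sketch of a reset-and-check step, assuming you are happy to discard the existing projection. The second argument to gds.graph.drop is failIfMissing; it is set to false here so the call does not raise an error when there is nothing to drop.

```cypher
// Drop the projection if it exists; with failIfMissing = false
// the call simply returns no rows when the graph is absent
CALL gds.graph.drop('actor-network', false)
YIELD graphName
RETURN graphName;

// After re-projecting, confirm what is in the graph catalog
CALL gds.graph.list('actor-network')
YIELD graphName, nodeCount, relationshipCount
RETURN graphName, nodeCount, relationshipCount;
```

Run the two statements separately, with the projection query in between.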

Stream Mode

Returns results directly in your query output. Nothing is stored.

cypher

CALL gds.pageRank.stream('actor-network', {}) // (1)
YIELD nodeId, score // (2)
RETURN gds.util.asNode(nodeId).name AS actor, score
ORDER BY score DESC
LIMIT 10

(1) Run PageRank in stream mode
(2) Results are yielded row by row to your query

Results exist only for the duration of your query.

When to Use Stream

Exploring results before deciding whether to store them
Running one-off analyses
Feeding results into other Cypher operations
Exporting to CSV or pandas via the Python driver

Stream is your go-to for exploration.
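As an illustration of feeding streamed results into other Cypher operations, the sketch below takes the ten highest-scoring actors and counts the movies each appeared in. It assumes the Actor, Movie, and ACTED_IN names used in the projection query and a name property on Actor nodes.

```cypher
CALL gds.pageRank.stream('actor-network', {})
YIELD nodeId, score
// Resolve the internal node id to the database node, keep the top 10
WITH gds.util.asNode(nodeId) AS actor, score
ORDER BY score DESC
LIMIT 10
// Continue with ordinary Cypher against the database
MATCH (actor)-[:ACTED_IN]->(m:Movie)
RETURN actor.name AS actor, score, count(m) AS movies
ORDER BY score DESC
```

Limiting before the MATCH keeps the follow-up lookup to ten actors rather than every node in the projection.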

Stats Mode

Runs the algorithm but returns only summary statistics—no individual node results.

cypher

CALL gds.louvain.stats('actor-network', {}) // (1)
YIELD communityCount, modularity, ranLevels // (2)
RETURN communityCount, modularity, ranLevels // (3)

(1) Run Louvain in stats mode, with no per-node results
(2) Yields aggregate metrics only
(3) Return the community count, modularity, and the number of levels the algorithm ran

When to Use Stats

Understanding overall algorithm behaviour
Testing configurations before committing
Checking community counts or score distributions
Quick iteration on parameter tuning

Stats is faster than streaming thousands of rows when you only need the summary.
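For checking how community sizes are distributed, Louvain's stats mode also yields a communityDistribution map. The sketch below pulls out a few of its entries; the aliases smallest, median, and largest are illustrative names, not part of the API.

```cypher
CALL gds.louvain.stats('actor-network', {})
YIELD communityCount, communityDistribution
// communityDistribution summarises community sizes (min, max, percentiles)
RETURN communityCount,
       communityDistribution.min AS smallest,
       communityDistribution.p50 AS median,
       communityDistribution.max AS largest
```

A very large gap between the median and the largest community is often a hint that one community is absorbing most of the graph.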

Mutate Mode

Stores results as properties in your projection, not in your database.

cypher

CALL gds.pageRank.mutate('actor-network', {
  mutateProperty: 'pageRankScore' // (1)
})
YIELD nodePropertiesWritten // (2)
RETURN nodePropertiesWritten // (3)

(1) Property name for storing results in the projection
(2) Yields the count of properties written to the projection
(3) Return the count of properties written to the projection

Results stay in memory until you drop the graph.

When to Use Mutate

Chaining algorithms (one algorithm’s output feeds into another)
Building ML feature pipelines
Keeping your database clean of intermediate results
Comparing multiple algorithm runs

Mutate is for workflows, not final outputs.
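One way to confirm that a mutated property lives only in the projection is to read it back with gds.graph.nodeProperty.stream and then look for it in the database. This sketch assumes the pageRankScore property from the mutate example has already been written to actor-network.

```cypher
// Read the mutated property back out of the projection
CALL gds.graph.nodeProperty.stream('actor-network', 'pageRankScore')
YIELD nodeId, propertyValue
RETURN gds.util.asNode(nodeId).name AS actor, propertyValue AS pageRankScore
ORDER BY pageRankScore DESC
LIMIT 10;

// The database itself is untouched by mutate
MATCH (a:Actor)
WHERE a.pageRankScore IS NOT NULL
RETURN count(a) AS actorsWithStoredScore;
```

The second statement should return 0 unless something else has written a property with that name to your Actor nodes.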

Write Mode

Persists results as properties in your Neo4j database.

cypher

CALL gds.pageRank.write('actor-network', {
  writeProperty: 'pageRank' // (1)
})
YIELD nodePropertiesWritten // (2)
RETURN nodePropertiesWritten // (3)

(1) Property name for persisting results to the database
(2) Yields the count of properties written to your database
(3) Return the count of properties written to your database

Results survive after dropping the projection.

When to Use Write

Making results available to applications
Sharing insights via dashboards
Preserving results for future queries
Avoiding re-running expensive algorithms

Write is for production outputs.
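Once written, the scores are ordinary node properties, so applications and dashboards can read them without GDS. A minimal sketch, assuming the pageRank property from the write example and a name property on Actor nodes:

```cypher
// Plain Cypher: no projection or GDS procedure involved
MATCH (a:Actor)
WHERE a.pageRank IS NOT NULL
RETURN a.name AS actor, a.pageRank AS pageRank
ORDER BY pageRank DESC
LIMIT 10
```

This query keeps working after the actor-network projection has been dropped.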

Estimate Mode

Tells you memory requirements without running the algorithm.

cypher

CALL gds.pageRank.write.estimate('actor-network', { // (1)
  writeProperty: 'pageRank'
})
YIELD nodeCount, relationshipCount, requiredMemory // (2)
RETURN nodeCount, relationshipCount, requiredMemory // (3)

(1) Append .estimate to any execution mode
(2) Yields graph size and memory requirements
(3) Return the node count, relationship count, and required memory

Each of the other four modes has an estimate variant: gds.<algorithm>.<mode>.estimate()

When to Use Estimate

Planning before running expensive algorithms
Checking if your heap can handle the operation
Deciding whether to sample or run on the full graph

Estimate before you commit to long-running operations.
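To compare an estimate against your available heap, the raw byte bounds are often easier to work with than the human-readable requiredMemory string. The sketch below estimates the stream variant of PageRank and returns both forms.

```cypher
CALL gds.pageRank.stream.estimate('actor-network', {})
YIELD requiredMemory, bytesMin, bytesMax
// requiredMemory is a formatted string; bytesMin/bytesMax are numeric bounds
RETURN requiredMemory, bytesMin, bytesMax
```

If bytesMax is close to or above the heap you have configured, that is a signal to consider a smaller projection or more memory before running the algorithm itself.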

A Typical Workflow

Estimate — check memory requirements
Stats — understand the distribution, test configurations
Stream — explore specific results
Mutate — chain algorithms together
Write — persist final results

You won’t always use all five, but knowing when to use each makes your workflow efficient.

Algorithm Configuration

Most algorithms accept a configuration map:

cypher

CALL gds.pageRank.stream('actor-network', {
  maxIterations: 40, // (1)
  dampingFactor: 0.95 // (2)
})

(1) Maximum number of iterations the algorithm will run; it may stop earlier if the scores converge
(2) Probability of following a link rather than making a random jump

Configuration controls how the algorithm behaves.

Universal Configuration Options

Available across most algorithms:

concurrency — parallel threads (default: 4)
nodeLabels — filter to specific node types
relationshipTypes — filter to specific relationship types
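The sketch below sets all three options on a PageRank run. The projection query earlier in this lesson does not pass label or type information, so the example assumes only the wildcard '*' is available for filtering; naming a specific label such as Actor only works when the projection carries that label.

```cypher
CALL gds.pageRank.stream('actor-network', {
  concurrency: 2,            // use two threads instead of the default
  nodeLabels: ['*'],         // '*' selects every node in the projection
  relationshipTypes: ['*']   // '*' selects every relationship in the projection
})
YIELD nodeId, score
RETURN gds.util.asNode(nodeId).name AS actor, score
ORDER BY score DESC
LIMIT 10
```

Because '*' is what the filters fall back to when omitted, this run should behave like the default apart from the lower concurrency.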

Algorithm-Specific Options

Each algorithm has unique parameters:

PageRank:

maxIterations — how many passes (default: 20)
dampingFactor — probability of following links (default: 0.85)

Louvain/Leiden:

maxLevels — hierarchy depth (default: 10)

You’ll learn specific parameters as you use each algorithm.

Configuration Strategy

Start with defaults — run without configuration
Use stats to test — check distributions and counts
Adjust parameters — based on what you observe
Validate results — do they answer your question?

Don’t over-configure early. Defaults are sensible starting points.
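The "use stats to test" step can be sketched as a single query that tries several values of one parameter and compares the summaries. The damping values below are arbitrary illustrations, and maxScore is an illustrative alias.

```cypher
// Run PageRank in stats mode once per candidate damping factor
UNWIND [0.75, 0.85, 0.95] AS damping
CALL gds.pageRank.stats('actor-network', { dampingFactor: damping })
YIELD ranIterations, didConverge, centralityDistribution
RETURN damping, ranIterations, didConverge,
       centralityDistribution.max AS maxScore
ORDER BY damping
```

If didConverge is false for a setting, raising maxIterations is usually the next thing to try before drawing conclusions from the scores.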

Summary

Five execution modes control where results go:

Stream — view results (exploration)
Stats — summary only (testing)
Mutate — store in projection (chaining)
Write — persist to database (production)
Estimate — check memory (planning)

Configuration parameters control algorithm behaviour. Start with defaults, use stats to test, then tune as needed.

In the next lesson, you’ll learn how projection configuration—undirected relationships and weights—affects algorithm behaviour.
