AI on Your Lakehouse: Context Comes in Shapes, Not Queries
Course Description
Your agent can reach your data but still can’t use it reliably. Vector search and Text2SQL each hand it a slice - similar passages, or a guessed SQL join - not the view to know what is truly relevant and how it connects. Answers come back confident but wrong. The problem isn’t caused by a bad model or bad query, but rather a lack of context, and context comes in shapes.
In this hands-on workshop, you give an agent three reusable shapes:
-
Connections (Paths) - how structured warehouse data joins and connects
-
Table of Contents (Trees & Links) - navigate documents
-
Themes (Communities) - surface patterns nobody named
These shapes answer the questions a modern stack gets quietly wrong - not just the single lookups it already handles most of the time, but the estate-level ones: the patterns across an entire document set, what your records show happening that the documentation never covers, and how things connect across a boundary no one query spans. Each needs the agent to cover, navigate, or follow the whole structure - not retrieve a similar slice.
You work the pattern on AutoFix Group, a fictional auto-repair chain: manuals, bulletins, and recalls as PDFs alongside vehicles, work orders, and parts in a warehouse. In a hosted Codespace, you and your coding agent build a service-advisor skill shape by shape, then put it to work across both halves - federating the warehouse with the graph instead of migrating it.
The pattern is portable by design: the warehouse here is BigQuery, but swap the connector and the same shapes work on Snowflake, Databricks, or anywhere your data lives.
Prerequisites
Before taking this workshop, you should have:
-
A basic understanding of AI agents
-
The ability to read and run basic SQL queries
-
Familiarity with data warehouse or lakehouse concepts (tables, keys)
For the hands-on path you will need a coding agent (this workshop assumes Claude Code) together with the Codespace - which provides read-only access to the BigQuery dataset and your Neo4j sandbox. The embedded sandbox window lets you inspect and query the graph by hand as you go.
Duration
2 hours core path, plus around 20 minutes of optional practice.
What you will learn
-
Why agent context is a problem of shapes, not queries - and where vector search and Text2SQL fall short
-
How to build the connections shape with neocarta - the warehouse join paths, read from BigQuery as metadata
-
How to navigate documents with trees and links
-
How to surface themes with Leiden community detection
-
How an agent answers the estate-level questions a modern stack gets wrong - covering the whole document library, proving what is undocumented, and connecting documents to records across the boundary
-
How to write graph queries via agentic coding - using neo4j-cypher-skill with a spec instead of hand-writing Cypher
-
How to build an agent skill a coding agent runs through the Neo4j CLI, and port the pattern to another lakehouse
This workshop includes
-
11 lessons
-
5 hands-on challenges
-
1 knowledge check
Get Support
If you find yourself stuck at any stage then our friendly community will be happy to help. You can reach out for help on the Neo4j Community Site, or head over to the Neo4j Discord server for real-time discussions.
Feedback
If you have any comments or feedback on this course you can email us on graphacademy@neo4j.com.