The Estate Questions

Challenge

You have a shape for each of Module 1’s three questions. Now you put them to work on the questions that run the shop.

You will warm up on Dani’s car - where one shape is not enough - then zoom out to the estate: the patterns across all the bulletins and recalls, and the field codes the library never documented.

Warm-up: Dani’s car takes two shapes

The navigation and theme tools tell the agent what the documents say. The connections shape and a live warehouse query tell it what actually worked - the agent grounds the candidates in the documents, then reads the repair history from the warehouse.

The agent does not run a hand-written query. It retrieves the warehouse schema from the connections MCP (get_full_metadata_schema - every table with its foreign-key references, the join paths from Module 2), grounds the code in Neo4j, then writes and runs the SQL with run_sql.py, joining along those foreign keys.

text
A prompt for your coding agent
For VIN FAL20T20220002 with code P0301: what fixed this on similar
vehicles?

Watch it call the connections tools, write the SQL from the foreign keys, and run it. The candidate parts come from the bulletin sections that name the code; the warehouse says which of them actually held up.

The Right Fix

In the agent’s federated answer, which part fixed code P0301 on vehicles like Dani’s with zero comebacks?

  • ❏ IC-2042-A, the original ignition coil

  • ✓ IC-2042-B, the revised ignition coil

  • ❏ SP-1108, the spark plug set

  • ❏ CAT-5500, the catalytic converter

Hint

Rank by comebacks: the best-performing part is the one used often with none of those vehicles coming back.

Solution

The correct answer is IC-2042-B, the revised ignition coil.

It was used on 18 work orders for Falcon 2.0T vehicles diagnosed with P0301, and none of those vehicles came back. The original coil IC-2042-A was used 11 times and came back 6 times, and the spark-plug-only fix SP-1108 came back 6 times out of 29 - which is exactly the pattern bulletin TSB-22-301 warns about.

The estate: proving a negative

That was one car. The estate questions ask about the whole library at once - and the sharpest is the one a similarity tool can never answer. Documentation gaps:

text
A prompt for your coding agent
Are there mismatches between our documented procedures
and the actual problems in the field? What documentation is missing, and what
documentation isn't used?

A similarity tool cannot answer this honestly. Asked what is undocumented, it returns the nearest-looking sections and concludes the codes are "documented, just buried" - a confident, wrong claim, because similarity always returns something and can never prove absence. The tree shape enumerates: it walks every document and reports the codes with zero hits.

Verify it in the sandbox - the query is the proof

The check is the shape doing its job. Run this against the document graph; both codes return zero library hits:

cypher
UNWIND ['P0335', 'P0340'] AS code
OPTIONAL MATCH (s:Section)
  WHERE toLower(s.content) CONTAINS toLower(code)
RETURN code, count(s) AS library_hits

P0335 (crankshaft position sensor) and P0340 (camshaft position sensor) are high-comeback codes out in the field - and the library never documents them. That zero is the answer.

Proving a Negative

The agent enumerated the field codes against the document library. Which pair of fault codes is failing in the field but has zero documentation anywhere in the library?

  • ❏ P0301 and P0302, the cylinder misfire codes

  • ✓ P0335 and P0340, the crankshaft and camshaft position sensor codes

  • ❏ P0420, the catalyst efficiency code

  • ❏ U0121, the lost-communication-with-ABS code

Hint

Run the sandbox query from the lesson. The undocumented codes are the ones that return zero library hits - similarity search would have called them "documented, just buried," which is exactly the wrong answer.

Solution

The correct answer is P0335 and P0340 - the crankshaft and camshaft position sensor codes.

Both are high-comeback codes out in the field, yet not one of the library’s documents mentions either - the sandbox query returns zero hits for both. The other options are all well documented: the misfire codes anchor the ignition-coil bulletins, P0420 has its catalyst sections, and U0121 appears in the ABS material.

This is the question similarity retrieval gets confidently wrong. Asked what is undocumented, it returns the nearest-looking sections and reports the codes as "documented, just buried." It can never prove absence, because it always returns something. Walking the whole document structure - the tree shape - is what lets the agent say, truthfully, "there is nothing here."

The estate: coverage, not sampling

The other estate question - Morgan wants the common patterns across the whole library:

text
A prompt for your coding agent
What are the common patterns across our bulletins and recalls? How
many of our cars does each affect?

This is a coverage question, and it is where similarity retrieval breaks. Ask a top-k retriever for "patterns across all our notices" and it hands back the nearest handful - it samples, and it cannot tell you what it skipped. That is not a vendor flaw; it is what similarity retrieval is, and it is the engine under every lakehouse AI-search feature.

To feel the difference, run a plain keyword search for a broad theme first - your search.py over content_search returns the top matching sections, not the whole set. Then run themes.py: it clusters all 112 bulletins and recalls into the dozen-or-so recurring failure modes they actually represent. The shape covers the structure; the search samples it. Only then does the warehouse count make sense - the agent reads the warehouse on each pattern’s parts and codes to size the fleet exposure.

Connections at scale

The third Module 1 question - "how do these records relate?" - is the connections shape at scale. On AutoFix’s six-table warehouse, Text2SQL handles the joins fine; the failure only appears on a real warehouse with hundreds of look-alike, very wide tables, where the schema does not fit in a prompt and the model brute-forces metadata, picks the wrong table, and loops.

On independent enterprise benchmarks, models hit a *78.57% error rate on queries touching four or more tables*[1]. The connections shape retrieves the right table and the join paths by meaning before generating SQL, so the agent never scans the haystack. You built that shape in Module 2; at production scale, it is what keeps Text2SQL reliable and cheap.

Summary

The agent answered the estate questions by putting the shapes together:

  • Warm-up - two shapes, one answer: grounded the candidates in the documents, then read the warehouse for what held up

  • Absence - the document graph proved P0335/P0340 have zero documentation, a negative similarity cannot prove

  • Coverage - themes clustered all 112 notices into their recurring failure modes; similarity would have sampled a handful and missed the rest

  • Connection at scale - the connections shape keeps Text2SQL honest on warehouses too large to fit in a prompt


1. Independent Text2SQL benchmarks: Falcon: A Comprehensive Chinese Text-to-SQL Benchmark for Enterprise-Grade Evaluation (Luo et al., arXiv:2510.24762, 2025) reports 78.57% error on queries touching four or more tables - https://arxiv.org/abs/2510.24762; Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows (arXiv:2411.07763, ICLR 2025) reports frontier models at 10-17% on real enterprise schemas - https://arxiv.org/abs/2411.07763.

Chatbot

How can I help you today?

Data Model

Your data model will appear here.