Skip to content

Entity Communication Networks

Course Duration
2 hours
Categories
Intermediate

Welcome to the Entity Communication Networks course.

In this hands-on course, you will extract text from raw documents, parse it into structured records, and build a communication network in Neo4j. You’ll handle OCR noise, build rule-based and LLM-based parsers, train an NER model, and combine them into a hybrid pipeline that imports a metadata graph of entities and their communications.

By the end, you’ll also receive a prompt pack — an LLM project that examines your own data, references the techniques from this course, and builds a custom pipeline for any document source. The course teaches you the concepts; the pack applies them to your data.

Prerequisites

Before taking this course, you should have:

  • Completed the Cypher Fundamentals course, or equivalent Cypher experience

  • Basic understanding of Neo4j property graph concepts

  • Familiarity with Python

What you will learn

  • How to extract text from PDF documents using PyMuPDF with OCR fallback

  • How to parse unstructured document text into structured records

  • How to build rule-based parsers that handle OCR noise and layout variation

  • How to train and evaluate an NER model for entity extraction

  • How to use LLMs for parsing with token-efficient prompt strategies

  • How to combine templates, NER, and LLMs into a hybrid pipeline

  • How to normalize and import structured records into Neo4j

Duration

2 hours

This course includes

  • 23 lessons across 2 modules

  • 30 quiz questions

  • 14 Jupyter notebooks

  • 4,911 real Enron email PDFs

  • Pre-trained NER models and annotation data

  • A prompt pack for applying the techniques to your own data

Get Support

If you find yourself stuck at any stage then our friendly community will be happy to help. You can reach out for help on the Neo4j Community Site, or head over to the Neo4j Discord server for real-time discussions.

Feedback

If you have any comments or feedback on this course you can email us on graphacademy@neo4j.com.