Other Import Options

LOAD CSV is a convenient tool for importing data into Neo4j; there are however some considerations:

  • You may need to write Cypher statements to transform the data into the desired format. Depending on the data complexity, this can be time-consuming.

  • LOAD CSV may not be suitable for large data sets. The definition of large data sets depends on multiple factors, including the data structure, the use case, and the environment. A guideline of over 10 million rows is a good indicator for considering other import options.

  • You may need to import data from multiple sources, and LOAD CSV may not be the best option for all of them.

  • Complex business rules may require integration to other systems or data sources.

There are a multitude of options for importing data into Neo4j. The following sections provide an overview of some of the possibilities.

Neo4j Data Importer

The Neo4j Data Importer is a UI-based data importing tool. The data importer is ideal to get started quickly with testing and prototyping. However, there are limited options for data transformation, and it is unsuitable for large data sets.

The Neo4j Data Importer UI

neo4j-admin

The neo4j-admin import command line interface supports importing large data sets. neo4j-admin import converts CSV files into the internal binary format of Neo4j and can import millions of rows within minutes.

You need to structure the import files in a specific way, and the database needs to be offline when the import is run.

ETL (Extract, Transform, Load) Tool

An ETL tool, for example Apache Hop, is a good choice for importing data from multiple sources. ETL tools generally support various data sources, can transform data into the desired format, and have visualization tools.

Custom integration using Neo4j drivers

Building a custom application to load data into the graph database is a good option if you have complex business rules or need to integrate with other systems. There are several GraphAcademy courses for developers where you can learn how to build applications using Neo4j drivers.

Check Your Understanding

1. LOAD CSV vs. neo4j-admin

True or False - The neo4j-admin import command line interface is more flexible than the LOAD CSV Cypher clause.

  • ❏ True

  • ✓ False

Hint

To use the neo4j-admin import command line interface, you need to structure the import files in a specific way.

Solution

The statement is False - to use the neo4j-admin import command line interface, you need to structure the import files in a specific way.

Summary

In this lesson, you learned about other options for importing data into Neo4j.

Chatbot

Hi, I am an Educational Learning Assistant for Intelligent Network Exploration. You can call me E.L.A.I.N.E.

How can I help you today?