Practical 9

Execute queries in the Neo4j graph database, and use the Gephi tool to load CSV data, run graph statistics scripts, and display various graph layouts.


What is a Graph Database?

A graph database is a database structured to treat the relationships between data as being as important as the data itself. It is designed to hold data without constraining it to a predefined model. Instead, the data is stored the way we would first sketch it out, showing how each entity connects or relates to the others.
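
To make the idea concrete, here is a minimal sketch of a property graph in plain Python. It is not how Neo4j stores data internally; the class names, labels, and property values are assumptions chosen only to show that a relationship is a record in its own right, with a type and properties, rather than a foreign key.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    label: str                                   # e.g. "Person"
    properties: dict = field(default_factory=dict)


@dataclass
class Relationship:
    start: Node
    rel_type: str                                # e.g. "KNOWS"
    end: Node
    properties: dict = field(default_factory=dict)


# Illustrative data only: two people and one relationship between them.
alice = Node("Person", {"name": "Alice"})
bob = Node("Person", {"name": "Bob"})
knows = Relationship(alice, "KNOWS", bob, {"since": 2020})


def neighbours(node, relationships):
    """Follow outgoing relationships from a node to the nodes they point at."""
    return [r.end for r in relationships if r.start is node]


friends = neighbours(alice, [knows])
print([n.properties["name"] for n in friends])
```

Answering "who does Alice know?" means following relationships from a node, which is the kind of question a graph database is built to answer.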


What is Neo4j?

Neo4j is an open-source, NoSQL, native graph database that provides applications with an ACID-compliant transactional backend. Development began in 2003, and it has been publicly available since 2007. The source code, written in Java and Scala, is available free of charge on GitHub, and Neo4j can also be downloaded as a user-friendly desktop application. Neo4j comes in both a Community Edition and an Enterprise Edition.



Difference between Graph and Relational Database




  1. Select a database to work on, or create one if none exists.
  2. Click Start, then select Open browser from the Graph Apps menu.
  3. Run a query in the editor ($).
  4. Note and analyze patterns among the nodes.
  5. Check whether any relationships exist, as these are useful for analysis.
  6. Repeat steps 3, 4, and 5 for the other activities.
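
The same kind of query can also be run from a script instead of the browser editor. The sketch below assumes the official `neo4j` Python driver is installed and a local database is running. The Cypher statements, the `Person` label, the `KNOWS` relationship type, and the names are all hypothetical examples, not the queries used in this practical.

```python
# Hypothetical Cypher statements; labels, relationship types and values are
# illustrative only and should be replaced with the practical's own queries.
CREATE_QUERY = (
    "MERGE (a:Person {name: $a}) "
    "MERGE (b:Person {name: $b}) "
    "MERGE (a)-[:KNOWS]->(b)"
)
MATCH_QUERY = (
    "MATCH (a:Person)-[:KNOWS]->(b:Person) "
    "RETURN a.name AS source, b.name AS target"
)


def run_practical(uri, user, password):
    """Create one sample relationship, then return all KNOWS pairs."""
    # Imported lazily so this file still loads without the driver installed.
    from neo4j import GraphDatabase  # pip install neo4j

    driver = GraphDatabase.driver(uri, auth=(user, password))
    try:
        with driver.session() as session:
            # Step 3: run a query (here a write, then a read).
            session.run(CREATE_QUERY, a="Alice", b="Bob").consume()
            result = session.run(MATCH_QUERY)
            # Steps 4 and 5: collect the node pairs joined by a relationship.
            return [(record["source"], record["target"]) for record in result]
    finally:
        driver.close()
```

A call would look like `run_practical("bolt://localhost:7687", "neo4j", "<password>")`, where the URI is the usual default for a local instance and the credentials are whatever was set when the database was created.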















Gephi

Gephi is an open-source network analysis and visualization software package written in Java on the NetBeans platform.
The Gephi Toolkit project packages essential modules (Graph, Layout, Filters, IO…) into a standard Java library that any Java project can use. The toolkit is a single JAR that can be reused in new Java applications to automate tasks normally done in Gephi, for instance from a command-line program. Being able to use Gephi features in other Java applications opens up many further possibilities.


Import Dataset


No. of edges and Nodes
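
The counts Gephi reports can be cross-checked outside the tool. This is a minimal sketch, assuming the dataset is an edge-list CSV with `Source` and `Target` columns (the column names Gephi's spreadsheet importer expects for edges). The inline sample data is hypothetical and stands in for the real file.

```python
import csv
import io

# Hypothetical edge list; a real run would open the CSV that was imported into Gephi.
SAMPLE_CSV = """Source,Target
A,B
A,C
B,C
C,D
"""


def count_nodes_and_edges(csv_file):
    """Return (node_count, edge_count) for an edge-list CSV with Source/Target columns."""
    nodes = set()
    edge_count = 0
    for row in csv.DictReader(csv_file):
        nodes.add(row["Source"])
        nodes.add(row["Target"])
        edge_count += 1          # one CSV row is one edge
    return len(nodes), edge_count


node_count, edge_count = count_nodes_and_edges(io.StringIO(SAMPLE_CSV))
print(f"Nodes: {node_count}, Edges: {edge_count}")
```

Nodes that appear in no edge would be missed by this approach, so the totals can be lower than Gephi's if a separate nodes table was also imported.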



Initial Network


Degree


Degree Distribution
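
As a rough cross-check of these statistics, degree and degree distribution can be computed by hand from an edge list. The sketch below treats the graph as undirected and assumes `Source` and `Target` column names. The sample data is hypothetical, and the result is not guaranteed to match Gephi's report for directed graphs.

```python
import csv
import io
from collections import Counter

# Hypothetical edge list standing in for the imported dataset.
SAMPLE_CSV = """Source,Target
A,B
A,C
B,C
C,D
"""


def degrees(csv_file):
    """Count how many edge endpoints touch each node (undirected degree)."""
    degree = Counter()
    for row in csv.DictReader(csv_file):
        degree[row["Source"]] += 1
        degree[row["Target"]] += 1
    return degree


def degree_distribution(degree):
    """Map each degree value to the number of nodes that have it."""
    return dict(sorted(Counter(degree.values()).items()))


deg = degrees(io.StringIO(SAMPLE_CSV))
dist = degree_distribution(deg)
average_degree = sum(deg.values()) / len(deg)

print(dict(deg))
print(dist)
print(average_degree)
```

Plotting `dist` with degree on the x-axis and node count on the y-axis gives the same kind of chart as a degree distribution report.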










