Tutorials showing how to use OpenSanctions data in use cases.
Knowledge Graph Conference 2025 will be held on May 5-9 in NYC, at Cornell Tech on the beautiful grounds of Roosevelt Island.
Prashanth Rao and I will lead a tutorial workshop on Monday, May 5, 11:00-12:30 US Eastern
“Creating High-Quality Knowledge Graphs From Structured and Unstructured Data”
We’ll work through examples using slices of OpenSanctions and Open Ownership datasets:
- Run Senzing entity resolution to identify entities and relations among them.
- Build a graph in KĂązuDB using Polars.
- Show interactive visualizations in Jupyter notebooks with yWorks, and with the KĂązu Explorer UI.
There’s a helpful combination of open source tooling in the second and third steps: KùzuDB embedded graph data (can run on your smartphone!), Polars for scalable dataframes, yWorks visualizations within Jupyter notebooks, etc.
The materials are available on GitHub (to be shared at the workshop), which you can run on a laptop. This workshop will be held in-person at Cornell Tech, and also available for remote attendees.
PS: message me if you want a discount code for registration
Hope the workshop goes really well next week, @paco. Always appreciate the great work you are doing to show off use cases for Open Ownership data.
I had a good call with Prashanth and the Kuzu team a couple of months ago so will be keen to check out the materials once they are released on Github.
Thank you Stephen!
Here’s our repo for the tutorial at KGC next week:
Notably, here’s an integration of KùzuDB + yWorks + Jupyter showing an open source stack for graph analytics and interactive visualization, with each component running as in-process libraries. In other words, these follow more of a DuckDB pattern and can be launched simply as Python packages, not as platforms which require ops overhead, stateful deployments, etc. KùzuDB provides both graph and vector data, which is not memory-bound.
Any feedback is much appreciated!
Flagging here that @paco and Prashanth have now written up a walkthrough blog post about their Knowledge Graph Conference 2025 presentation:
Join us for “Graph Power Hour!” V1E7, with our special guest Stephen Abbott Pugh
Tue Jun 17, 09:00-10:00 US Pacific
Stephen will present about understanding beneficial ownership, along with live Q&A.
UBO analysis is at the heart of AML/KYC practices in FinCrime, risk-based analysis, sanctions, investigative journalism, and so on. The analysis, modeling, and calculations required are complicated due to trusts, nominees, etc.
Stephen will discuss uses cases, and also the BODS standard for semantic modeling, and I’ll add in about a recent tutorial we did using a slice of this data with Kùzu Inc.