William Lyon Software, technology, startups, etc...

Congressional PageRank - Analyzing US Congress With Neo4j and Apache Spark

This post shows how we can apply graph analytics to US Congressional data to find influential legislators in Congress. Using the Mazerunner open source graph analytics project we are able to use Apache Spark GraphX alongside Neo4j to run the PageRank algorithm on a collaboration graph of US Congress.

Introducing legis-graph - US Congressional Data With Govtrack and Neo4j

Interactions among members of any large organization are naturally a graph, yet the tools we use to analyze data about these organizations often ignore the graphiness of the data and instead map the data into structures (such as relational databases) that make taking advantage of the relationships in the data much more difficult when it comes time to analyze the data. Collaboration networks are a perfect example. So let's focus on one of the most powerful collaboration networks in the world, the US Congress.

Natural Language Processing With Neo4j - Mining Paradigmatic Word Associations

Discovering paradigmatic word associations is one of the core tasks of Natural Language Processing. In this post we explore a graph based method for mining paradigmatic word associations from a text corpus.

Content Recommendation From Links Shared on Twitter Using Neo4j and Python

Using article links shared on Twitter to determine a user's interests, we show how to build a personalized content recommender system to suggest other articles the user might be interested in.

Using Neo4j Spatial and Mapbox to search for businesses by location

We show how to load data from the Yelp Academic Dataset into Neo4j, then build a webapp to perform geospatial queries on the data and visualize the results on a map.