- Classes begin February, 22nd.
Networks are a fundamental tool for modeling complex social, technological, and biological systems. The explosive growth of social networks, the continually-expanding WWW-space, and the large-scale data availability in biological sciences have led to algorithms and systems that routinely handle voluminous data modeled as graphs. This course will focus on the analysis of massive networks which provide several computational, algorithmic, and modeling challenges.
Topics to be covered include:
In this course students will learn and exploit graph mining algorithms and tools to reveal insights on the social, technological, and natural worlds, by means of studying their underlying network structure and interconnections. A key element of the course will be the exposure to hands-on practice, through in-class and at-home assignments, with graph algorithms in the context of both Graph Compute Engines (Apache Spark GraphFrames) and Graph Databases (Neo4J). As the course progresses the students will familiarize themselves with increasingly more complicated graph mining tasks and will develop their own code to analyze real-world network datasets and come up with valuable insights.