Graph neural networks for information extraction with PyTorch

Augusto Stoffel

Data Science Deep Learning Machine-Learning Natural Language Processing

See in schedule

In this talk, I will provide a primer on a class of deep learning models known as graph neural networks (GNNs). These models generalize the well-known convolutional neural networks (CNNs) ubiquitous in computer vision, and the approach of the talk will emphasize that analogy. Specifically, after a quick general recap on graphs, we will see how to think of an image as a graph where nodes represent pixels and edges represent the relationship of being a neighbor. After expressing the formulas for a CNN in terms of this graph, it is not hard to imagine ways to pass from the graph of an image to arbitrary graphs. This way, one naturally arrives at a basic GNN architecture. Other recent advances, for instance graph attention networks, can also be surveyed, time permitting.

We will then survey the existing Python implementations and supporting libraries, with a focus on the PyTorch framework and on the PyTorch Geometric library in particular.

Next, I will turn to applications in the field of NLP, and in particular information extraction. I will focus on the problem of understanding documents in tabular format. Unlike regular paragraphs of text, these documents contain more information than just an ordered sequence of words. We will see how graphs can be used to encode the spatial disposition of words, capturing enough information to allow training deep learning models with good accuracy and generalization capabilities.

Type: Talk (30 mins); Python level: Intermediate; Domain level: Intermediate


Augusto Stoffel

dida

I am a mathematician with a computers science background working as a developer in the field of machine learning. After studying computer engineering in Brazil and completing a PhD in mathematics at University of Notre Dame, USA, I was a postdoc in Bonn and Greifswald, Germany, doing research in the field of algebraic topology and its application as a foundation of quantum field theory. Then I moved to industry, and currently work as a machine learning scientist at dida Datenschmiede GmbH.